# Grok Imagine Image

> An image generation model by xAI designed to generate highly aesthetic images from text descriptions.

## Quick Reference

- Model ID: grok-imagine-image
- Creator: xAI
- Status: active
- Family: grok
- Base URL: https://api.lumenfall.ai/openai/v1

## Specifications

- Max Resolution: 2048x2048
- Max Input Images: 0
- Input Modalities: text, image
- Output Modalities: image

## Model Identifiers

- Primary Slug: grok-imagine-image
- Aliases: grok-imagine

## Dates


## Tags

image-generation, text-to-image, image-editing, commercial

## Available Providers

### xAI

- Config Key: xai/grok-imagine-image
- Provider Model ID: grok-imagine-image
- Pricing:
  - source: official
  - currency: USD
  - components: [{"type" => "input", "metric" => "image", "unit_price" => 0.002}, {"type" => "output", "metric" => "image", "unit_price" => 0.02}]
  - source_url: https://docs.x.ai/developers/models
  - effective_at: 2025-06-01

### fal.ai

- Config Key: fal/grok-imagine-image
- Provider Model ID: xai/grok-imagine-image
- Pricing:
  - source: official
  - currency: USD
  - components: [{"type" => "input", "metric" => "image", "unit_price" => 0.002}, {"type" => "output", "metric" => "image", "unit_price" => 0.02}]
  - source_url: https://fal.ai/models/xai/grok-imagine-image
  - effective_at: 2026-01-29


## Performance Metrics

Provider performance over the last 30 days.

### xai

- Median Generation Time (p50): 6959ms
- 95th Percentile Generation Time (p95): 8266ms
- Average Generation Time: 7051ms
- Success Rate: 97.0%
- Total Requests: 775
- Time to First Byte (p50): 6869ms
- Time to First Byte (p95): 8160ms

### fal

- Median Generation Time (p50): 9965ms
- 95th Percentile Generation Time (p95): 66162ms
- Average Generation Time: 18175ms
- Success Rate: 86.7%
- Total Requests: 15
- Time to First Byte (p50): 9964ms
- Time to First Byte (p95): 65645ms


## Arena Benchmarks

### Modern Clean Menu

- Elo: 1315
- Record: 25W / 8L / 2T (35 battles)
- Rank: #1 of 19

### Neutral Expression to Genuine Smile

- Elo: 1211
- Record: 16W / 13L / 2T (31 battles)
- Rank: #5 of 13

### Studio Ghibli Anime Style

- Elo: 1202
- Record: 13W / 19L / 1T (33 battles)
- Rank: #7 of 13

### Intricate Floral Mandala

- Elo: 1194
- Record: 13W / 8L / 5T (26 battles)
- Rank: #5 of 15

### Apollo 11: Journey to Tranquility

- Elo: 1192
- Record: 14W / 8L / 1T (23 battles)
- Rank: #5 of 19

### Night Sky Transformation

- Elo: 1189
- Record: 16W / 9L / 6T (31 battles)
- Rank: #3 of 15

### Bald man challenge

- Elo: 1187
- Record: 19W / 8L / 5T (32 battles)
- Rank: #5 of 14

### Over-the-top cartoon caricature

- Elo: 1170
- Record: 11W / 16L / 4T (31 battles)
- Rank: #10 of 13

### Man and Car in California

- Elo: 1170
- Record: 8W / 17L / 3T (28 battles)
- Rank: #12 of 13

### Fantasy Warrior

- Elo: 1159
- Record: 8W / 12L / 2T (22 battles)
- Rank: #13 of 19

### Candid Street Photography

- Elo: 1156
- Record: 5W / 14L / 0T (19 battles)
- Rank: #13 of 22

### Golden Hour Stroll

- Elo: 1154
- Record: 13W / 22L / 0T (35 battles)
- Rank: #8 of 12

### Vintage Cafe Logo

- Elo: 1145
- Record: 8W / 19L / 0T (27 battles)
- Rank: #13 of 19

### Geometric Composition

- Elo: 1143
- Record: 7W / 12L / 0T (19 battles)
- Rank: #16 of 22

### Isometric Miniature Diorama Scenes

- Elo: 1140
- Record: 9W / 14L / 3T (26 battles)
- Rank: #15 of 19

### Heroic Super Hero Portrait

- Elo: 1084
- Record: 6W / 14L / 4T (24 battles)
- Rank: #17 of 19

### Adorable Baby Animals in Sunny Meadow

- Elo: 1059
- Record: 1W / 24L / 0T (25 battles)
- Rank: #23 of 23

### Victorian Greenhouse Oasis

- Elo: 1059
- Record: 3W / 20L / 0T (23 battles)
- Rank: #17 of 17


## Use Cases & Category Performance

### Text Rendering (Text-to-Image)

- Rank: #6 of 21
- Elo: 1233
- Record: 46W / 34L / 3T (83 battles)
- Win Rate: 55.4%

### Portrait (Image Editing)

- Rank: #6 of 14
- Elo: 1244
- Record: 35W / 21L / 5T (61 battles)
- Win Rate: 57.4%

### Photorealism (Image Editing)

- Rank: #10 of 16
- Elo: 1218
- Record: 72W / 69L / 14T (155 battles)
- Win Rate: 46.5%

### Portrait (Text-to-Image)

- Rank: #12 of 19
- Elo: 1165
- Record: 8W / 12L / 2T (22 battles)
- Win Rate: 36.4%

### Product, Branding & Commercial (Text-to-Image)

- Rank: #13 of 19
- Elo: 1160
- Record: 8W / 19L / 0T (27 battles)
- Win Rate: 29.6%

### Anime (Image Editing)

- Rank: #11 of 13
- Elo: 1169
- Record: 13W / 19L / 1T (33 battles)
- Win Rate: 39.4%

### Photorealism (Text-to-Image)

- Rank: #19 of 22
- Elo: 1162
- Record: 5W / 14L / 0T (19 battles)
- Win Rate: 26.3%


## Image Gallery

22 images available for this model.
- Curated examples: 4
  - "A breathtaking, ultra-wide cinematic orbital view of Earth at cosmic dawn, with the curved horizon glowing in golden ..."
  - "A majestic wide-angle cinematic view of a high-tech submersible descending into a vibrant deep-ocean abyss. Brilliant..."
  - "A vast, dimly lit ancient library carved into a cliffside at golden hour, with endless towering shelves of leather-bo..."
  - "A serene geothermal valley on a lush exoplanet at twilight, with steaming turquoise hot springs nestled among glowing..."
- Competition results: 18
  - Modern Clean Menu: #1 of 19 (Elo 1315)
  - Neutral Expression to Genuine Smile: #5 of 13 (Elo 1211)
  - Studio Ghibli Anime Style: #7 of 13 (Elo 1202)
  - Intricate Floral Mandala: #5 of 15 (Elo 1194)
  - Apollo 11: Journey to Tranquility: #5 of 19 (Elo 1192)
  - Night Sky Transformation: #3 of 15 (Elo 1189)
  - Bald man challenge: #5 of 14 (Elo 1187)
  - Over-the-top cartoon caricature: #10 of 13 (Elo 1170)
  - Man and Car in California: #12 of 13 (Elo 1170)
  - Fantasy Warrior: #13 of 19 (Elo 1159)
  - Candid Street Photography: #13 of 22 (Elo 1156)
  - Golden Hour Stroll: #8 of 12 (Elo 1154)
  - Vintage Cafe Logo: #13 of 19 (Elo 1145)
  - Geometric Composition: #16 of 22 (Elo 1143)
  - Isometric Miniature Diorama Scenes: #15 of 19 (Elo 1140)
  - Heroic Super Hero Portrait: #17 of 19 (Elo 1084)
  - Adorable Baby Animals in Sunny Meadow: #23 of 23 (Elo 1059)
  - Victorian Greenhouse Oasis: #17 of 17 (Elo 1059)

## Example Prompt

The following prompt was used to generate an example image in our playground:

A serene geothermal valley on a lush exoplanet at twilight, with steaming turquoise hot springs nestled among glowing alien ferns and bioluminescent moss. Multiple small bird-like creatures and one capybara relax together in the shallow warm waters, half-submerged and completely at ease, as if this is their natural habitat. Twin moons rise above jagged crystal mountains, casting soft purple light across the scene. Gentle steam curls into the cool air, distant waterfalls glow faintly, warm amber and teal color grading, peaceful and wondrous atmosphere, 16:9 aspect ratio, hyper-realistic, serene cinematic aesthetic.

## Code Examples

### Text to Image (Generation)

#### cURL

curl -X POST \
  https://api.lumenfall.ai/openai/v1/images/generations \
  -H "Authorization: Bearer $LUMENFALL_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "grok-imagine-image",
    "prompt": "A serene mountain landscape at sunset",
    "size": "1024x1024"
  }'

# Response:
# { "created": 1234567890, "data": [{ "url": "https://...", "revised_prompt": "..." }] }

#### JavaScript

import OpenAI from 'openai';

const client = new OpenAI({
  apiKey: 'YOUR_API_KEY',
  baseURL: 'https://api.lumenfall.ai/openai/v1'
});

const response = await client.images.generate({
  model: 'grok-imagine-image',
  prompt: 'A serene mountain landscape at sunset',
  size: '1024x1024'
});

// { created: 1234567890, data: [{ url: "https://...", revised_prompt: "..." }] }
console.log(response.data[0].url);

#### Python

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://api.lumenfall.ai/openai/v1"
)

response = client.images.generate(
    model="grok-imagine-image",
    prompt="A serene mountain landscape at sunset",
    size="1024x1024"
)

# { created: 1234567890, data: [{ url: "https://...", revised_prompt: "..." }] }
print(response.data[0].url)

### Image Editing

#### cURL

curl -X POST \
  https://api.lumenfall.ai/openai/v1/images/edits \
  -H "Authorization: Bearer $LUMENFALL_API_KEY" \
  -F "model=grok-imagine-image" \
  -F "image=@source.png" \
  -F "prompt=Add a starry night sky to this image" \
  -F "size=1024x1024"

# Response:
# { "created": 1234567890, "data": [{ "url": "https://...", "revised_prompt": "..." }] }

#### JavaScript

import OpenAI from 'openai';
import fs from 'fs';

const client = new OpenAI({
  apiKey: 'YOUR_API_KEY',
  baseURL: 'https://api.lumenfall.ai/openai/v1'
});

const response = await client.images.edit({
  model: 'grok-imagine-image',
  image: fs.createReadStream('source.png'),
  prompt: 'Add a starry night sky to this image',
  size: '1024x1024'
});

// { created: 1234567890, data: [{ url: "https://...", revised_prompt: "..." }] }
console.log(response.data[0].url);

#### Python

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://api.lumenfall.ai/openai/v1"
)

response = client.images.edit(
    model="grok-imagine-image",
    image=open("source.png", "rb"),
    prompt="Add a starry night sky to this image",
    size="1024x1024"
)

# { created: 1234567890, data: [{ url: "https://...", revised_prompt: "..." }] }
print(response.data[0].url)


## About

## Overview
Grok Imagine Image is a high-fidelity text-to-image generation model developed by xAI. It is engineered to transform complex natural language prompts into visually striking, aesthetic imagery with a particular focus on realism and detailed composition. The model is distinctive for its adherence to user intent and its ability to render high-resolution outputs suitable for both creative exploration and commercial applications.

## Strengths
*   **Aesthetic Consistency:** The model is tuned to prioritize visually appealing compositions, lighting, and textures, reducing the need for extensive "prompt engineering" to achieve professional-looking results.
*   **Human Anatomy and Text Rendering:** It demonstrates improved accuracy in rendering human features—such as hands and eyes—and can incorporate legible, coherent text within generated images more reliably than many first-generation diffusion models.
*   **Prompt Adherence:** The model excels at interpreting multi-layered instructions, accurately placing specific objects and following spatial relationships defined in the text description.
*   **Processing Speed:** Optimized for rapid inference, the model generates high-resolution images quickly, making it suitable for iterative design workflows.

## Limitations
*   **Style Bias:** Because the model is optimized for "highly aesthetic" outputs, it may default to a polished or cinematic look even when a more raw or lo-fi aesthetic is requested.
*   **Niche Concept Gaps:** While strong on general concepts, the model may occasionally struggle with highly technical or obscure domain-specific imagery where training data density is lower.
*   **Image Editing Constraints:** While capable of image-to-image tasks, it may lack the granular "in-painting" controls found in specialized tools dedicated solely to localized image manipulation.

## Technical Background
Grok Imagine Image is built upon a concentrated diffusion architecture designed by xAI, leveraging massive datasets to bridge the gap between semantic understanding and visual synthesis. Its training approach emphasizes "alignment" between the latent visual space and conversational language patterns, allowing it to understand prompts that are phrased naturally rather than as a string of keywords.

## Best For
This model is ideal for creating marketing collateral, concept art, and high-quality social media assets where visual impact is the primary goal. It is also well-suited for rapid prototyping in UI/UX design and architectural visualization. Grok Imagine Image is available through Lumenfall’s unified API and playground, allowing developers to integrate high-end image generation into their applications with minimal overhead.

## Frequently Asked Questions

### How much does Grok Imagine Image cost?

Grok Imagine Image starts at $0.02 per image through Lumenfall. Pricing varies by provider. Lumenfall does not add any markup to provider pricing.

### How do I use Grok Imagine Image via API?

You can use Grok Imagine Image through Lumenfall's OpenAI-compatible API. Send requests to the unified endpoint with model ID "grok-imagine-image". Code examples are available in Python, JavaScript, and cURL.

### Which providers offer Grok Imagine Image?

Grok Imagine Image is available through xAI and fal.ai on Lumenfall. Lumenfall automatically routes requests to the best available provider.

### What is the maximum resolution for Grok Imagine Image?

Grok Imagine Image supports images up to 2048x2048 resolution.

## Links

- Model Page: https://lumenfall.ai/models/xai/grok-imagine-image
- About: https://lumenfall.ai/models/xai/grok-imagine-image/about
- Providers, Pricing & Performance: https://lumenfall.ai/models/xai/grok-imagine-image/providers
- API Reference: https://lumenfall.ai/models/xai/grok-imagine-image/api
- Benchmarks: https://lumenfall.ai/models/xai/grok-imagine-image/benchmarks
- Use Cases: https://lumenfall.ai/models/xai/grok-imagine-image/use-cases
- Gallery: https://lumenfall.ai/models/xai/grok-imagine-image/gallery
- Playground: https://lumenfall.ai/playground?model=grok-imagine-image
- API Documentation: https://docs.lumenfall.ai