# Grok Imagine Image > An image generation model by xAI designed to generate highly aesthetic images from text descriptions. ## Quick Reference - Model ID: grok-imagine-image - Creator: xAI - Status: active - Family: grok - Base URL: https://api.lumenfall.ai/openai/v1 ## Specifications - Max Resolution: 2048x2048 - Max Input Images: 0 - Input Modalities: text, image - Output Modalities: image - Supported Modes: Text to Image, Image Edit ## API Parameters The compiled parameter schema for this model is available via the API: `GET /v1/models/grok-imagine-image?schema=true`. ### Core Parameters - `prompt` (string) — REQUIRED: Text prompt for image generation. Modes: Text to Image, Image Edit ### Size & Layout - `size` (string): Image dimensions as WxH pixels (e.g. "1024x1024") or aspect ratio (e.g. "16:9"). Modes: Text to Image, Image Edit - `aspect_ratio` (string): Aspect ratio of the output image (e.g. "16:9", "1:1"). Modes: Text to Image, Image Edit - `resolution` (string): Output resolution tier (e.g. "1K", "4K"). Values: 1K. Modes: Text to Image, Image Edit ### Media Inputs - `image` (file) — REQUIRED: Input image(s) to edit. Modes: Image Edit ### Output & Format - `response_format` (string): How to return the image. Default: url. Values: url, b64_json. Modes: Text to Image, Image Edit - `output_format` (string): Output image format. Values: png, jpeg, gif, webp, avif. Modes: Text to Image, Image Edit - `output_compression` (integer): Compression level for lossy formats (JPEG, WebP, AVIF). Modes: Text to Image, Image Edit - `n` (integer): Number of images to generate. Default: 1. Modes: Text to Image, Image Edit ### Additional Parameters - `sync_mode` (boolean): If `True`, the media will be returned as a data URI and the output data won't be available in the request history.. Modes: Text to Image, Image Edit. Only available via fal ## Model Identifiers - Primary Slug: grok-imagine-image - Aliases: grok-imagine ## Tags image-generation, text-to-image, image-editing, commercial ## Available Providers ### xAI - Config Key: xai/grok-imagine-image - Provider Model ID: grok-imagine-image - Pricing: $0.0020/image, $0.020/image - Source: https://docs.x.ai/developers/models ### fal.ai - Config Key: fal/grok-imagine-image - Provider Model ID: xai/grok-imagine-image - Pricing: $0.020/image - Source: https://fal.ai/models/xai/grok-imagine-image ### fal.ai - Config Key: fal/grok-imagine-image-edit - Provider Model ID: xai/grok-imagine-image/edit - Pricing: $0.0020/image, $0.020/image - Source: https://fal.ai/models/xai/grok-imagine-image ## Performance Metrics Provider performance over the last 30 days. ### xai - Median Generation Time (p50): 8014ms - 95th Percentile Generation Time (p95): 11584ms - Average Generation Time: 8193ms - Success Rate: 88.6% - Total Requests: 905 - Time to First Byte (p50): 7322ms - Time to First Byte (p95): 10660ms ### fal - Median Generation Time (p50): 8291ms - 95th Percentile Generation Time (p95): 11617ms - Average Generation Time: 8759ms - Success Rate: 100.0% - Total Requests: 859 - Time to First Byte (p50): 8291ms - Time to First Byte (p95): 11613ms ## Arena Benchmarks ### Modern Clean Menu - Elo: 1313 - Record: 27W / 8L / 2T (37 battles) - Rank: #1 of 19 ### Neutral Expression to Genuine Smile - Elo: 1232 - Record: 18W / 16L / 2T (36 battles) - Rank: #6 of 14 ### Studio Ghibli Anime Style - Elo: 1225 - Record: 14W / 19L / 1T (34 battles) - Rank: #8 of 14 ### Bald man challenge - Elo: 1222 - Record: 22W / 10L / 5T (37 battles) - Rank: #6 of 15 ### Apollo 11: Journey to Tranquility - Elo: 1212 - Record: 15W / 8L / 1T (24 battles) - Rank: #4 of 19 ### Intricate Floral Mandala - Elo: 1205 - Record: 14W / 9L / 5T (28 battles) - Rank: #7 of 15 ### Night Sky Transformation - Elo: 1180 - Record: 19W / 12L / 6T (37 battles) - Rank: #11 of 16 ### Man and Car in California - Elo: 1178 - Record: 11W / 25L / 3T (39 battles) - Rank: #12 of 13 ### Heroic Super Hero Portrait - Elo: 1175 - Record: 8W / 16L / 4T (28 battles) - Rank: #11 of 21 ### Candid Street Photography - Elo: 1168 - Record: 6W / 15L / 0T (21 battles) - Rank: #13 of 24 ### Golden Hour Stroll - Elo: 1168 - Record: 17W / 26L / 0T (43 battles) - Rank: #10 of 13 ### Isometric Miniature Diorama Scenes - Elo: 1165 - Record: 10W / 15L / 3T (28 battles) - Rank: #14 of 21 ### Over-the-top cartoon caricature - Elo: 1164 - Record: 11W / 17L / 4T (32 battles) - Rank: #11 of 13 ### Geometric Composition - Elo: 1160 - Record: 8W / 13L / 0T (21 battles) - Rank: #18 of 22 ### Vintage Cafe Logo - Elo: 1155 - Record: 9W / 20L / 0T (29 battles) - Rank: #16 of 21 ### Fantasy Warrior - Elo: 1154 - Record: 8W / 16L / 2T (26 battles) - Rank: #16 of 21 ### Adorable Baby Animals in Sunny Meadow - Elo: 1060 - Record: 1W / 25L / 0T (26 battles) - Rank: #25 of 25 ### Victorian Greenhouse Oasis - Elo: 1060 - Record: 3W / 25L / 0T (28 battles) - Rank: #17 of 17 ## Use Cases & Category Performance ### Text Rendering (Text-to-Image) - Rank: #8 of 23 - Elo: 1238 - Record: 50W / 35L / 3T (88 battles) - Win Rate: 56.8% ### Portrait (Image Editing) - Rank: #7 of 15 - Elo: 1243 - Record: 40W / 26L / 5T (71 battles) - Win Rate: 56.3% ### Photorealism (Image Editing) - Rank: #9 of 16 - Elo: 1218 - Record: 87W / 89L / 14T (190 battles) - Win Rate: 45.8% ### Portrait (Text-to-Image) - Rank: #15 of 21 - Elo: 1156 - Record: 8W / 16L / 2T (26 battles) - Win Rate: 30.8% ### Photorealism (Text-to-Image) - Rank: #17 of 23 - Elo: 1167 - Record: 6W / 15L / 0T (21 battles) - Win Rate: 28.6% ### Product, Branding & Commercial (Text-to-Image) - Rank: #16 of 21 - Elo: 1161 - Record: 9W / 20L / 0T (29 battles) - Win Rate: 31.0% ### Anime (Image Editing) - Rank: #12 of 14 - Elo: 1176 - Record: 14W / 19L / 1T (34 battles) - Win Rate: 41.2% ## Image Gallery 22 images available for this model. Browse all at https://lumenfall.ai/models/xai/grok-imagine-image/gallery ### Curated Examples - [A breathtaking, ultra-wide cinematic orbital view of Earth at cosmic dawn, with the curved horizo...](https://assets.lumenfall.ai/YwauaVkQ-xT1etbFoMQzQ5N_U98z8woPvrR-Kq_nPI4/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/k02a90j2bv1kh9gxnu7ey4n48jub@jpeg) - [A majestic wide-angle cinematic view of a high-tech submersible descending into a vibrant deep-oc...](https://assets.lumenfall.ai/d7hXgLjAsszm95mI66QAMbjPUoMWwYCdZDhPCNKhe9U/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/cqc05g4uea4ghj62gaamz765tgor@jpeg) - [A vast, dimly lit ancient library carved into a cliffside at golden hour, with endless towering s...](https://assets.lumenfall.ai/Hc3Fzu9c4FfWqZyh4HyGGhjD_8Xji_Dlnzc86gcTvWQ/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/8kxx862x8y969a0cuojs300qkuzs@jpeg) - [A serene geothermal valley on a lush exoplanet at twilight, with steaming turquoise hot springs n...](https://assets.lumenfall.ai/kW_plP7_4bb5zGPktPkwUU1zVXTXPK_ILeKoJt06dzI/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/30d7b4l1clz6opojtep1349wv2nl@jpeg) ### Arena Competition Results - [Modern Clean Menu](https://assets.lumenfall.ai/F8oI_UvOT2BRkpazP8a-BD-0zgHX_0AGaj2ZMC-QySg/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/0ew81uuxmcmviwlsc0tn7rs2wk9e@jpeg): #1 of 19 (Elo 1313) - [Neutral Expression to Genuine Smile](https://assets.lumenfall.ai/07FZEEmXjeCxPF8UbF9XkJR9l20DnJ0DsPZIGB7x6u0/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/oov2chyz855i1eomqkuwxqbvjruu@jpeg): #6 of 14 (Elo 1232) - [Studio Ghibli Anime Style](https://assets.lumenfall.ai/7zZ0m44lqQVQs_xyMYYWFwXunqt5wCKGN25XothDw-s/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/5thq7jq5mjttlpxnr4r8o9z2zkn4@jpeg): #8 of 14 (Elo 1225) - [Bald man challenge](https://assets.lumenfall.ai/wP1hlV2a11_atTf_zaggaHopweFEsotmt167N44hANE/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/cfw24coz4zmk5xobneryz1hvmqqv@jpeg): #6 of 15 (Elo 1222) - [Apollo 11: Journey to Tranquility](https://assets.lumenfall.ai/-SLrQFkkcKKpysb9Aw-GeC0Tq4Ke87xO575kUGFpzyI/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/c4f02fp0frialpw8a41980wl77hq@jpeg): #4 of 19 (Elo 1212) - [Intricate Floral Mandala](https://assets.lumenfall.ai/PGRK6VLKJv9mhQ4x4F_KfTnnxklh_bVfYQIF82kpxUs/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/taaw32sgdg9otjd4wb99b1frsg2o@jpeg): #7 of 15 (Elo 1205) - [Night Sky Transformation](https://assets.lumenfall.ai/KznHcGUG4-xh1WRNB730wpWk_-NLA15qChs2w4Uctoc/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/b9ly39nbn0fnmepjdxhdinss0zli@jpeg): #11 of 16 (Elo 1180) - [Man and Car in California](https://assets.lumenfall.ai/Kt-X_a--Dc0BFS3eS252Bx751DPhVf1MBh07QKKlch4/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/eclf377ra6ty2l2ywa6x09nltvcu@jpeg): #12 of 13 (Elo 1178) - [Heroic Super Hero Portrait](https://assets.lumenfall.ai/3Pq-PSC-0ZvNlXu07qg0a41qGMwSneBczo8b5XASrKY/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/hg6sfs2sq4944r4qmjfz3g74a43j@jpeg): #11 of 21 (Elo 1175) - [Candid Street Photography](https://assets.lumenfall.ai/bdV5efAZjtoycbyxIWywwSefzYO-mC0dIBkzVbuM_Dw/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/d853zzf0lp599fmm1dckwaca5lc6@jpeg): #13 of 24 (Elo 1168) - [Golden Hour Stroll](https://assets.lumenfall.ai/sv3S-ZPwovdroVgyiuRYk9TzELUiP-a9pDEMjO3lmTM/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/1ta7tadm2l6zj9phdlu82o3hq9a8@jpeg): #10 of 13 (Elo 1168) - [Isometric Miniature Diorama Scenes](https://assets.lumenfall.ai/W77LOVTrsciMMJHjdqwghCM0QlSV38npTLJBjtLKhRQ/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/79oeweenklg79y8r7mwkd67lx09g@jpeg): #14 of 21 (Elo 1165) - [Over-the-top cartoon caricature](https://assets.lumenfall.ai/-wvgQncxokLI8fqx3XkTt2mCRE1FfTrsj4CdgBtqV84/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/24sk3c6mcear8slqrxi65vxagbup@jpeg): #11 of 13 (Elo 1164) - [Geometric Composition](https://assets.lumenfall.ai/PzZqZtabLr2yUvEu0dSF17Lft_bfKAqu9MZcpMMXA-0/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/q233e84ditmv7hxcca9q9ldj6hr4@jpeg): #18 of 22 (Elo 1160) - [Vintage Cafe Logo](https://assets.lumenfall.ai/5xd9jLiV_SsfvTNbOGKmV08v5zWQgqfrcOm-QR_YOw8/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/sdsz5q3h4fpebhgshe76soocavq1@jpeg): #16 of 21 (Elo 1155) - [Fantasy Warrior](https://assets.lumenfall.ai/h7rSTHGHEfdy1ZDy1GLR9lkaS2LbnNwiP9_FKpmoyqI/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/wzcq0j19ao929jugoetmxh3nwuqp@jpeg): #16 of 21 (Elo 1154) - [Adorable Baby Animals in Sunny Meadow](https://assets.lumenfall.ai/27TS8NY4hro9OUvnxgqclSeS8N39QziWGX9Pd9sXQfM/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/t5wkhzqdg0qu5bh0axdrve6li0e9@jpeg): #25 of 25 (Elo 1060) - [Victorian Greenhouse Oasis](https://assets.lumenfall.ai/IkEAEmWEqeJGtl2dWRgQ40eUbbQgVqdJq3FM351jnio/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/vzuxwrctq70upuxepiwax8vruq14@jpeg): #17 of 17 (Elo 1060) ## Example Prompt The following prompt was used to generate an example image in our playground: A serene geothermal valley on a lush exoplanet at twilight, with steaming turquoise hot springs nestled among glowing alien ferns and bioluminescent moss. Multiple small bird-like creatures and one capybara relax together in the shallow warm waters, half-submerged and completely at ease, as if this is their natural habitat. Twin moons rise above jagged crystal mountains, casting soft purple light across the scene. Gentle steam curls into the cool air, distant waterfalls glow faintly, warm amber and teal color grading, peaceful and wondrous atmosphere, 16:9 aspect ratio, hyper-realistic, serene cinematic aesthetic. ## Code Examples ### Text to Image (/v1/images/generations) #### cURL curl -X POST \ https://api.lumenfall.ai/openai/v1/images/generations \ -H "Authorization: Bearer $LUMENFALL_API_KEY" \ -H "Content-Type: application/json" \ -d '{ "model": "grok-imagine-image", "prompt": "", "size": "1024x1024" }' # Response: # { "created": 1234567890, "data": [{ "url": "https://...", "revised_prompt": "..." }] } #### JavaScript import OpenAI from 'openai'; const client = new OpenAI({ apiKey: 'YOUR_API_KEY', baseURL: 'https://api.lumenfall.ai/openai/v1' }); const response = await client.images.generate({ model: 'grok-imagine-image', prompt: '', size: '1024x1024' }); // { created: 1234567890, data: [{ url: "https://...", revised_prompt: "..." }] } console.log(response.data[0].url); #### Python from openai import OpenAI client = OpenAI( api_key="YOUR_API_KEY", base_url="https://api.lumenfall.ai/openai/v1" ) response = client.images.generate( model="grok-imagine-image", prompt="", size="1024x1024" ) # { created: 1234567890, data: [{ url: "https://...", revised_prompt: "..." }] } print(response.data[0].url) ### Image Edit (/v1/images/edits) #### cURL curl -X POST \ https://api.lumenfall.ai/openai/v1/images/edits \ -H "Authorization: Bearer $LUMENFALL_API_KEY" \ -F "model=grok-imagine-image" \ -F "image=@source.png" \ -F "prompt=Add a starry night sky to this image" \ -F "size=1024x1024" # Response: # { "created": 1234567890, "data": [{ "url": "https://...", "revised_prompt": "..." }] } #### JavaScript import OpenAI from 'openai'; import fs from 'fs'; const client = new OpenAI({ apiKey: 'YOUR_API_KEY', baseURL: 'https://api.lumenfall.ai/openai/v1' }); const response = await client.images.edit({ model: 'grok-imagine-image', image: fs.createReadStream('source.png'), prompt: 'Add a starry night sky to this image', size: '1024x1024' }); // { created: 1234567890, data: [{ url: "https://...", revised_prompt: "..." }] } console.log(response.data[0].url); #### Python from openai import OpenAI client = OpenAI( api_key="YOUR_API_KEY", base_url="https://api.lumenfall.ai/openai/v1" ) response = client.images.edit( model="grok-imagine-image", image=open("source.png", "rb"), prompt="Add a starry night sky to this image", size="1024x1024" ) # { created: 1234567890, data: [{ url: "https://...", revised_prompt: "..." }] } print(response.data[0].url) ## About ## Overview Grok Imagine Image is a high-fidelity text-to-image generation model developed by xAI. It is engineered to transform complex natural language prompts into visually striking, aesthetic imagery with a particular focus on realism and detailed composition. The model is distinctive for its adherence to user intent and its ability to render high-resolution outputs suitable for both creative exploration and commercial applications. ## Strengths * **Aesthetic Consistency:** The model is tuned to prioritize visually appealing compositions, lighting, and textures, reducing the need for extensive "prompt engineering" to achieve professional-looking results. * **Human Anatomy and Text Rendering:** It demonstrates improved accuracy in rendering human features—such as hands and eyes—and can incorporate legible, coherent text within generated images more reliably than many first-generation diffusion models. * **Prompt Adherence:** The model excels at interpreting multi-layered instructions, accurately placing specific objects and following spatial relationships defined in the text description. * **Processing Speed:** Optimized for rapid inference, the model generates high-resolution images quickly, making it suitable for iterative design workflows. ## Limitations * **Style Bias:** Because the model is optimized for "highly aesthetic" outputs, it may default to a polished or cinematic look even when a more raw or lo-fi aesthetic is requested. * **Niche Concept Gaps:** While strong on general concepts, the model may occasionally struggle with highly technical or obscure domain-specific imagery where training data density is lower. * **Image Editing Constraints:** While capable of image-to-image tasks, it may lack the granular "in-painting" controls found in specialized tools dedicated solely to localized image manipulation. ## Technical Background Grok Imagine Image is built upon a concentrated diffusion architecture designed by xAI, leveraging massive datasets to bridge the gap between semantic understanding and visual synthesis. Its training approach emphasizes "alignment" between the latent visual space and conversational language patterns, allowing it to understand prompts that are phrased naturally rather than as a string of keywords. ## Best For This model is ideal for creating marketing collateral, concept art, and high-quality social media assets where visual impact is the primary goal. It is also well-suited for rapid prototyping in UI/UX design and architectural visualization. Grok Imagine Image is available through Lumenfall’s unified API and playground, allowing developers to integrate high-end image generation into their applications with minimal overhead. ## Frequently Asked Questions ### How much does Grok Imagine Image cost? Grok Imagine Image starts at $0.02 per image through Lumenfall. Pricing varies by provider. Lumenfall does not add any markup to provider pricing. ### How do I use Grok Imagine Image via API? You can use Grok Imagine Image through Lumenfall's OpenAI-compatible API. Send requests to the unified endpoint with model ID "grok-imagine-image". Code examples are available in Python, JavaScript, and cURL. ### Which providers offer Grok Imagine Image? Grok Imagine Image is available through xAI and fal.ai on Lumenfall. Lumenfall automatically routes requests to the best available provider. ### What is the maximum resolution for Grok Imagine Image? Grok Imagine Image supports images up to 2048x2048 resolution. ## Links - Model Page: https://lumenfall.ai/models/xai/grok-imagine-image - About: https://lumenfall.ai/models/xai/grok-imagine-image/about - Providers, Pricing & Performance: https://lumenfall.ai/models/xai/grok-imagine-image/providers - API Reference: https://lumenfall.ai/models/xai/grok-imagine-image/api - Benchmarks: https://lumenfall.ai/models/xai/grok-imagine-image/benchmarks - Use Cases: https://lumenfall.ai/models/xai/grok-imagine-image/use-cases - Gallery: https://lumenfall.ai/models/xai/grok-imagine-image/gallery - Playground: https://lumenfall.ai/playground?model=grok-imagine-image - API Documentation: https://docs.lumenfall.ai