# Qwen Image 2512

> Improved version of Alibaba's Qwen image model with better text rendering, finer natural textures, and more realistic human generation.

## Quick Reference

- Model ID: qwen-image-2512
- Creator: Alibaba
- Status: active
- Family: qwen
- Base URL: https://api.lumenfall.ai/openai/v1

## Specifications

- Input Modalities: text
- Output Modalities: image
- Supported Modes: Text to Image, Image Edit

## API Parameters

The compiled parameter schema for this model is available via the API: `GET /v1/models/qwen-image-2512?schema=true`.

### Core Parameters

- `prompt` (string) - REQUIRED: Text prompt for image generation. Modes: Text to Image, Image Edit
- `negative_prompt` (string): Negative prompt to guide generation away from undesired content. Modes: Text to Image, Image Edit
- `seed` (integer): Random seed for reproducibility. Modes: Text to Image, Image Edit

### Size & Layout

- `size` (string): Image dimensions as WxH pixels (e.g. "1024x1024") or aspect ratio (e.g. "16:9"). Values: 1365x768, 768x1365, 1254x836, 836x1254, 887x1182, 1024x1024, 1183x887. Modes: Text to Image, Image Edit
- `aspect_ratio` (string): Aspect ratio of the output image (e.g. "16:9", "1:1"). Values: 9:16, 2:3, 3:4, 1:1, 4:3, 3:2, 16:9. Modes: Text to Image, Image Edit
- `resolution` (string): Output resolution tier (e.g. "1K", "4K"). Values: 1K. Modes: Text to Image, Image Edit

### Media Inputs

- `image` (file) - REQUIRED: Input image(s) to edit. Modes: Image Edit

### Output & Format

- `response_format` (string): How to return the image. Default: url. Values: url, b64_json. Modes: Text to Image, Image Edit
- `output_format` (string): Output image format. Values: png, jpeg, gif, webp, avif. Modes: Text to Image, Image Edit
- `output_compression` (integer): Compression level for lossy formats (JPEG, WebP, AVIF). Modes: Text to Image, Image Edit
- `n` (integer): Number of images to generate. Default: 1.
Modes: Text to Image, Image Edit

### Additional Parameters

- `cfg_scale` (number): Classifier-free guidance scale; higher values stick more closely to the prompt. Modes: Text to Image, Image Edit
- `strength` (number): How much to transform the input image: 0 keeps it unchanged, 1 fully regenerates from the prompt. Modes: Image Edit
- `acceleration` (string): The acceleration level to use. Values: high, none, regular. Modes: Text to Image. Only available via fal
- `disable_safety_checker` (boolean): Disable safety checker for generated images. Modes: Text to Image, Image Edit. Only available via replicate
- `enable_safety_checker` (boolean): If set to true, the safety checker will be enabled. Modes: Text to Image. Only available via fal
- `go_fast` (boolean): Use the model with additional optimizations for faster generation. Modes: Text to Image, Image Edit. Only available via replicate
- `height` (integer): Height of the generated image. Only used when aspect_ratio=custom. Must be a multiple of 16. Modes: Text to Image, Image Edit. Only available via replicate
- `num_inference_steps` (integer): The number of inference steps to perform. Modes: Text to Image, Image Edit
- `output_quality` (integer): Quality when saving the output images, from 0 to 100. 100 is best quality, 0 is lowest quality. Not relevant for .png outputs. Modes: Text to Image, Image Edit. Only available via replicate
- `sync_mode` (boolean): If `True`, the media will be returned as a data URI and the output data won't be available in the request history. Modes: Text to Image. Only available via fal
- `width` (integer): Width of the generated image. Only used when aspect_ratio=custom. Must be a multiple of 16. Modes: Text to Image, Image Edit.
Only available via replicate

## Model Identifiers

- Primary Slug: qwen-image-2512

## Dates

- Released: December 2025

## Tags

image-generation

## Available Providers

### fal.ai

- Config Key: fal/qwen-image-2512
- Provider Model ID: fal-ai/qwen-image-2512
- Pricing: $0.020/megapixel
- Source: https://fal.ai/models/fal-ai/qwen-image-2512

### Replicate

- Config Key: replicate/qwen-image-2512
- Provider Model ID: qwen/qwen-image-2512
- Pricing: $0.020/image
- Source: https://replicate.com/qwen/qwen-image-2512

## Performance Metrics

Provider performance over the last 30 days.

### fal

- Median Generation Time (p50): 6941ms
- 95th Percentile Generation Time (p95): 25624ms
- Average Generation Time: 13768ms
- Success Rate: 100.0%
- Total Requests: 11
- Time to First Byte (p50): 6937ms
- Time to First Byte (p95): 25314ms

### replicate

- Median Generation Time (p50): 12086ms
- 95th Percentile Generation Time (p95): 31235ms
- Average Generation Time: 16681ms
- Success Rate: 94.4%
- Total Requests: 18
- Time to First Byte (p50): 11623ms
- Time to First Byte (p95): 30872ms

## Arena Benchmarks

### Adorable Baby Animals in Sunny Meadow

- Elo: 1179
- Record: 12W / 17L / 3T (32 battles)
- Rank: #10 of 25

### Vintage Cafe Logo

- Elo: 1175
- Record: 9W / 9L / 0T (18 battles)
- Rank: #8 of 21

### Geometric Composition

- Elo: 1175
- Record: 12W / 17L / 1T (30 battles)
- Rank: #15 of 22

### Candid Street Photography

- Elo: 1160
- Record: 8W / 8L / 2T (18 battles)
- Rank: #15 of 24

### Isometric Miniature Diorama Scenes

- Elo: 1143
- Record: 8W / 10L / 0T (18 battles)
- Rank: #17 of 21

### Modern Clean Menu

- Elo: 1114
- Record: 5W / 15L / 4T (24 battles)
- Rank: #18 of 19

## Use Cases & Category Performance

### Product, Branding & Commercial (Text-to-Image)

- Rank: #7 of 21
- Elo: 1187
- Record: 9W / 9L / 0T (18 battles)
- Win Rate: 50.0%

### Photorealism (Text-to-Image)

- Rank: #16 of 23
- Elo: 1174
- Record: 8W / 8L / 2T (18 battles)
- Win Rate: 44.4%

### Text Rendering (Text-to-Image)
- Rank: #17 of 23
- Elo: 1195
- Record: 14W / 24L / 4T (42 battles)
- Win Rate: 33.3%

## Image Gallery

11 images available for this model. Browse all at https://lumenfall.ai/models/alibaba/qwen-image-2512/gallery

### Curated Examples

- [An epic, wide-angle cinematic shot of a majestic celestial phoenix made of swirling solar flares ...](https://assets.lumenfall.ai/0Zt36bF4PRAZNaMHcEpzfdJ5qKO5-kTJYPxWBEyAKUI/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/hwy8ssmsglxjasd3u24uw7lo6diy@jpeg)
- [Cinematic wide shot of a high-end, contemporary artisan perfume boutique. In the center of the fr...](https://assets.lumenfall.ai/cag6FlwBAKpKfFPlEPfbU2ah3TgbImMwXSsd2NvjGCw/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/szirml60n0ewsln4luinskcfglx8@jpeg)
- [A hyper-realistic close-up portrait of an elderly artisan woodcarver in a sun-drenched workshop. ...](https://assets.lumenfall.ai/-qij1qtu0SlIur7ASBqC74zEIhzOfwAo85kijryQHjA/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/9pv4thq1v0gnbswooyst8gj4ghqn@jpeg)
- [Cinematic wide shot of a master carpenter's sun-drenched workshop. In the foreground, a pair of w...](https://assets.lumenfall.ai/BUKjN6DYMo3IC8CUMlIJEU0pC9QPW6zUs7bKmdJuMwA/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/zw1xsfg4b8njlgxjtzmem1atnxvu@jpeg)
- [A sun-drenched, airy pottery studio with shelves of handmade ceramic vases and earthy textures. A...](https://assets.lumenfall.ai/myYFy64XaXfz2rk7CmyMv6RxTtkYiteSxbrBo4IRfo4/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/o2jfiiq32n0fek7b24genhuq6gu1@jpeg)

### Arena Competition Results

- [Adorable Baby Animals in Sunny Meadow](https://assets.lumenfall.ai/DUwaUZyPFCQ6egxKXdVaKRCZeB2IOKMil3JCP2Khrc0/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/w18kqgvdid91hbtjof7jv0r1a9ws@jpeg): #10 of 25 (Elo 1179)
- [Vintage Cafe Logo](https://assets.lumenfall.ai/RPXVZCyqio-AfsHzq6JDAe3Z7waUKrGv69SlDaF3qh4/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/kqk8rsdx7bljut4omkceyq1ph3kl@jpeg): #8 of 21 (Elo 1175)
- [Geometric Composition](https://assets.lumenfall.ai/mG1b_BBDH5cZu99_tWiN5Uaxdt_Woeq-EA7VLIuNH48/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/uz0it0l4dbt6i1dt3b6puk6jrbnk@jpeg): #15 of 22 (Elo 1175)
- [Candid Street Photography](https://assets.lumenfall.ai/kCykxzYqSfmaaBspHJ6k-PISCY41U7Pend6YxyfR9ts/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/bsh4hkl3tdo5v669bsx6wbie4xoc@jpeg): #15 of 24 (Elo 1160)
- [Isometric Miniature Diorama Scenes](https://assets.lumenfall.ai/3gTo9Vu1HOM-N0kehsQzdPGQmW0Xmi3bFoXG4Hm40xQ/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/hxj06pvbrf5x85tssy08vo7suzjl@jpeg): #17 of 21 (Elo 1143)
- [Modern Clean Menu](https://assets.lumenfall.ai/ZL1Pt0Y9p8h9YzRjiwV14-HQujCB1pXPHJp9voQnfCc/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/ks6uc55delamjaipxt0fm467heza@jpeg): #18 of 19 (Elo 1114)

## Example Prompt

The following prompt was used to generate an example image in our playground:

A sun-drenched, airy pottery studio with shelves of handmade ceramic vases and earthy textures. A focused artisan is molding clay on a spinning wheel. In the soft-focus background, a capybara sits peacefully on a woven rug near the open door.
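
As a minimal sketch, the example prompt above can be packed into a request body for the text-to-image endpoint together with the optional `size`, `seed`, and `negative_prompt` parameters listed under API Parameters. The helper name `build_generation_payload` and the client-side check against the listed values are illustrative assumptions; this page does not say how the API itself treats values outside those lists.

```python
import json

# Values copied from the "Size & Layout" section of this page.
DOCUMENTED_SIZES = {
    "1365x768", "768x1365", "1254x836", "836x1254",
    "887x1182", "1024x1024", "1183x887",
}
DOCUMENTED_ASPECT_RATIOS = {"9:16", "2:3", "3:4", "1:1", "4:3", "3:2", "16:9"}


def build_generation_payload(prompt, size="1024x1024", seed=None, negative_prompt=None):
    """Hypothetical helper: build a JSON-serializable text-to-image body."""
    if not prompt.strip():
        raise ValueError("prompt is required and must not be empty")
    # `size` is documented as accepting WxH pixels or an aspect ratio.
    if size not in DOCUMENTED_SIZES and size not in DOCUMENTED_ASPECT_RATIOS:
        raise ValueError(f"size {size!r} is not among the documented values")
    payload = {"model": "qwen-image-2512", "prompt": prompt, "size": size}
    if seed is not None:
        payload["seed"] = seed  # fixed seed for reproducibility
    if negative_prompt:
        payload["negative_prompt"] = negative_prompt
    return payload


EXAMPLE_PROMPT = (
    "A sun-drenched, airy pottery studio with shelves of handmade ceramic "
    "vases and earthy textures. A focused artisan is molding clay on a "
    "spinning wheel. In the soft-focus background, a capybara sits "
    "peacefully on a woven rug near the open door."
)

if __name__ == "__main__":
    body = build_generation_payload(EXAMPLE_PROMPT, size="1365x768", seed=42)
    print(json.dumps(body, indent=2))
```

The resulting dictionary can be sent as the JSON body of a `POST` to `/v1/images/generations`, or its keys passed as keyword arguments to `client.images.generate`.
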
## Code Examples

### Text to Image (/v1/images/generations)

#### cURL

    curl -X POST \
      https://api.lumenfall.ai/openai/v1/images/generations \
      -H "Authorization: Bearer $LUMENFALL_API_KEY" \
      -H "Content-Type: application/json" \
      -d '{
        "model": "qwen-image-2512",
        "prompt": "",
        "size": "1024x1024"
      }'
    # Response:
    # { "created": 1234567890, "data": [{ "url": "https://...", "revised_prompt": "..." }] }

#### JavaScript

    import OpenAI from 'openai';

    const client = new OpenAI({
      apiKey: 'YOUR_API_KEY',
      baseURL: 'https://api.lumenfall.ai/openai/v1'
    });

    const response = await client.images.generate({
      model: 'qwen-image-2512',
      prompt: '',
      size: '1024x1024'
    });
    // { created: 1234567890, data: [{ url: "https://...", revised_prompt: "..." }] }
    console.log(response.data[0].url);

#### Python

    from openai import OpenAI

    client = OpenAI(
        api_key="YOUR_API_KEY",
        base_url="https://api.lumenfall.ai/openai/v1"
    )

    response = client.images.generate(
        model="qwen-image-2512",
        prompt="",
        size="1024x1024"
    )
    # { created: 1234567890, data: [{ url: "https://...", revised_prompt: "..." }] }
    print(response.data[0].url)

### Image Edit (/v1/images/edits)

#### cURL

    curl -X POST \
      https://api.lumenfall.ai/openai/v1/images/edits \
      -H "Authorization: Bearer $LUMENFALL_API_KEY" \
      -F "model=qwen-image-2512" \
      -F "image=@source.png" \
      -F "prompt=Add a starry night sky to this image" \
      -F "size=1024x1024"
    # Response:
    # { "created": 1234567890, "data": [{ "url": "https://...", "revised_prompt": "..." }] }

#### JavaScript

    import OpenAI from 'openai';
    import fs from 'fs';

    const client = new OpenAI({
      apiKey: 'YOUR_API_KEY',
      baseURL: 'https://api.lumenfall.ai/openai/v1'
    });

    const response = await client.images.edit({
      model: 'qwen-image-2512',
      image: fs.createReadStream('source.png'),
      prompt: 'Add a starry night sky to this image',
      size: '1024x1024'
    });
    // { created: 1234567890, data: [{ url: "https://...", revised_prompt: "..." }] }
    console.log(response.data[0].url);

#### Python

    from openai import OpenAI

    client = OpenAI(
        api_key="YOUR_API_KEY",
        base_url="https://api.lumenfall.ai/openai/v1"
    )

    response = client.images.edit(
        model="qwen-image-2512",
        image=open("source.png", "rb"),
        prompt="Add a starry night sky to this image",
        size="1024x1024"
    )
    # { created: 1234567890, data: [{ url: "https://...", revised_prompt: "..." }] }
    print(response.data[0].url)

## About

## Overview

Qwen Image 2512 is an advanced text-to-image diffusion model developed by Alibaba, designed to generate high-fidelity visual content from natural language descriptions. Released as an iterative improvement within the Qwen model family, it focuses on bridging the gap between complex prompt comprehension and realistic visual execution. Its primary distinction lies in its upgraded ability to handle intricate details that typically challenge generative models, such as anatomical accuracy and legible typography.

## Strengths

* **Text Rendering Accuracy:** The model shows significant improvement in generating legible, correctly spelled text within images, making it suitable for graphic design mockups and signage.
* **Human Anatomy and Textures:** It excels at producing realistic human features, specifically addressing common issues with limb proportions and skin textures.
* **Fine-Grained Natural Detail:** The model renders complex organic textures, such as fur, foliage, and fabric weaves, with high clarity and reduced blurring.
* **Nuanced Prompt Adherence:** It demonstrates a strong capability to interpret multi-subject prompts and maintain spatial relationships defined in the text.

## Limitations

* **Compositional Drift:** Like many diffusion models, it may struggle with very long or contradictory prompts where later instructions override earlier ones.
* **Stylistic Consistency:** While highly capable at realism, it may require more specific prompting to achieve hyper-niche artistic styles compared to models fine-tuned exclusively for digital art.
* **Inference Latency:** Depending on the requested resolution and step count, generation times may be longer than those of smaller, distilled latent consistency models.

## Technical Background

Qwen Image 2512 is built upon the Qwen architecture family, utilizing a transformer-based diffusion framework that leverages Alibaba's proprietary linguistic models for text encoding. This version introduces refined training datasets that prioritize high-resolution image-text pairs, specifically targeting the improvement of fine textures and human geometry. The training approach emphasizes a balanced distribution between photographic realism and structured graphic elements.

## Best For

This model is best suited for professional workflows requiring high-fidelity realistic imagery, advertising assets involving specific text elements, and character design where anatomical precision is a priority. It is also an excellent choice for rapid prototyping of UI elements or environmental concept art.

Qwen Image 2512 is available for testing and integration through Lumenfall’s unified API and interactive playground, allowing developers to compare its output consistency against other state-of-the-art models.

## Frequently Asked Questions

### How much does Qwen Image 2512 cost?

Qwen Image 2512 starts at $0.02 per image through Lumenfall. Pricing varies by provider. Lumenfall does not add any markup to provider pricing.

### How do I use Qwen Image 2512 via API?

You can use Qwen Image 2512 through Lumenfall's OpenAI-compatible API. Send requests to the unified endpoint with model ID "qwen-image-2512". Code examples are available in Python, JavaScript, and cURL.

### Which providers offer Qwen Image 2512?

Qwen Image 2512 is available through fal.ai and Replicate on Lumenfall.
Lumenfall automatically routes requests to the best available provider.

## Links

- Model Page: https://lumenfall.ai/models/alibaba/qwen-image-2512
- About: https://lumenfall.ai/models/alibaba/qwen-image-2512/about
- Providers, Pricing & Performance: https://lumenfall.ai/models/alibaba/qwen-image-2512/providers
- API Reference: https://lumenfall.ai/models/alibaba/qwen-image-2512/api
- Benchmarks: https://lumenfall.ai/models/alibaba/qwen-image-2512/benchmarks
- Use Cases: https://lumenfall.ai/models/alibaba/qwen-image-2512/use-cases
- Gallery: https://lumenfall.ai/models/alibaba/qwen-image-2512/gallery
- Playground: https://lumenfall.ai/playground?model=qwen-image-2512
- API Documentation: https://docs.lumenfall.ai
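
Because the two providers listed under Available Providers bill differently ($0.020 per megapixel on fal.ai, $0.020 per image on Replicate), a rough cost comparison for a given output size may be useful. The sketch below assumes that one megapixel means 1,000,000 pixels and that no rounding or minimum charge applies; neither assumption is confirmed on this page, and `estimate_cost_usd` is a hypothetical helper name.

```python
FAL_USD_PER_MEGAPIXEL = 0.020    # from the fal.ai provider entry
REPLICATE_USD_PER_IMAGE = 0.020  # from the Replicate provider entry


def estimate_cost_usd(width, height, n=1):
    """Hypothetical helper: estimate the cost of n images per provider."""
    # Assumption: 1 megapixel = 1,000,000 pixels, billed without rounding.
    megapixels = (width * height) / 1_000_000
    return {
        "fal": round(FAL_USD_PER_MEGAPIXEL * megapixels * n, 6),
        "replicate": round(REPLICATE_USD_PER_IMAGE * n, 6),
    }


if __name__ == "__main__":
    for size in ("1024x1024", "1365x768"):
        width, height = (int(v) for v in size.split("x"))
        print(size, estimate_cost_usd(width, height))
```

Under these assumptions the per-megapixel price comes out slightly above the per-image price for sizes a little over one megapixel, so the gap between providers stays small for the documented 1K sizes.
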