# Qwen Image > Alibaba's Qwen image model ## Quick Reference - Model ID: qwen-image - Creator: Alibaba - Status: active - Family: qwen - Base URL: https://api.lumenfall.ai/openai/v1 ## Specifications - Input Modalities: text - Output Modalities: image - Supported Modes: Text to Image, Image Edit ## API Parameters The compiled parameter schema for this model is available via the API: `GET /v1/models/qwen-image?schema=true`. ### Core Parameters - `prompt` (string) — REQUIRED: Text prompt for image generation. Modes: Text to Image ### Size & Layout - `size` (string): Image dimensions as WxH pixels (e.g. "1024x1024") or aspect ratio (e.g. "16:9"). Modes: Text to Image, Image Edit - `aspect_ratio` (string): Aspect ratio of the output image (e.g. "16:9", "1:1"). Modes: Text to Image, Image Edit - `resolution` (string): Output resolution tier (e.g. "1K", "4K"). Modes: Text to Image, Image Edit ### Media Inputs - `image` (file) — REQUIRED: Input image(s) to edit. Modes: Image Edit ### Output & Format - `response_format` (string): How to return the image. Default: url. Values: url, b64_json. Modes: Text to Image, Image Edit - `output_format` (string): Output image format. Values: png, jpeg, gif, webp, avif. Modes: Text to Image, Image Edit - `output_compression` (integer): Compression level for lossy formats (JPEG, WebP, AVIF). Modes: Text to Image, Image Edit - `n` (integer): Number of images to generate. Default: 1. Modes: Text to Image, Image Edit ## Model Identifiers - Primary Slug: qwen-image ## Dates - Released: August 2025 ## Tags image-generation ## Available Providers ### fal.ai - Config Key: fal/qwen-image - Provider Model ID: fal-ai/qwen-image - Pricing: $0.020/megapixel - Source: https://fal.ai/models/fal-ai/qwen-image ### Replicate - Config Key: replicate/qwen-image - Provider Model ID: qwen/qwen-image - Pricing: $0.025/image - Source: https://replicate.com/qwen/qwen-image ### Alibaba Cloud - Config Key: alibaba/qwen-image - Provider Model ID: qwen-image-plus - Pricing: $0.030/image - Source: https://modelstudio.console.alibabacloud.com/ap-southeast-1?tab=doc#/doc/?type=model&url=2840914_2&modelId=qwen-image-plus ## Image Gallery 4 images available for this model. Browse all at https://lumenfall.ai/models/alibaba/qwen-image/gallery ### Curated Examples - [A wide, cinematic shot of a meticulously detailed, handcrafted leather-bound journal lying on a r...](https://assets.lumenfall.ai/U6jPphhKw_bxl8dW8mVrCVekMugu1gTy1f7fxxb7HH4/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/q3esp9djd7p31j9czudq0qkhrcia@jpeg) - [A close-up, cinematic macro shot of an weathered leather craftsman's workbench. In sharp focus ar...](https://assets.lumenfall.ai/hoc9kJDXVEskksxBxGodikFuR30V3lh9v3h5RuiQD5U/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/fwyl12wcr70bzg3e7jvsg7ysy65r@jpeg) - [A hyper-realistic close-up of an elderly sculptor's hands working on a delicate clay bust. Fine d...](https://assets.lumenfall.ai/2GIzCedaxzwgJBSckVrs-tDYzRScilOdtNAKJFnbkr8/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/851ivvxbzh9ljkfxgg685swxw200@jpeg) - [A sun-drenched Mediterranean balcony overlooking the sea, overflowing with vibrant bougainvillea ...](https://assets.lumenfall.ai/0RC6PC-B3nxLv8oTsMXo3euVq4oaR_8WVm0VhWt_4Gc/rs:fit:1500:1500/plain/gs://lumenfall-prod-assets/98fcqefagktq6km7bp14x0ijgx62@jpeg) ## Example Prompt The following prompt was used to generate an example image in our playground: A sun-drenched Mediterranean balcony overlooking the sea, overflowing with vibrant bougainvillea and terracotta pots. In the soft background shadows near a wooden bench, a capybara naps peacefully while a breakfast spread sits on the foreground table. ## Code Examples ### Text to Image (/v1/images/generations) #### cURL curl -X POST \ https://api.lumenfall.ai/openai/v1/images/generations \ -H "Authorization: Bearer $LUMENFALL_API_KEY" \ -H "Content-Type: application/json" \ -d '{ "model": "qwen-image", "prompt": "", "size": "1024x1024" }' # Response: # { "created": 1234567890, "data": [{ "url": "https://...", "revised_prompt": "..." }] } #### JavaScript import OpenAI from 'openai'; const client = new OpenAI({ apiKey: 'YOUR_API_KEY', baseURL: 'https://api.lumenfall.ai/openai/v1' }); const response = await client.images.generate({ model: 'qwen-image', prompt: '', size: '1024x1024' }); // { created: 1234567890, data: [{ url: "https://...", revised_prompt: "..." }] } console.log(response.data[0].url); #### Python from openai import OpenAI client = OpenAI( api_key="YOUR_API_KEY", base_url="https://api.lumenfall.ai/openai/v1" ) response = client.images.generate( model="qwen-image", prompt="", size="1024x1024" ) # { created: 1234567890, data: [{ url: "https://...", revised_prompt: "..." }] } print(response.data[0].url) ### Image Edit (/v1/images/edits) #### cURL curl -X POST \ https://api.lumenfall.ai/openai/v1/images/edits \ -H "Authorization: Bearer $LUMENFALL_API_KEY" \ -F "model=qwen-image" \ -F "image=@source.png" \ -F "prompt=Add a starry night sky to this image" \ -F "size=1024x1024" # Response: # { "created": 1234567890, "data": [{ "url": "https://...", "revised_prompt": "..." }] } #### JavaScript import OpenAI from 'openai'; import fs from 'fs'; const client = new OpenAI({ apiKey: 'YOUR_API_KEY', baseURL: 'https://api.lumenfall.ai/openai/v1' }); const response = await client.images.edit({ model: 'qwen-image', image: fs.createReadStream('source.png'), prompt: 'Add a starry night sky to this image', size: '1024x1024' }); // { created: 1234567890, data: [{ url: "https://...", revised_prompt: "..." }] } console.log(response.data[0].url); #### Python from openai import OpenAI client = OpenAI( api_key="YOUR_API_KEY", base_url="https://api.lumenfall.ai/openai/v1" ) response = client.images.edit( model="qwen-image", image=open("source.png", "rb"), prompt="Add a starry night sky to this image", size="1024x1024" ) # { created: 1234567890, data: [{ url: "https://...", revised_prompt: "..." }] } print(response.data[0].url) ## About ## Overview Qwen Image is a text-to-image generation model developed by Alibaba Cloud’s Qwen team. It serves as the visual synthesis component of the broader Qwen ecosystem, designed to transform natural language prompts into high-fidelity imagery. The model is distinguished by its strong alignment with complex linguistic instructions and its ability to handle both English and Chinese prompts with high semantic accuracy. ## Strengths * **Multilingual Prompt Comprehension:** The model demonstrates superior performance in processing Chinese-language prompts, accurately capturing cultural nuances and idioms that Western-centric models often misinterpret. * **Compositional Accuracy:** It excels at spatial reasoning and multi-object placement, ensuring that elements described in a prompt maintain the correct relationship to one another. * **Text Rendering:** Qwen Image shows higher-than-average stability when generating legible text within images, such as signage, labels, or posters, reducing the common "gibberish" artifacts found in earlier diffusion models. * **Fine-Grained Detail:** The model is optimized for high-resolution output with a focus on realistic textures, particularly in skin tones, fabric weaves, and architectural materials. ## Limitations * **Anatomical Consistency:** Like many diffusion-based models, it can occasionally struggle with complex human anatomy, such as the specific number of digits on hands or complex overlapping limbs in action shots. * **Stylistic Range:** While versatile, the model tends toward a "digital photography" or "clean 3D render" aesthetic by default; achieving hyper-abstract or specific traditional art styles may require more intensive prompt engineering compared to models like Midjourney. ## Technical Background Qwen Image belongs to the Qwen family of models, leveraging a large-scale diffusion transformer architecture tailored for high-dimensional visual synthesis. The training process involves a multi-stage pipeline that utilizes high-quality captioned image datasets, with a specific focus on cross-modal alignment between the Qwen LLM's text embeddings and the visual latent space. This allows the model to inherit the deep semantic understanding found in Alibaba's flagship language models. ## Best For Qwen Image is particularly effective for marketing localization projects involving Chinese text, technical illustrations requiring precise object placement, and general-purpose asset generation for web and mobile interfaces. Its price point of $0.02 makes it a cost-effective choice for developers building high-volume image generation workflows. Qwen Image is available for immediate deployment and testing through **Lumenfall’s unified API and playground**, allowing you to integrate its generative capabilities into your applications with minimal setup. ## Frequently Asked Questions ### How much does Qwen Image cost? Qwen Image starts at $0.02 per image through Lumenfall. Pricing varies by provider. Lumenfall does not add any markup to provider pricing. ### How do I use Qwen Image via API? You can use Qwen Image through Lumenfall's OpenAI-compatible API. Send requests to the unified endpoint with model ID "qwen-image". Code examples are available in Python, JavaScript, and cURL. ### Which providers offer Qwen Image? Qwen Image is available through fal.ai, Replicate, and Alibaba Cloud on Lumenfall. Lumenfall automatically routes requests to the best available provider. ## Links - Model Page: https://lumenfall.ai/models/alibaba/qwen-image - About: https://lumenfall.ai/models/alibaba/qwen-image/about - Providers, Pricing & Performance: https://lumenfall.ai/models/alibaba/qwen-image/providers - API Reference: https://lumenfall.ai/models/alibaba/qwen-image/api - Benchmarks: https://lumenfall.ai/models/alibaba/qwen-image/benchmarks - Use Cases: https://lumenfall.ai/models/alibaba/qwen-image/use-cases - Gallery: https://lumenfall.ai/models/alibaba/qwen-image/gallery - Playground: https://lumenfall.ai/playground?model=qwen-image - API Documentation: https://docs.lumenfall.ai