# Qwen Image

> Alibaba's Qwen image model

## Quick Reference

- Model ID: qwen-image
- Creator: Alibaba
- Status: active
- Family: qwen
- Base URL: https://api.lumenfall.ai/openai/v1

## Specifications

- Input Modalities: text
- Output Modalities: image

## Model Identifiers

- Primary Slug: qwen-image

## Tags

image-generation

## Available Providers

### Replicate

- Config Key: replicate/qwen-image
- Provider Model ID: qwen/qwen-image
- Pricing:
  - source: official
  - currency: USD
  - components: [{"type": "output", "metric": "image", "unit_price": 0.025}]
  - source_url: https://replicate.com/qwen/qwen-image
  - effective_at: 2026-01-02

### fal.ai

- Config Key: fal/qwen-image
- Provider Model ID: fal-ai/qwen-image
- Pricing:
  - source: official
  - currency: USD
  - components: [{"type": "output", "metric": "megapixel", "unit_price": 0.02, "rounding_mode": "ceil", "rounding_step": 1}]
  - source_url: https://fal.ai/models/fal-ai/qwen-image
  - effective_at: 2025-12-29
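
The two provider entries above use different metrics (a flat price per image on Replicate, a per-megapixel price rounded up on fal.ai), so which one is cheaper depends on output size. The following is a minimal sketch of that arithmetic, not an official calculator. It assumes that one megapixel means 1,000,000 pixels (the pricing data does not define it) and that rounding is applied per image. The function names are illustrative and not part of any Lumenfall API.

```python
from decimal import Decimal

# Unit prices copied from the provider entries above (USD).
REPLICATE_PRICE_PER_IMAGE = Decimal("0.025")
FAL_PRICE_PER_MEGAPIXEL = Decimal("0.02")

# Assumption: 1 megapixel = 1,000,000 pixels; the source does not define it.
PIXELS_PER_MEGAPIXEL = 1_000_000


def replicate_cost(num_images: int) -> Decimal:
    """Flat per-image pricing: cost does not depend on resolution."""
    return REPLICATE_PRICE_PER_IMAGE * num_images


def fal_cost(width: int, height: int, num_images: int = 1) -> Decimal:
    """Per-megapixel pricing, rounding each image up to a whole megapixel."""
    # Integer ceiling division (rounding_mode "ceil", rounding_step 1).
    # Assumption: rounding is applied per image, not per request.
    billed_megapixels = -(-(width * height) // PIXELS_PER_MEGAPIXEL)
    return FAL_PRICE_PER_MEGAPIXEL * billed_megapixels * num_images
```

Under these assumptions a 1000x1000 image bills as 1 megapixel ($0.02) on fal.ai, while a 1024x1024 image (1,048,576 pixels) bills as 2 megapixels ($0.04). If the provider instead counts a megapixel as 2^20 pixels, 1024x1024 would bill as a single megapixel, so check the provider's pricing page before relying on these numbers.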


## Image Gallery

4 images available for this model.
- Curated examples: 4
  - "A wide, cinematic shot of a meticulously detailed, handcrafted leather-bound journal lying on a rustic wooden table i..."
  - "A close-up, cinematic macro shot of an weathered leather craftsman's workbench. In sharp focus are aged brass tools, ..."
  - "A hyper-realistic close-up of an elderly sculptor's hands working on a delicate clay bust. Fine details show the text..."
  - "A sun-drenched Mediterranean balcony overlooking the sea, overflowing with vibrant bougainvillea and terracotta pots...."

## Example Prompt

The following prompt was used to generate an example image in our playground:

A sun-drenched Mediterranean balcony overlooking the sea, overflowing with vibrant bougainvillea and terracotta pots. In the soft background shadows near a wooden bench, a capybara naps peacefully while a breakfast spread sits on the foreground table.

## Code Examples

### Text to Image (Generation)

#### cURL

```bash
curl -X POST \
  https://api.lumenfall.ai/openai/v1/images/generations \
  -H "Authorization: Bearer $LUMENFALL_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "qwen-image",
    "prompt": "A serene mountain landscape at sunset",
    "size": "1024x1024"
  }'

# Response:
# { "created": 1234567890, "data": [{ "url": "https://...", "revised_prompt": "..." }] }
```

#### JavaScript

```javascript
import OpenAI from 'openai';

const client = new OpenAI({
  // Read the key from the environment, matching the cURL example
  apiKey: process.env.LUMENFALL_API_KEY,
  baseURL: 'https://api.lumenfall.ai/openai/v1'
});

const response = await client.images.generate({
  model: 'qwen-image',
  prompt: 'A serene mountain landscape at sunset',
  size: '1024x1024'
});

// Response shape: { created: 1234567890, data: [{ url: "https://...", revised_prompt: "..." }] }
console.log(response.data[0].url);
```

#### Python

```python
import os

from openai import OpenAI

client = OpenAI(
    # Read the key from the environment, matching the cURL example
    api_key=os.environ["LUMENFALL_API_KEY"],
    base_url="https://api.lumenfall.ai/openai/v1",
)

response = client.images.generate(
    model="qwen-image",
    prompt="A serene mountain landscape at sunset",
    size="1024x1024",
)

# Response shape: {"created": 1234567890, "data": [{"url": "https://...", "revised_prompt": "..."}]}
print(response.data[0].url)
```
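
The response comments above show a `data` list whose items each carry a `url`. As a small sketch of how that payload might be handled, the helper below collects every URL from a plain dictionary of that shape, so it runs without network access. The name `extract_image_urls` is hypothetical, and skipping items that have no `url` is an assumption about how you may want to treat other response formats.

```python
from typing import Any


def extract_image_urls(payload: dict[str, Any]) -> list[str]:
    """Collect the `url` field from each item in the response's `data` list."""
    urls = []
    for item in payload.get("data", []):
        url = item.get("url")
        if url:  # skip items that carry no URL
            urls.append(url)
    return urls


# Sample payload mirroring the response shape shown in the examples above.
# The example.com URL is a placeholder, not a real generated image.
sample = {
    "created": 1234567890,
    "data": [{"url": "https://example.com/image.png", "revised_prompt": "..."}],
}
print(extract_image_urls(sample))
```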


## About

## Overview
Qwen Image is a text-to-image generation model developed by Alibaba Cloud’s Qwen team. It serves as the visual synthesis component of the broader Qwen ecosystem, designed to transform natural language prompts into high-fidelity imagery. The model is distinguished by its strong alignment with complex linguistic instructions and its ability to handle both English and Chinese prompts with high semantic accuracy.

## Strengths
*   **Multilingual Prompt Comprehension:** The model demonstrates superior performance in processing Chinese-language prompts, accurately capturing cultural nuances and idioms that Western-centric models often misinterpret.
*   **Compositional Accuracy:** It excels at spatial reasoning and multi-object placement, ensuring that elements described in a prompt maintain the correct relationship to one another.
*   **Text Rendering:** Qwen Image shows higher-than-average stability when generating legible text within images, such as signage, labels, or posters, reducing the common "gibberish" artifacts found in earlier diffusion models.
*   **Fine-Grained Detail:** The model is optimized for high-resolution output with a focus on realistic textures, particularly in skin tones, fabric weaves, and architectural materials.

## Limitations
*   **Anatomical Consistency:** Like many diffusion-based models, it can occasionally struggle with complex human anatomy, such as the specific number of digits on hands or complex overlapping limbs in action shots.
*   **Stylistic Range:** While versatile, the model tends toward a "digital photography" or "clean 3D render" aesthetic by default; achieving hyper-abstract or specific traditional art styles may require more intensive prompt engineering compared to models like Midjourney.

## Technical Background
Qwen Image belongs to the Qwen family of models, leveraging a large-scale diffusion transformer architecture tailored for high-dimensional visual synthesis. The training process involves a multi-stage pipeline that utilizes high-quality captioned image datasets, with a specific focus on cross-modal alignment between the Qwen LLM's text embeddings and the visual latent space. This allows the model to inherit the deep semantic understanding found in Alibaba's flagship language models.

## Best For
Qwen Image is particularly effective for marketing localization projects involving Chinese text, technical illustrations requiring precise object placement, and general-purpose asset generation for web and mobile interfaces. Pricing starts at $0.02 per megapixel through fal.ai (billed in whole megapixels, rounded up) or $0.025 per image through Replicate, which makes it a cost-effective choice for developers building high-volume image generation workflows.

Qwen Image is available for immediate deployment and testing through **Lumenfall’s unified API and playground**, allowing you to integrate its generative capabilities into your applications with minimal setup.

## Frequently Asked Questions

### How much does Qwen Image cost?

Through Lumenfall, Qwen Image costs $0.02 per megapixel via fal.ai (rounded up to the next whole megapixel) or $0.025 per image via Replicate. Lumenfall does not add any markup to provider pricing.

### How do I use Qwen Image via API?

You can use Qwen Image through Lumenfall's OpenAI-compatible API. Send requests to the unified endpoint with model ID "qwen-image". Code examples are available in Python, JavaScript, and cURL.

### Which providers offer Qwen Image?

Qwen Image is available through Replicate and fal.ai on Lumenfall. Lumenfall automatically routes requests to the best available provider.

## Links

- Model Page: https://lumenfall.ai/models/alibaba/qwen-image
- About: https://lumenfall.ai/models/alibaba/qwen-image/about
- Providers, Pricing & Performance: https://lumenfall.ai/models/alibaba/qwen-image/providers
- API Reference: https://lumenfall.ai/models/alibaba/qwen-image/api
- Benchmarks: https://lumenfall.ai/models/alibaba/qwen-image/benchmarks
- Use Cases: https://lumenfall.ai/models/alibaba/qwen-image/use-cases
- Gallery: https://lumenfall.ai/models/alibaba/qwen-image/gallery
- Playground: https://lumenfall.ai/playground?model=qwen-image
- API Documentation: https://docs.lumenfall.ai