Featured

Qwen Image 2512

Improved version of Alibaba's Qwen image model with better text rendering, finer natural textures, and more realistic human generation.

Explore Model

Models
Alibaba

Alibaba

5 AI Image Generation Models, 7 AI Image Editing Models, and 1 AI Image & Video Model

Models

13 models

Alibaba

Qwen Image

Text to Image Image Edit

Alibaba's Qwen image model

$0.0200 /img

Alibaba

Qwen Image 2.0

Text to Image

Alibaba's Qwen Image 2.0 model with enhanced text rendering, supporting both Chinese and English prompts with up to 6 images per request

$0.0350 /img

Alibaba

Qwen Image 2.0 Pro

Text to Image

Alibaba's Qwen Image 2.0 Pro model offering higher quality image generation with enhanced detail and accuracy

$0.0750 /img

Alibaba

Qwen Image 2512

Text to Image Image Edit

Improved version of Alibaba's Qwen image model with better text rendering, finer natural textures, and more realistic human generation.

$0.0200 /img

Alibaba

Qwen Image Edit 2509

Image Edit

Alibaba's Qwen image editing model for instruction-based image modifications and transformations

$0.0300 /img

Alibaba

Qwen Image Edit 2511

Image Edit

Alibaba's Qwen image editing model for instruction-based image modifications and transformations

$0.0300 /img

Alibaba

Qwen Image Edit Latest

Image Edit

Alibaba's Qwen image editing model for instruction-based image modifications and transformations

$0.0300 /img

Qwen Image Max AI generated image example

Alibaba

The Max series of Tongyi Qwen’s image generation model excels across a wide range of generation tasks. Compared with the Plus series, it significantly reduces the “AI-like” feel in generated images, enhancing their realism. It delivers more lifelike material textures for human subjects, finer and more detailed natural textures, and more visually appealing text rendering.

$0.0750 /img

Alibaba

Wan 2.5 (Preview)

Text to Image Image Edit

Alibaba's text-to-image and image-to-image generation model from the Wan AI suite, offering high-quality visual generation capabilities

$0.0500 /img

Alibaba

Wan 2.6

Text to Image Image Edit Text to Video Image to Video Video to Video

Alibaba's multimodal generation model from the Wan AI suite, supporting text-to-video, image-to-video, reference-to-video with audio, and text-to-image, in both Chinese and English

$0.0300 /video

Alibaba

Wan 2.7

Text to Image Image Edit

Alibaba's Wan 2.7 image generation and editing model for text-to-image, reference-guided generation, and instruction-based image edits

$0.0300 /img

Alibaba

Wan 2.7 Pro

Text to Image Image Edit

Alibaba's Wan 2.7 Pro image generation and editing model with higher-quality outputs and support for 4K image generation

$0.0750 /img

Alibaba

Z-Image Turbo

Text to Image Image Edit

Tongyi-MAI's 6-billion parameter distilled text-to-image model optimized for speed, achieving high-quality generation in 8 steps or fewer with support for bilingual text rendering

$0.0050 /img

Examples

20 images

Arena Rankings

Text to Image

View full leaderboard

#	Model	Elo
12	Z-Image Turbo	1253
13	Wan 2.5 (Preview)	1250
10 more models
24	Qwen Image 2.0	1236
3 more models
28	Wan 2.6	1232
2 more models
31	Qwen Image 2512	1226
32	Qwen Image 2.0 Pro	1225
2 more models
35	Qwen Image	1214
2 more models
38	Qwen Image Max	1209
1 more model
40	Wan 2.7 Pro	1207
41	Wan 2.7	1205
21 more models

62 models ranked

Image Editing

View full leaderboard

#	Model	Elo
9	Wan 2.6	1209
6 more models
16	Qwen Image Edit 2511	1202
2 more models
19	Wan 2.7	1175
4 more models
24	Wan 2.5 (Preview)	1146
25	Wan 2.7 Pro	1143
2 more models
28	Z-Image Turbo	1109
3 more models
32	Qwen Image Edit 2509	1078

32 models ranked

Image Editing

View full leaderboard

#	Model	Elo
2	Wan 2.6	1188
5 more models

7 models ranked

Image Editing

View full leaderboard

#	Model	Elo
2	Wan 2.6	1065
1 more model

3 models ranked

About Alibaba

Alibaba is a global leader in artificial intelligence research, specifically through its Cloud and Tongyi divisions. The organization focuses on large-scale multimodal models, developing sophisticated architectures for image generation, video synthesis, and instruction-based image editing. They are widely recognized for the Qwen and Wan series, which offer high-performance alternatives to Western proprietary models with a particular emphasis on bilingual proficiency and architectural efficiency.

Wan 2.6 / 2.7 Pro: A versatile multimodal suite capable of high-fidelity text-to-image and video generation. The Pro variants support up to 4K resolution and advanced reference-guided generation, ranking highly for both visual quality and editing precision.
Z-Image Turbo: A distilled 6-billion parameter model optimized for inference speed. It achieves competitive results in as few as eight steps and maintains a top-20 position on global Elo leaderboards for text-to-image generation.
Qwen Image 2.0 / 2512: These models provide enhanced text rendering and realistic human textures. They are designed to handle complex prompt instructions in both Chinese and English, supporting multiple image inputs per request for contextual generation.
Qwen Image Edit: A specialized series of models focused on instruction-based transformations. These allow users to modify existing images through natural language, consistently appearing in the top rankings for image editing tasks.

Alibaba excels at technical distillation and multimodal integration, producing models that balance high-quality output with computational efficiency. Their systems are particularly effective for workflows requiring precise text rendering within images and nuanced bilingual understanding across diverse creative tasks. These models are available through Lumenfall’s unified API, allowing developers to integrate Alibaba’s generative capabilities into their applications with a single interface.