Alibaba's Qwen Image 2.0 model with enhanced text rendering, supporting both Chinese and English prompts with up to 6 images per request
Settled by community votes across 1 shared challenge, with an AI judge weighing in on each.
Qwen Image 2.0
#32 of 44 in Text-to-Image
Not enough comparable category data
The chart appears once both models have ratings across at least three shared arena categories.
Z-Image Turbo
#15 of 44 in Text-to-Image
Where the votes landed
Qwen Image 2.0
0.0%
win rate
Ties
0.0%
Z-Image Turbo
100.0%
win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
The Halloween Invitation
Text-to-Image“Vintage gothic Halloween party invitation. Dark parchment poster, spooky border with webs and thorns, central glowing jack-o-lantern, bats, twisted trees, moody night sky. Add elegant gothic title text saying "Halloween Party Invitation", a small scroll banner saying "You are invited to a night of frights", and event details at the bottom: Date: 30.10.2026 Time: 7pm Location: The Arches, NYC Spooky but polished, cinematic lighting, square format.”
AI Judge Analysis
Qwen Image 2.0
- + Excellent typography with a cohesive gothic font style across all text.
- + High-quality, cinematic rendering of the jack-o-lantern and misty forest.
- + Perfect spelling of all requested event details.
Z-Image Turbo
- + Creative use of layering with the torn parchment on a dark background.
- + Good use of multiple scrolls for different text sections.
- + Includes graveyards and additional spooky elements for atmosphere.
- − Typos in the text: 'The Archves' instead of 'The Arches' and missing letters in the top banner.
- − The text font is inconsistent between the title and the details.
- − Composition is a bit cluttered with overlapping thorn and web elements.
Verdict: Qwen Image 2.0 followed all instructions perfectly, delivering an elegant and legible design with no spelling errors. While Z-Image Turbo had a creative parchment layered effect, it failed on fine details like spelling ('Archves') and font consistency, making Qwen Image 2.0 the superior choice for a usable invitation.
Explore each model
Tongyi-MAI's 6-billion parameter distilled text-to-image model optimized for speed, achieving high-quality generation in 8 steps or fewer with support for bilingual text rendering