Alibaba's Qwen Image 2.0 Pro model offering higher quality image generation with enhanced detail and accuracy
Settled by community votes across 3 shared challenges, with an AI judge weighing in on each.
Qwen Image 2.0 Pro
#27 of 44 in Text-to-Image
Not enough comparable category data
The chart appears once both models have ratings across at least three shared arena categories.
Wan 2.7 Pro
#29 of 44 in Text-to-Image
Where the votes landed
Qwen Image 2.0 Pro
100.0%
win rate
Ties
0.0%
Wan 2.7 Pro
0.0%
win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Chalkboard Menu
Text-to-Image“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”
AI Judge Analysis
Qwen Image 2.0 Pro
- + Excellent chalk texture with realistic smudges and dust
- + Perfect handwriting variation that looks genuinely hand-drawn
- + Perfectly followed the truncated 'Brown But...' prompt to complete the dessert item naturally
- − The layout is slightly more cramped than model B
Wan 2.7 Pro
- + Clean layout with decorative lines
- + Consistent and readable lettering
- + High resolution background elements
- − The text looks more like a digital font than actual chalk on a board
- − The 'chalk' texture is too uniform and lacks realistic pressure variations
- − Missing the natural grit and smudging found in real hand-drawn chalk art
Verdict: Qwen Image 2.0 Pro is the clear winner as it perfectly captures the requested 'realistic chalk handwriting style' with authentic texture, smudging, and letter variation. Wan 2.1 Pro produces text that looks like a clean digital overlay/font, failing the core aesthetic requirement of the prompt despite having a nice overall composition.
The Capybara Taxi Driver
Text-to-Image“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”
AI Judge Analysis
Qwen Image 2.0 Pro
- + Excellent photorealism in the capybara's fur and textures.
- + Accurately captures the 'bored' expression of the passenger.
- + The lighting and taxi interior feel very authentic to a New York night.
- − The capybara's paws on the steering wheel look more like human hands than paws.
- − The composition places the passenger in the front seat next to the driver instead of the back seat as requested.
Wan 2.7 Pro
- + Better adherence to the 'back seat' instruction for the passenger.
- + Incredibly high detail on the capybara's fur and the small badge on the hat.
- + Stronger sense of cinematic composition through the window.
- − The capybara's paws appear to be monkey or human hands with dark skin/hair, which is anatomically strange.
- − Perspective issues make it look like the steering wheel is at an odd angle relative to the dashboard.
Verdict: Both models struggled with renders of the capybara's hands, producing human-like digits instead of paws. Wan 2.7 Pro followed the spatial instructions better by placing the passenger in the back seat, whereas Qwen Image 2.0 Pro placed her in the passenger seat. Wan 2.7 Pro also provided a more dynamic and clear view of the New York street scene.
The Halloween Invitation
Text-to-Image“Vintage gothic Halloween party invitation. Dark parchment poster, spooky border with webs and thorns, central glowing jack-o-lantern, bats, twisted trees, moody night sky. Add elegant gothic title text saying "Halloween Party Invitation", a small scroll banner saying "You are invited to a night of frights", and event details at the bottom: Date: 30.10.2026 Time: 7pm Location: The Arches, NYC Spooky but polished, cinematic lighting, square format.”
AI Judge Analysis
Qwen Image 2.0 Pro
- + Perfect text rendering for all lines including the banner and event details.
- + Excellent cinematic lighting with a glowing green jack-o-lantern and volumetric shadows.
- + Highly detailed illustration style for the bats and the thorn-and-web border.
- − The jack-o-lantern's interior candle placement is slightly illogical.
- − The overall tone is a bit more 'digital art' than 'vintage parchment'.
Wan 2.7 Pro
- + Captures the 'vintage gothic' and 'parchment poster' aesthetic very well with a warm, weathered look.
- + Strong composition with many thematic elements like the cauldron, moon, and gravestones.
- + Clean, legible typography that fits the gothic theme.
- − The jack-o-lantern has a candle wick growing out of its nose area, which looks strange.
- − The small bat silhouettes are repetitive and feel like a pattern rather than part of the scene.
Verdict: Qwen Image 2.0 Pro is the winner due to its superior lighting, high-fidelity details, and perfectly rendered text that follows the prompt exactly. While Wan 2.1 Pro captures the 'vintage' aesthetic more effectively, Qwen's cinematic quality and polished execution of the specific elements like the bats and border make it a more professional-looking invitation.
Explore each model
Alibaba's Wan 2.7 Pro image generation and editing model with higher-quality outputs and support for 4K image generation