Head to head

Qwen Image 2.0 Pro (Alibaba) vs. Z-Image Turbo (Alibaba)

Settled by community votes across 3 shared challenges, with an AI judge weighing in on each.

Qwen Image 2.0 Pro

22.3 arena score

#27 of 44 in Text-to-Image

Skill signature

Not enough comparable category data

The chart appears once both models have ratings across at least three shared arena categories.

Z-Image Turbo

24.7 arena score

#15 of 44 in Text-to-Image

Vote tally

Where the votes landed

Qwen Image 2.0 Pro: 0.0% win rate

Ties: 100.0%

Z-Image Turbo: 0.0% win rate

Shared challenges 3

Challenge by challenge

The strongest take from each model on every shared challenge, with the AI judge's read.

Chalkboard Menu

Text-to-Image

“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”

Qwen Image 2.0 Pro
Z-Image Turbo

AI Judge Analysis

Qwen Image 2.0 Pro

  • + Excellent authentic chalk texture with dusting and smudges
  • + Accurate spelling of all complex menu items
  • + Highly realistic café background with depth of field
  • The title is in print-style handwriting rather than the requested elegant cursive
  • Lettering slant varies significantly between lines

Z-Image Turbo

  • + Clean and legible text presentation
  • + Good use of space on the chalkboard
  • + Effective chalk-style font
  • Contains a spelling error ('Mustroom')
  • Missing the '&' symbol (or the word 'and') that the prompt specifies in the middle menu item
  • Lacks the atmospheric background requested for a 'cozy café'

Verdict: Qwen Image 2.0 Pro is the superior choice because it captures the atmosphere of a café and spells all the menu text correctly, despite missing the requested cursive title. Z-Image Turbo produces a more generic output with a noticeable typo ('Mustroom') and a less realistic overall texture.

The Capybara Taxi Driver

Text-to-Image

“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”

Qwen Image 2.0 Pro
Z-Image Turbo
Votes: Qwen Image 2.0 Pro 0% wins · 100% ties · Z-Image Turbo 0% wins

AI Judge Analysis

Qwen Image 2.0 Pro

  • + Excellent photorealism with gritty, cinematic lighting that feels like New York at night.
  • + Strong prompt adherence, correctly placing the woman in the back seat and the capybara in the front.
  • + Highly detailed textures on the capybara's fur and the weathered taxi interior.
  • Text on the 'TLC Licensed' sticker is slightly misspelled as 'Licesed'.
  • The hands of the capybara look a bit more like human hands than paws.

Z-Image Turbo

  • + Natural, professional expression on the capybara.
  • + Clean, modern lighting and high-resolution quality.
  • Failed the spatial instructions by placing the passenger in the front passenger seat instead of the back seat.
  • The blurred background is generic and doesn't clearly convey the Manhattan cityscape.
  • The capybara's hands are anatomically distorted where they meet the steering wheel.

Verdict: Qwen Image 2.0 Pro is the clear winner as it correctly followed the spatial instruction to place the passenger in the back seat, whereas Z-Image Turbo placed her in the front. Qwen also captured the 'New York taxi' atmosphere much more effectively with realistic interior grime and iconic street lighting, while Z-Image Turbo felt more generic.

The Halloween Invitation

Text-to-Image

“Vintage gothic Halloween party invitation. Dark parchment poster, spooky border with webs and thorns, central glowing jack-o-lantern, bats, twisted trees, moody night sky. Add elegant gothic title text saying "Halloween Party Invitation", a small scroll banner saying "You are invited to a night of frights", and event details at the bottom: Date: 30.10.2026 Time: 7pm Location: The Arches, NYC Spooky but polished, cinematic lighting, square format.”

Qwen Image 2.0 Pro
Z-Image Turbo

AI Judge Analysis

Qwen Image 2.0 Pro

  • + Excellent text rendering with perfect spelling and high-quality gothic fonts.
  • + Strong adherence to the border requirement featuring detailed thorns and webs.
  • + Cinematic lighting with a distinct green glow that adds character.
  • The parchment texture is less 'vintage' and more of a flat background color.
  • The bats look slightly too realistic, which clashes with the illustrative border.

Z-Image Turbo

  • + Beautiful vintage parchment texture with burnt/torn edges.
  • + Great atmospheric depth with the burial ground and twisted trees visible through the paper cutout.
  • + Captures the 'vintage poster' aesthetic very effectively.
  • Spelling error in the location ('The Archves' instead of 'The Arches').
  • The banner for the sub-text is poorly integrated and the text is very small.

Verdict: Qwen Image 2.0 Pro is the superior choice for a functional invitation as it perfectly rendered all requested text without spelling errors. While Z-Image Turbo captured a more authentic vintage parchment aesthetic, its failure to correctly spell the location 'The Arches' and the placement of the sub-text makes it less useful as a final product.
