Head to head
Esc

Models · slot A

to navigate to pick

Grok Imagine Image Pro xAI Qwen Image 2.0 Pro Alibaba

Settled by community votes across 2 shared challenges, with an AI judge weighing in on each.

Grok Imagine Image Pro

24.8 arena score

#14 of 44 in Text-to-Image

Skill signature

Not enough comparable category data

The chart appears once both models have ratings across at least three shared arena categories.

Qwen Image 2.0 Pro

22.3 arena score

#27 of 44 in Text-to-Image

Vote tally

Where the votes landed

Grok Imagine Image Pro

0.0%

win rate

Ties

0.0%

Qwen Image 2.0 Pro

100.0%

win rate

0.0% 0.0% ties 100.0%
Shared challenges 2

Challenge by challenge

The strongest take from each model on every shared challenge, with the AI judge's read.

Chalkboard Menu

Text-to-Image

“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”

Grok Imagine Image Pro
Qwen Image 2.0 Pro

AI Judge Analysis

Grok Imagine Image Pro

  • + Excellent text legibility and accuracy of the requested menu items.
  • + Realistic chalk texture with visible smudges and varying pressure.
  • + Perfectly centered and balanced composition within a wooden frame.
  • The 'cursive' for the title is more of a print-script hybrid than elegant cursive.
  • The bottom line of text appears slightly more like a digital font compared to the main items.

Qwen Image 2.0 Pro

  • + Natural, dynamic perspective that adds to the 'cozy café' atmosphere.
  • + Great handwriting style with natural slants and varied letter sizes.
  • + Includes realistic chalk dust and board wear that feels authentic.
  • Text layout is slightly less clean, with 'Herbs' appearing on its own line poorly compared to Model A.
  • The 'special' date line at the top is a bit cramped at the edge.

Verdict: Both models followed the complex text prompt remarkably well, correctly rendering the specific items and prices. Grok Imagine Image Pro is preferred for its superior legibility and perfectly centered presentation, whereas Qwen Image 2.0 Pro offers a slightly more artistic, albeit slightly more cluttered, photographic composition.

The Capybara Taxi Driver

Text-to-Image

“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”

Grok Imagine Image Pro
Qwen Image 2.0 Pro
0% wins 0% ties 100% wins

AI Judge Analysis

Grok Imagine Image Pro

  • + Excellent photographic quality with realistic depth of field and lighting.
  • + The woman is correctly positioned in the back seat as requested.
  • + Very clean rendering of facial features and hand details for both subjects.
  • The cap is a modern baseball cap rather than a traditional taxi driver cap.

Qwen Image 2.0 Pro

  • + Features a more authentic, traditional taxi driver uniform cap.
  • + The interior details feel gritty and realistic for an older New York cab.
  • The passenger is sitting in the front seat or is scaled incorrectly, appearing next to the driver.
  • Noticeable anatomy issues with the capybara's paws on the steering wheel.
  • The text on the dashboard 'TLC Licesed' is misspelled.

Verdict: Grok Imagine Image Pro is the clear winner for its superior composition and technical execution, correctly placing the human passenger in the back seat while maintaining high photorealistic quality. Qwen Image 2.0 Pro fails the spatial requirements of the prompt by placing the passenger in the front and suffers from significant anatomical and spelling errors.

Next steps

Explore each model