Head to head
Esc

Models · slot A

to navigate to pick

DALL-E 2 OpenAI Qwen Image 2.0 Pro Alibaba

Settled by community votes across 3 shared challenges, with an AI judge weighing in on each.

DALL-E 2

17.7 arena score

#37 of 44 in Text-to-Image

Skill signature

Not enough comparable category data

The chart appears once both models have ratings across at least three shared arena categories.

Qwen Image 2.0 Pro

22.3 arena score

#27 of 44 in Text-to-Image

Vote tally

Where the votes landed

DALL-E 2

0.0%

win rate

Ties

0.0%

Qwen Image 2.0 Pro

100.0%

win rate

0.0% 0.0% ties 100.0%
Shared challenges 3

Challenge by challenge

The strongest take from each model on every shared challenge, with the AI judge's read.

Chalkboard Menu

Text-to-Image

“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”

DALL-E 2
Qwen Image 2.0 Pro

AI Judge Analysis

DALL-E 2

  • + The text has a high-contrast chalk-like texture.
  • The text is completely illegible and contains gibberish.
  • It fails to follow any specific menu items requested in the prompt.
  • The image quality is low resolution and lacks any environment context.

Qwen Image 2.0 Pro

  • + Exceptional text rendering with perfect adherence to the requested menu items and date.
  • + Highly realistic chalk texture with natural smudge marks and consistent handwriting style.
  • + Excellent composition showing the menu in a believable cafe environment.
  • The 'cursive' request for the title is interpreted more as a neat print/script than flowing cursive.

Verdict: Qwen Image 2.0 Pro is the clear winner, demonstrating near-perfect text rendering capabilities by accurately displaying all requested menu items and the specific date. In contrast, DALL-E 2 produced an illegible, low-quality image that failed to follow any of the prompt's linguistic requirements.

The Capybara Taxi Driver

Text-to-Image

“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”

DALL-E 2
Qwen Image 2.0 Pro

AI Judge Analysis

DALL-E 2

  • Completely ignores the prompt by generating a black leather bag instead of a taxi scene.
  • Terrible image quality with strange artifacts and melted textures.
  • Single-subject focus that lacks any of the requested characters or environment.

Qwen Image 2.0 Pro

  • + Excellent prompt adherence including the capybara's outfit, pose, and the businesswoman's expression.
  • + High-quality photorealistic rendering with believable lighting and bokeh effects.
  • + Rich in specific details like the TLC license on the dashboard and accurate phone-using posture.
  • The capybara's paws are rendered with slightly uncanny, human-like finger articulation.
  • Minor spelling error on the dashboard plaque ('Licesed' instead of 'Licensed').

Verdict: Qwen Image 2.0 Pro followed every detail of the prompt, creating a highly detailed and atmospheric scene that captured the humor of the request perfectly. DALL-E 2 suffered a total failure, producing an irrelevant image of a distorted black bag that bore no resemblance to the prompt.

The Halloween Invitation

Text-to-Image

“Vintage gothic Halloween party invitation. Dark parchment poster, spooky border with webs and thorns, central glowing jack-o-lantern, bats, twisted trees, moody night sky. Add elegant gothic title text saying "Halloween Party Invitation", a small scroll banner saying "You are invited to a night of frights", and event details at the bottom: Date: 30.10.2026 Time: 7pm Location: The Arches, NYC Spooky but polished, cinematic lighting, square format.”

DALL-E 2
Qwen Image 2.0 Pro
0% wins 0% ties 100% wins

AI Judge Analysis

DALL-E 2

  • + Captures a strong vintage hand-drawn aesthetic.
  • + Includes the requested twisted trees and bats.
  • All text is gibberish and doesn't follow the specific wording requested.
  • Lacks the central jack-o-lantern requested in the prompt.
  • Low visual clarity and poor resolution with significant artifacting.

Qwen Image 2.0 Pro

  • + Excellent text rendering, following all specific names, dates, and locations perfectly.
  • + Very high cinematic visual quality with polished lighting and textures.
  • + Strictly adheres to every prompt element including the thorn/web border and glowing pumpkin.
  • The Bats have slightly cartoonish facial features compared to the gothic setting.
  • The font on the scroll is a bit modern compared to the gothic theme.

Verdict: Qwen Image 2.0 Pro is the clear winner as it successfully rendered all the complex text requirements with perfect spelling, whereas DALL-E 2 produced illegible text. Qwen also followed the compositional instructions for the jack-o-lantern and border accurately, resulting in a professional-grade invitation image.

Next steps

Explore each model