Head to head
Esc

Models · slot A

to navigate to pick

DALL-E 2 OpenAI Qwen Image Max Alibaba

Settled by community votes across 1 shared challenge, with an AI judge weighing in on each.

DALL-E 2

17.7 arena score

#37 of 44 in Text-to-Image

Skill signature

Not enough comparable category data

The chart appears once both models have ratings across at least three shared arena categories.

Qwen Image Max

20.4 arena score

#31 of 44 in Text-to-Image

Vote tally

Where the votes landed

DALL-E 2

0%

win rate

Ties

0%

Qwen Image Max

0%

win rate

Shared challenges 1

Challenge by challenge

The strongest take from each model on every shared challenge, with the AI judge's read.

Magic Burger Explosion: Fiery Photorealism Challenge

Text-to-Image

“Ad for 'Magic Burger'. Dynamic, exploded burger with all components (bun, patty, cheese, lettuce, tomato, sauce) suspended in mid-air. Emphasize photorealistic detail and a sense of motion. Dark, fiery background with glowing embers. Integrate text: 'MAGIC BURGER' as a prominent title, 'LIMITED TIME ONLY' as a secondary message, and '€6.99' in a starburst, all rendered with a fiery, glowing effect.”

DALL-E 2
Qwen Image Max

AI Judge Analysis

DALL-E 2

  • + Successfully captures a chaotic, fiery atmosphere
  • + Includes a glowing effect on the central elements
  • Text is garbled and unreadable (e.g., 'MARGIC BAGUEC')
  • Food looks unappetizing and lacks photorealistic detail
  • Extremely low image resolution and clarity

Qwen Image Max

  • + Perfect text rendering for all requested elements
  • + High-quality photorealistic textures on the food
  • + Excellent layout and adherence to the 'exploded' and 'starburst' requirements
  • The burger is slightly less 'exploded' than typical deconstructed ads, appearing more as a tilted stack

Verdict: Qwen Image Max followed every instruction perfectly, producing a professional-grade advertisement with clear, correctly spelled text and appetizing textures. In contrast, DALL-E 2 produced a low-quality, blurry image with severe text errors and unrecognizable food components.

Next steps

Explore each model

The Max series of Tongyi Qwen’s image generation model excels across a wide range of generation tasks. Compared with the Plus series, it significantly reduces the “AI-like” feel in generated images, enhancing their realism. It delivers more lifelike material textures for human subjects, finer and more detailed natural textures, and more visually appealing text rendering.