Head to head
Esc

Models · slot A

to navigate to pick

DALL-E 3 OpenAI Qwen Image Max Alibaba

Settled by community votes across 1 shared challenge, with an AI judge weighing in on each.

DALL-E 3

18.5 arena score

#35 of 44 in Text-to-Image

Skill signature

Not enough comparable category data

The chart appears once both models have ratings across at least three shared arena categories.

Qwen Image Max

20.4 arena score

#31 of 44 in Text-to-Image

Vote tally

Where the votes landed

DALL-E 3

0%

win rate

Ties

0%

Qwen Image Max

0%

win rate

Shared challenges 1

Challenge by challenge

The strongest take from each model on every shared challenge, with the AI judge's read.

Magic Burger Explosion: Fiery Photorealism Challenge

Text-to-Image

“Ad for 'Magic Burger'. Dynamic, exploded burger with all components (bun, patty, cheese, lettuce, tomato, sauce) suspended in mid-air. Emphasize photorealistic detail and a sense of motion. Dark, fiery background with glowing embers. Integrate text: 'MAGIC BURGER' as a prominent title, 'LIMITED TIME ONLY' as a secondary message, and '€6.99' in a starburst, all rendered with a fiery, glowing effect.”

DALL-E 3
Qwen Image Max

AI Judge Analysis

DALL-E 3

  • + Excellent exploded view with clear separation of every ingredient
  • + Highly creative lighting that makes the food appear to glow internally
  • + Crisp, photorealistic texture on the patty and bun
  • Several spelling errors in the text including 'MAGIC BURGR' and 'Limiited'
  • The price is contained in a rectangle rather than the requested starburst

Qwen Image Max

  • + Perfect text rendering for all requested elements with no spelling errors
  • + Accurately follows the starburst requirement for the price tag
  • + Strong adherence to the fiery, glowing effect on a dark background
  • The burger is not fully 'exploded'; many components are still touching or only slightly offset
  • The lettuce and tomato on the right look slightly less integrated into the physics of the scene

Verdict: Qwen Image Max is the superior choice because it perfectly rendered all requested text without spelling errors and followed specific layout instructions like the starburst. While DALL-E 3 achieved a more dramatic exploded effect and high-end food photography lighting, the significant typos in the main title and secondary text make it unusable for an advertisement.

Next steps

Explore each model

The Max series of Tongyi Qwen’s image generation model excels across a wide range of generation tasks. Compared with the Plus series, it significantly reduces the “AI-like” feel in generated images, enhancing their realism. It delivers more lifelike material textures for human subjects, finer and more detailed natural textures, and more visually appealing text rendering.