Head to head
Esc

Models · slot A

to navigate to pick

Grok Imagine Image xAI Qwen Image Max Alibaba

Settled by community votes across 1 shared challenge, with an AI judge weighing in on each.

Grok Imagine Image

24.1 arena score

#19 of 44 in Text-to-Image

Skill signature

Not enough comparable category data

The chart appears once both models have ratings across at least three shared arena categories.

Qwen Image Max

20.4 arena score

#31 of 44 in Text-to-Image

Vote tally

Where the votes landed

Grok Imagine Image

0%

win rate

Ties

0%

Qwen Image Max

0%

win rate

Shared challenges 1

Challenge by challenge

The strongest take from each model on every shared challenge, with the AI judge's read.

Magic Burger Explosion: Fiery Photorealism Challenge

Text-to-Image

“Ad for 'Magic Burger'. Dynamic, exploded burger with all components (bun, patty, cheese, lettuce, tomato, sauce) suspended in mid-air. Emphasize photorealistic detail and a sense of motion. Dark, fiery background with glowing embers. Integrate text: 'MAGIC BURGER' as a prominent title, 'LIMITED TIME ONLY' as a secondary message, and '€6.99' in a starburst, all rendered with a fiery, glowing effect.”

Grok Imagine Image
Qwen Image Max

AI Judge Analysis

Grok Imagine Image

  • + Excellent 'exploded' layout with clearly suspended components as requested
  • + Clean and highly legible text rendering for all elements
  • + Dynamic splash effects with sauces add to the sense of motion
  • The starburst for the price looks like a flat clip-art element rather than integrated into the 3D scene

Qwen Image Max

  • + Price starburst is beautifully rendered with light rays and glowing effects
  • + High texture detail on the patty and bun
  • + Very atmospheric lighting and ember effects
  • Failed the 'exploded' instruction as the burger is mostly assembled
  • The secondary text is slightly less polished compared to the main title

Verdict: Grok Imagine followed the complex 'exploded burger' layout much better than Qwen Image Max, showing a clear separation of all ingredients. While Qwen Image Max produced a more integrated and visually impressive price starburst, the failure to separate the burger components makes it a less accurate response to the specific prompt instructions.

Next steps

Explore each model

The Max series of Tongyi Qwen’s image generation model excels across a wide range of generation tasks. Compared with the Plus series, it significantly reduces the “AI-like” feel in generated images, enhancing their realism. It delivers more lifelike material textures for human subjects, finer and more detailed natural textures, and more visually appealing text rendering.