Grok Imagine Image xAI Qwen Image Max Alibaba

Settled by community votes across 1 shared challenge, with an AI judge weighing in on each.

Grok Imagine Image

24.1 arena score

#19 of 44 in Text-to-Image

Skill signature

Not enough comparable category data

The chart appears once both models have ratings across at least three shared arena categories.

Qwen Image Max

20.4 arena score

#31 of 44 in Text-to-Image

Vote tally

Where the votes landed

Grok Imagine Image

win rate

Ties

Qwen Image Max

win rate

Shared challenges 1

Challenge by challenge

The strongest take from each model on every shared challenge, with the AI judge's read.

Magic Burger Explosion: Fiery Photorealism Challenge

Text-to-Image

“Ad for 'Magic Burger'. Dynamic, exploded burger with all components (bun, patty, cheese, lettuce, tomato, sauce) suspended in mid-air. Emphasize photorealistic detail and a sense of motion. Dark, fiery background with glowing embers. Integrate text: 'MAGIC BURGER' as a prominent title, 'LIMITED TIME ONLY' as a secondary message, and '€6.99' in a starburst, all rendered with a fiery, glowing effect.”

Grok Imagine Image

Qwen Image Max

AI Judge Analysis

Grok Imagine Image

+ Excellent 'exploded' layout with clearly suspended components as requested
+ Clean and highly legible text rendering for all elements
+ Dynamic splash effects with sauces add to the sense of motion

− The starburst for the price looks like a flat clip-art element rather than integrated into the 3D scene

Qwen Image Max

+ Price starburst is beautifully rendered with light rays and glowing effects
+ High texture detail on the patty and bun
+ Very atmospheric lighting and ember effects

− Failed the 'exploded' instruction as the burger is mostly assembled
− The secondary text is slightly less polished compared to the main title

Verdict: Grok Imagine followed the complex 'exploded burger' layout much better than Qwen Image Max, showing a clear separation of all ingredients. While Qwen Image Max produced a more integrated and visually impressive price starburst, the failure to separate the burger components makes it a less accurate response to the specific prompt instructions.

Next steps

Explore each model

Grok Imagine Image

xAI

An image generation model by xAI designed to generate highly aesthetic images from text descriptions.

Vote this model in the arena

Arena profile Lumenfall catalog

Qwen Image Max

Alibaba

The Max series of Tongyi Qwen’s image generation model excels across a wide range of generation tasks. Compared with the Plus series, it significantly reduces the “AI-like” feel in generated images, enhancing their realism. It delivers more lifelike material textures for human subjects, finer and more detailed natural textures, and more visually appealing text rendering.

Vote this model in the arena

Arena profile Lumenfall catalog