Gemini 3 Pro with image generation capabilities. Combines advanced reasoning with the ability to generate and edit images.
Settled by community votes across 1 shared challenge, with an AI judge weighing in on each.
Nano Banana Pro
#2 of 44 in Text-to-Image
Qwen Image Max
#31 of 44 in Text-to-Image
Where the votes landed
Nano Banana Pro: 100.0% win rate
Ties: 0.0%
Qwen Image Max: 0.0% win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Magic Burger Explosion: Fiery Photorealism Challenge
Text-to-Image: “Ad for 'Magic Burger'. Dynamic, exploded burger with all components (bun, patty, cheese, lettuce, tomato, sauce) suspended in mid-air. Emphasize photorealistic detail and a sense of motion. Dark, fiery background with glowing embers. Integrate text: 'MAGIC BURGER' as a prominent title, 'LIMITED TIME ONLY' as a secondary message, and '€6.99' in a starburst, all rendered with a fiery, glowing effect.”
AI Judge Analysis
Nano Banana Pro
- + Excellent adherence to the 'exploded' layout requirement.
- + Superior text clarity and consistent glowing effect.
- − The '€' symbol has a slight visual artifact in its stroke.
Qwen Image Max
- + Very high detail on the meat texture and sauce drips.
- + Energetic background with strong motion effects.
- − Failed to provide an exploded view, showing a mostly assembled burger.
- − Text is less integrated with the requested glowing ember style.
Verdict: Nano Banana Pro followed the structural requirements of the prompt much more accurately, providing a clear exploded view and legible text. While Qwen Image Max has impressive texture work on the ingredients, it ignored the request for a deconstructed layout. Nano Banana Pro is the winner for its balance of composition and strict prompt adherence.
Explore each model
Qwen Image Max: The Max series of Tongyi Qwen’s image generation models excels across a wide range of generation tasks. Compared with the Plus series, it significantly reduces the “AI-like” feel of generated images and makes them more realistic. It delivers more lifelike material textures for human subjects, finer natural textures, and more visually appealing text rendering.