OpenAI's state-of-the-art image generation model with better instruction following and adherence to prompts
Settled by community votes across 1 shared challenge, with an AI judge weighing in on each.
GPT Image 1.5
#7 of 44 in Text-to-Image
Not enough comparable category data
The chart appears once both models have ratings across at least three shared arena categories.
Qwen Image Max
#31 of 44 in Text-to-Image
Where the votes landed
GPT Image 1.5
100.0%
win rate
Ties
0.0%
Qwen Image Max
0.0%
win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Magic Burger Explosion: Fiery Photorealism Challenge
Text-to-Image“Ad for 'Magic Burger'. Dynamic, exploded burger with all components (bun, patty, cheese, lettuce, tomato, sauce) suspended in mid-air. Emphasize photorealistic detail and a sense of motion. Dark, fiery background with glowing embers. Integrate text: 'MAGIC BURGER' as a prominent title, 'LIMITED TIME ONLY' as a secondary message, and '€6.99' in a starburst, all rendered with a fiery, glowing effect.”
AI Judge Analysis
GPT Image 1.5
- + Excellent adherence to the 'exploded' request with clear vertical separation of all ingredients.
- + High-quality, photorealistic textures on the patty and vegetables.
- + Superior integration of the starburst element with the fiery background.
- − The 'MAGIC BURGER' text is slightly cut off at the top corners.
- − The lighting is very intense, which makes some areas look slightly over-sharpened.
Qwen Image Max
- + Clean, professional layout with well-balanced text elements.
- + Effective fiery effect on the typography that feels very integrated.
- + Good sense of motion and dynamic energy in the splash effect.
- − Failed the 'exploded' prompt as the bun is still resting on the patty.
- − The burger components are mostly bunched together rather than suspended in mid-air.
- − The starburst is more of a light flare than a traditional ad starburst shape.
Verdict: GPT Image 1.5 followed the prompt much more accurately, creating a true 'exploded' view where every ingredient is suspended individually. While Qwen Image Max produced a very clean and professional advertisement, it failed to separate the core components of the burger as requested. GPT Image 1.5 is the winner for its superior composition and adherence to the technical structure of the prompt.
Explore each model
The Max series of Tongyi Qwen’s image generation model excels across a wide range of generation tasks. Compared with the Plus series, it significantly reduces the “AI-like” feel in generated images, enhancing their realism. It delivers more lifelike material textures for human subjects, finer and more detailed natural textures, and more visually appealing text rendering.