OpenAI's cost-effective image generation model for when image quality isn't the top priority
Settled by community votes across 2 shared challenges, with an AI judge weighing in on each.
GPT Image 1 Mini
#12 of 44 in Text-to-Image
Wan 2.6
#23 of 44 in Text-to-Image
Where the votes landed
GPT Image 1 Mini
0%
win rate
Ties
0%
Wan 2.6
0%
win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Chalkboard Menu
Text-to-Image: “Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”
AI Judge Analysis
GPT Image 1 Mini
- + Excellent text legibility and accuracy with zero spelling errors.
- + Stable and clean composition.
- + Consistent chalk texture across all lettering.
- − The 'cursive' request for the title was not followed, as it remains blocky.
- − The text looks slightly too uniform, bordering on a digital font look rather than true organic handwriting.
Wan 2.6
- + Captures the 'elegant cursive' and 'handwritten' request much more authentically.
- + The texture of the board with chalk dust and smudges provides superior realism and atmosphere.
- + Good adherence to the slanted handwriting requested in the prompt.
- − Small spelling error in the date: in 'APRIL', the 'L' is mangled and there is an extra 'I'.
- − The composition is slightly tighter at the edges.
Verdict: GPT Image 1 Mini provides a very clean and perfectly legible menu, but fails to deliver the 'elegant cursive' requested for the title. Wan 2.6 captures the artistic soul of the prompt much better, featuring beautiful cursive and a realistic chalkboard texture, despite a minor character artifact in the date. Wan 2.6 is the preferred choice for its superior interpretation of the requested style and atmosphere.
The Capybara Taxi Driver
Text-to-Image: “Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”
AI Judge Analysis
GPT Image 1 Mini
- + Excellent photorealism and cinematic lighting
- + High quality texture detail on the capybara's fur
- + Accurate moody atmosphere for a night taxi ride
- − The passenger is seated correctly in the back but slightly out of focus
- − Only one paw is clearly visible and positioned on the steering wheel
Wan 2.6
- + Follows the instruction for both front paws on the steering wheel better
- + Includes vibrant New York street lights in the background
- + Captures the bored expression of the businesswoman well
- − Layout error where the passenger appears to be in the front seat next to the driver rather than the back seat
- − The capybara's hat looks more like a police officer hat than a taxi driver cap
- − Visual artifacts on the car roof
Verdict: GPT Image 1 Mini creates a much more believable and photorealistic scene with superior lighting and texture, though it ignores the specific 'two paws' detail. Wan 2.6 attempts more of the literal prompt details but fails significantly on composition by placing the passenger in the front seat, which contradicts the prompt and realistic taxi layouts.
Explore each model
Alibaba's multimodal generation model from the Wan AI suite, supporting text-to-video, image-to-video, reference-to-video with audio, and text-to-image, in both Chinese and English