Black Forest Labs' state-of-the-art image generation model with maximum quality and speed, supporting text-to-image and multi-reference image editing with up to 4MP output
Settled by community votes across 1 shared challenge, with an AI judge weighing in on each.
FLUX.2 [pro]
#9 of 44 in Text-to-Image
Not enough comparable category data
The chart appears once both models have ratings across at least three shared arena categories.
Wan 2.7
#34 of 44 in Text-to-Image
Where the votes landed
FLUX.2 [pro]
0%
win rate
Ties
0%
Wan 2.7
0%
win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
The Capybara Taxi Driver
Text-to-Image“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”
AI Judge Analysis
FLUX.2 [pro]
- + Excellent photorealism with cinematic lighting and realistic textures on the capybara's fur.
- + Very detailed car interior including a dashboard, radio, and hanging accessories.
- + Perfectly captures the perspective from inside the car as requested.
- − The 'hands' gripping the wheel look more like human hands in gloves than capybara paws.
- − The businesswoman in the back is slightly out of focus and less detailed than the driver.
Wan 2.7
- + Accurately depicts capybara claws/paws on the steering wheel.
- + Clear view of both the driver and the bored passenger in a single frame.
- − The perspective is from outside the car, failing the 'inside a yellow New York taxi' prompt requirement.
- − The passenger is sitting in the front seat instead of the back seat.
- − The capybara's head has a 'copy-pasted' look with a noticeable harsh edge where its neck meets the jacket.
Verdict: FLUX.2 [pro] followed the prompt's perspective much better, creating an immersive shot from inside the vehicle with high-quality cinematic lighting. While Wan 2.7 handled the capybara's paws more realistically, it failed the spatial requirements of the prompt by placing the passenger in the front seat and moving the camera outside the taxi.
Explore each model
Alibaba's Wan 2.7 image generation and editing model for text-to-image, reference-guided generation, and instruction-based image edits