Black Forest Labs' state-of-the-art image generation model with maximum quality and speed, supporting text-to-image and multi-reference image editing with up to 4MP output
Settled by community votes across 1 shared challenge, with an AI judge weighing in on each.
FLUX.2 [pro]
#9 of 44 in Text-to-Image
Not enough comparable category data
The chart appears once both models have ratings across at least three shared arena categories.
Qwen Image 2.0 Pro
#27 of 44 in Text-to-Image
Where the votes landed
FLUX.2 [pro]
0%
win rate
Ties
0%
Qwen Image 2.0 Pro
0%
win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
The Capybara Taxi Driver
Text-to-Image“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”
AI Judge Analysis
FLUX.2 [pro]
- + Excellent photorealistic lighting and depth of field
- + Highly detailed taxi dashboard and interior textures
- + Superior composition with a cinematic feel
- − The capybara's hands look more like human-like gloves than animal paws
- − The capybara is looking through the side window rather than the windshield
Qwen Image 2.0 Pro
- + Successfully positions the woman in the back seat as requested
- + Includes a 'TLC Licesed' tag that adds to the New York taxi theme
- + Good interpretation of the capybara's front paws on the wheel
- − The woman is placed in the front passenger area rather than the back seat
- − Noticeable spelling error on the 'Licensed' tag
- − Lower overall image clarity and slightly muddy textures compared to the competitor
Verdict: FLUX.2 [pro] produces a significantly more cinematic and high-quality image with impressive lighting and realistic textures, although it places the passenger in a way that suggests a smaller vehicle or different seating arrangement. Qwen Image 2.0 Pro captures the specific prompt details like the paws on the wheel well, but fails the spatial requirement of putting the woman in the back seat and has lower visual fidelity.
Explore each model
Alibaba's Qwen Image 2.0 Pro model offering higher quality image generation with enhanced detail and accuracy