Head to head

FLUX.2 [max] (Black Forest Labs) vs. Qwen Image 2.0 Pro (Alibaba)

Settled by community votes across 1 shared challenge, with an AI judge weighing in on each.

FLUX.2 [max]

25.9 arena score

#11 of 44 in Text-to-Image

Skill signature

Not enough comparable category data

The chart appears once both models have ratings across at least three shared arena categories.

Qwen Image 2.0 Pro

22.3 arena score

#27 of 44 in Text-to-Image

Vote tally

Where the votes landed

FLUX.2 [max]: 0% win rate

Ties: 0%

Qwen Image 2.0 Pro: 0% win rate

Shared challenges: 1

Challenge by challenge

The strongest take from each model on every shared challenge, with the AI judge's read.

The Capybara Taxi Driver

Text-to-Image

“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”

[Images: FLUX.2 [max] output and Qwen Image 2.0 Pro output]

AI Judge Analysis

FLUX.2 [max]

  • + Excellent photographic lighting and depth of field
  • + High resolution textures on the capybara's fur and the jacket
  • - The capybara has human hands instead of front paws
  • - The camera angle makes it look like the businesswoman is sitting in the passenger seat rather than the back seat

Qwen Image 2.0 Pro

  • + Successfully depicts the capybara with its actual paws on the wheel
  • + Better spatial composition with the passenger clearly in the back seat
  • + Included relevant NYC taxi details like the TLC license sticker
  • - The passenger's hands and phone interaction are slightly warped
  • - The capybara's cap looks a bit small and sits awkwardly on its head

Verdict: FLUX.2 [max] initially appears to have higher visual fidelity, but it fails significantly on anatomical logic by giving the capybara human hands. Qwen Image 2.0 Pro followed the prompt much more accurately: it depicts the capybara with paws, places the passenger correctly in the back seat, and captures the specific atmosphere of a New York taxi.
