The Capybara Taxi Driver

Vote
Text-to-Image Photorealism

15 models were given the same prompt, and the community voted blind on which outputs looked best. How it works

This challenge seems to be difficult for models because it mixes reality with fiction. Most models struggle to keep the taxi realistic or loose instructions like placing the passenger not in the backseat.

Prompt
Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.
Voters were asked to judge by Photorealistic Realistic NYC taxi interior + night atmosphere Positioning of passenger Capybara wearing taxi driver cap

Challenge Rankings

15 models
# Model Elo
1 1210
2 1180
3 1158
4 1150
5 1138
6 1133
7 1133
8 1128
9 1112
10 1110
11 1103
12 1098
13 1087
14 991
15 989

GPT Image 2 leads with a 1210 Elo and 71.4% win rate, though the budget-friendly Z-Image Turbo (1180 Elo) remains highly competitive at nearly one-eighth of the price and significantly faster generation speeds. Seedream 4.5 and Nano Banana Pro share the highest individual win rate of 87.5%, demonstrating superior handling of the complex passenger-placement constraints compared to lower-ranked premium models.

1 model without pricing omitted

Elo vs Speed

6 models waiting for enough speed data

Competitors

15 models, ranked by Elo
1

GPT Image 2

Try in Playground →
2

Z-Image Turbo

Try in Playground →
3

Seedream 4.5

Try in Playground →

Nano Banana Pro

Try in Playground →

Wan 2.6

Try in Playground →

Nano Banana 2

Try in Playground →

Wan 2.7 Pro

Try in Playground →

GPT Image 1 Mini

Try in Playground →

FLUX.2 [dev] Turbo

Try in Playground →

Qwen Image 2.0 Pro

Try in Playground →

Wan 2.7

Try in Playground →

Grok Imagine Image Pro

Try in Playground →

Recraft V4 Pro

Try in Playground →

DALL-E 2

Try in Playground →

DALL-E 3

Try in Playground →