The Capybara Taxi Driver
Vote
Text-to-Image
Photorealism
All models were given the same prompt, and the community voted blind on which outputs looked best. How it works
This challenge seems to be difficult for models because it mixes reality with fiction. Most models struggle to keep the taxi realistic or loose instructions like placing the passenger not in the backseat.
Prompt
“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel.
In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”
Voters were asked to judge by
Photorealistic
Realistic NYC taxi interior + night atmosphere
Positioning of passenger
Capybara wearing taxi driver cap