Text-to-Image Prompt Adherence Aesthetics Creativity Photorealism Art

The Reversed Rodeo

This competition tests how well AI image models truly understand language versus how much they rely on visual habits from their training data. The prompt is deliberately simple on the surface but devilishly hard in practice. Most models default to the familiar trope of an astronaut riding a horse. By forcing the reversal, we measure three critical capabilities that separate good models from great ones:

Strict instruction following (including negations)
Accurate subject-object relationships and spatial hierarchy
Resistance to strong dataset biases

Voters judged on

Horse actually on the back of the astronaut one Horse and Astronaut Cinematic atmosphere

Blind Vote This Challenge How voting works

Winner

#1 GPT Image 2

Prompt

“Horse riding astronaut in space — horse on top, not vice versa. Surreal, highly detailed, cinematic.”

The leaderboard

Challenge rankings

17 models · ranked by blind vote

Through time: how it evolved

Stable Diffusion 3.5 Medium

score 3.8

0

Head to head 4 matchups

Notable battles

The matchups worth a second look from this challenge's blind voting: the closest rivalries, the biggest upsets (a lower seed taking down a favorite), and the clashes at the top of the board. Each bar shows how the community split its votes.

Qwen Image 2.0 GPT Image 2

20% 80%

Nano Banana Pro GPT Image 1.5

0% 0%

DALL-E 2 FLUX.1 [schnell] FP8

0% 0%

Stable Diffusion 3.5 Medium Wan 2.7

50% 0%

Want to compare other models? Pick any two and see them go head to head.

The archive

Through time

How The Reversed Rodeo has evolved, release by release.

2022 to 2026· 5 reigns· 17 models

The Reign Chain

5 reigns

How the crown changed hands

Every model that has held #1 on this challenge, in order. Click any era to travel through time.

R01

Inception

DALL-E 2
Apr 2022 → Oct 2023

reigned 1.5 yrs
handoff
R02

DALL-E 3
Oct 2023 → Nov 2025

reigned 2.1 yrs · overtook DALL-E 2
handoff
R03

Gemini 3 Pro Image Preview
Nov 2025 → Dec 2025

reigned 1 mo · overtook DALL-E 3
handoff
R04

Seedream 4.5
Dec 2025 → Apr 2026

reigned 4 mo · overtook Gemini 3 Pro Image Preview
handoff
R05

Current

GPT Image 2
Apr 2026 → today

reigned 2 mo · overtook Seedream 4.5

Generation history

17 models · newest first

Show full history 12

Apr 2026

GPT Image 2 3 takes
Apr 2026

Wan 2.7 3 takes
Feb 2026

Gemini 3.1 Flash Image Preview 3 takes
Feb 2026

Recraft V4 3 takes
Feb 2026

Recraft V4 Pro 3 takes
12 more models Apr 2022 – Feb 2026 Show