Arena
Man and Car in California
13 models were given the same image and edit instruction, and the community voted blind on which outputs looked best.
How it works
#1 — GPT Image 1.5
Source Image
Man and Car in California
Source Image
Man and Car in California
Edit instruction
“ Make a photo of the man driving the car down the California coastline ”
Challenge Rankings
13 models
#
Model
Price
¢/img
Speed
Elo
1
$
0.9¢
1294
2
$$$
6.7¢
1292
3
$$
3¢
1273
4
$$
3¢
1268
5
$$
2.2¢
1267
6
$$
3.9¢
1266
7
$$
1.5¢
1260
8
$$$
6¢
1251
9
$$
3¢
1242
10
$$
4¢
1219
11
$$
3.5¢
1192
12
$$
2¢
1173
13
$$
3¢
1134
GPT Image 1.5 secures the top position with an Elo of 1294, despite being roughly 85% cheaper than the second-place Nano Banana Pro (Elo 1292). While Qwen Image Edit 2511 maintains a competitive 1273 Elo, it offers a significant speed advantage, processing images nearly four times faster than the two leading models.
4 models waiting for enough speed data
Competitors
13 models, ranked by Elo
GPT Image 1.5
3 attempts · best result used for ranking
Best
1294
Elo
77%W
·
21%L
·
2%T
1217
Elo
50%W
·
50%L
·
0%T
1188
Elo
29%W
·
65%L
·
6%T
Nano Banana Pro
3 attempts · best result used for ranking
Best
1292
Elo
75%W
·
25%L
·
0%T
1261
Elo
67%W
·
33%L
·
0%T
1216
Elo
42%W
·
54%L
·
4%T
Qwen Image Edit 2511
3 attempts · best result used for ranking
Best
1273
Elo
57%W
·
41%L
·
2%T
1260
Elo
58%W
·
39%L
·
3%T
1240
Elo
55%W
·
43%L
·
2%T
Wan 2.6
3 attempts · best result used for ranking
Best
1268
Elo
68%W
·
29%L
·
3%T
1265
Elo
70%W
·
24%L
·
6%T
1239
Elo
61%W
·
35%L
·
4%T
Nano Banana 2
3 attempts · best result used for ranking
Best
1267
Elo
64%W
·
31%L
·
5%T
1167
Elo
31%W
·
46%L
·
23%T
1152
Elo
21%W
·
79%L
·
0%T
Nano Banana
3 attempts · best result used for ranking
Best
1266
Elo
60%W
·
36%L
·
4%T
1233
Elo
50%W
·
47%L
·
3%T
1140
Elo
18%W
·
77%L
·
5%T
FLUX.2 [pro]
3 attempts · best result used for ranking
Best
1260
Elo
61%W
·
39%L
·
0%T
1232
Elo
43%W
·
43%L
·
14%T
1172
Elo
21%W
·
73%L
·
6%T
FLUX.2 [flex]
3 attempts · best result used for ranking
Best
1251
Elo
57%W
·
38%L
·
5%T
1245
Elo
54%W
·
42%L
·
4%T
1244
Elo
63%W
·
34%L
·
3%T
FLUX.2 [max]
3 attempts · best result used for ranking
Best
1242
Elo
47%W
·
51%L
·
2%T
1158
Elo
20%W
·
76%L
·
4%T
1085
Elo
5%W
·
95%L
·
0%T
Seedream 4.5
3 attempts · best result used for ranking
Best
1219
Elo
34%W
·
64%L
·
2%T
1177
Elo
21%W
·
68%L
·
11%T
1158
Elo
29%W
·
67%L
·
4%T
Seedream 5.0 Lite
3 attempts · best result used for ranking
Best
1192
Elo
28%W
·
69%L
·
3%T
1170
Elo
24%W
·
71%L
·
5%T
1104
Elo
17%W
·
83%L
·
0%T
Grok Imagine Image
3 attempts · best result used for ranking
Best
1173
Elo
28%W
·
67%L
·
5%T
1165
Elo
22%W
·
67%L
·
11%T
1146
Elo
18%W
·
82%L
·
0%T
Seedream 4.0
3 attempts · best result used for ranking
Best
1134
Elo
10%W
·
88%L
·
2%T
1010
Elo
0%W
·
100%L
·
0%T
990
Elo
0%W
·
100%L
·
0%T
Original
Original 100%
Upscaled 100%
×
Hover the original to compare crops at 100%
Drag on the original to compare crops at 100%
RESULT
The most competitive head-to-head matchups, selected by closeness and vote count.