13 models were given the same image and edit instruction, and the community voted blind on which outputs looked best.
How it works
#1 — Wan 2.6
Source Image
Golden Hour Stroll
Edit instruction
“ Add dynamic motion to this photo: make hair blow in the wind, add leaves flying, energetic and lively feel. ”
Challenge Rankings
13 models
#
Model
Price
¢/img
Speed
Elo
1
$$
3¢
1269
2
$$
4¢
1265
3
$$$
6.7¢
1230
4
$
0.9¢
1226
5
$$
3¢
1202
6
$$
4¢
1199
7
$$
3.9¢
1199
8
$$$
7¢
1194
9
$$
3.5¢
1193
10
$$
3¢
1177
11
$$
1.5¢
1175
12
$$
2¢
1170
13
$$
3¢
1136
Wan 2.6 (1269 Elo) and Seedream 4.5 (1265 Elo) lead the photorealistic editing challenge with a 35+ point Elo gap over the rest of the field, while GPT Image 1.5 (1226 Elo) offers a competitive 77.8% win rate at roughly 25% of the cost of the top performers. Despite the premium price tier of Nano Banana Pro (1230 Elo) and Grok Imagine Image Pro (1194 Elo), they fail to surpass the performance of Alibaba and ByteDance's mid-tier priced models.
4 models waiting for enough speed data
Competitors
13 models, ranked by Elo
Wan 2.6
3 attempts · best result used for ranking
Best
1269
Elo
67%W
·
17%L
·
16%T
1244
Elo
81%W
·
19%L
·
0%T
1227
Elo
69%W
·
23%L
·
8%T
Seedream 4.5
3 attempts · best result used for ranking
Best
1265
Elo
86%W
·
14%L
·
0%T
1204
Elo
75%W
·
17%L
·
8%T
1164
Elo
63%W
·
38%L
·
-1%T
Nano Banana Pro
3 attempts · best result used for ranking
Best
1230
Elo
77%W
·
23%L
·
0%T
1191
Elo
69%W
·
31%L
·
0%T
1180
Elo
67%W
·
25%L
·
8%T
GPT Image 1.5
3 attempts · best result used for ranking
Best
1226
Elo
88%W
·
13%L
·
-1%T
1181
Elo
67%W
·
33%L
·
0%T
1177
Elo
86%W
·
14%L
·
0%T
FLUX.2 [max]
3 attempts · best result used for ranking
Best
1202
Elo
50%W
·
50%L
·
0%T
1180
Elo
11%W
·
78%L
·
11%T
1154
Elo
64%W
·
36%L
·
0%T
Reve Image 1.0
3 attempts · best result used for ranking
Best
1199
Elo
54%W
·
42%L
·
4%T
1070
Elo
20%W
·
70%L
·
10%T
1032
Elo
8%W
·
92%L
·
0%T
Nano Banana
3 attempts · best result used for ranking
Best
1199
Elo
71%W
·
29%L
·
0%T
1141
Elo
50%W
·
40%L
·
10%T
1109
Elo
40%W
·
50%L
·
10%T
Grok Imagine Image Pro
3 attempts · best result used for ranking
Best
1194
Elo
58%W
·
42%L
·
0%T
1134
Elo
33%W
·
56%L
·
11%T
1104
Elo
33%W
·
67%L
·
0%T
Seedream 5.0 Lite
3 attempts · best result used for ranking
Best
1193
Elo
50%W
·
50%L
·
0%T
1155
Elo
56%W
·
44%L
·
0%T
1091
Elo
50%W
·
50%L
·
0%T
Qwen Image Edit 2511
3 attempts · best result used for ranking
Best
1177
Elo
33%W
·
63%L
·
4%T
1171
Elo
30%W
·
70%L
·
0%T
987
Elo
0%W
·
100%L
·
0%T
FLUX.2 [pro]
3 attempts · best result used for ranking
Best
1175
Elo
33%W
·
67%L
·
0%T
1163
Elo
39%W
·
61%L
·
0%T
1044
Elo
17%W
·
83%L
·
0%T
Grok Imagine Image
3 attempts · best result used for ranking
Best
1170
Elo
43%W
·
57%L
·
0%T
1147
Elo
29%W
·
71%L
·
0%T
1142
Elo
36%W
·
57%L
·
7%T
Seedream 4.0
3 attempts · best result used for ranking
Best
1136
Elo
28%W
·
72%L
·
0%T
1064
Elo
17%W
·
83%L
·
0%T
1048
Elo
17%W
·
83%L
·
0%T
Original
Original 100%
Upscaled 100%
×
Hover the original to compare crops at 100%
Drag on the original to compare crops at 100%
RESULT
The most competitive head-to-head matchups, selected by closeness and vote count.