Outfit Transfer Challenge
Vote8 models were given the same image and edit instruction, and the community voted blind on which outputs looked best. How it works
This is extremely practical (think e-commerce, fashion, virtual dressing rooms). It’s visually very obvious when it fails, and success looks impressive. It tests a model’s understanding of clothing physics, body shape, and lighting.
#1 — Nano Banana Pro
Challenge Rankings
| # | Model | Elo |
|---|---|---|
| 1 | 1182 | |
| 2 | 1135 | |
| 3 | 1132 | |
| 4 | 1127 | |
| 5 | 1095 | |
| 6 | 1082 | |
| 7 | 1062 | |
| 8 | 996 |
Nano Banana Pro leads the leaderboard with a 100% win rate and an Elo of 1182, establishing a 47-point gap over the budget-tier FLUX.2 [klein] 9B. Despite its low price point and nearly 10x faster generation speed, FLUX.2 [klein] 9B (1135 Elo) narrowly outperforms expensive competitors like GPT Image 2 and Qwen Image Edit Latest in complex fashion transfer tasks.
Elo vs Cost
Elo vs Speed
Competitors
8 models, ranked by EloQwen Image Edit Latest
Highlighted Battles
The most competitive head-to-head matchups, selected by closeness and vote count.