Pose & Character Mashup
Four models were given the same image and edit instruction, and the community voted blind on which outputs looked best.
This prompt is challenging because it forces models to do two hard things at once: accurately transfer complex body poses and joint angles from Image 1 while strictly preserving the exact face, identity, and clothing style from Image 2. Most models struggle with unnatural limb distortions, identity blending, incorrect fabric behavior on the new pose, and mismatched lighting, making it a strong test of advanced spatial reasoning and character consistency.
#1 — FLUX.2 [klein] 9B
Challenge Rankings
| # | Model | Elo |
|---|---|---|
| 1 | FLUX.2 [klein] 9B | 1134 |
| 2 | | 1126 |
| 3 | | 1100 |
| 4 | | 997 |
FLUX.2 [klein] 9B leads the category with 1134 Elo, though GPT Image 1 Mini remains highly competitive with a 60.0% win rate and similar $0.005–$0.006 budget pricing. A significant 129-point Elo gap separates the top performers from the mid-tier FLUX.2 [dev] Turbo, illustrating a sharp drop-off in the ability to maintain character consistency across complex pose transfers.
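To put the Elo gaps above in perspective, here is a minimal sketch that converts a rating difference into an expected head-to-head win rate. It assumes the classic logistic Elo model with a 400-point scale; the page does not say which Elo variant or scale the leaderboard actually uses, so the function name `expected_score` and the `scale` parameter are illustrative, not the site's method.

```python
def expected_score(rating_a: float, rating_b: float, scale: float = 400.0) -> float:
    """Expected score of A against B under the classic logistic Elo model.

    Assumption: a 400-point scale, as in standard Elo. The leaderboard's
    real formula is not stated on the page.
    """
    return 1.0 / (1.0 + 10.0 ** ((rating_b - rating_a) / scale))


if __name__ == "__main__":
    # Top two ratings from the rankings table (1134 vs 1126).
    print(f"8-point gap:   {expected_score(1134, 1126):.3f}")
    # The 129-point gap mentioned in the summary, expressed as a bare difference.
    print(f"129-point gap: {expected_score(129, 0):.3f}")
```

Under these assumptions, an 8-point gap is close to a coin flip, while a 129-point gap corresponds to the higher-rated model being preferred in roughly two out of three blind votes.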
Elo vs Cost
Elo vs Speed
Competitors
4 models, ranked by Elo
FLUX.2 [klein] 9B
Highlighted Battles
The most competitive head-to-head matchups, selected by closeness and vote count.