The Halloween Invitation
Vote7 models were given the same prompt, and the community voted blind on which outputs looked best. How it works
While rendering simple text is no longer a real challenge for today’s SOTA models, this test shows something more interesting: how much visual taste a model has, and whether it can create a layout that feels like it came from a professional designer instead of a basic Canva template (no offense to Canva).
#1 — FLUX.2 [klein] 4B
Challenge Rankings
| # | Model | Elo |
|---|---|---|
| 1 | 1152 | |
| 2 | 1126 | |
| 3 | 1099 | |
| 4 | 1081 | |
| 5 | 1080 | |
| 6 | 1079 | |
| 7 | 1048 |
FLUX.2 [klein] 4B leads the challenge with an 1152 Elo, demonstrating superior design layout and text integration at a fraction of the cost ($0.001/img) of its high-end competitors. Despite a higher 75% win rate, Nano Banana Pro trails by 26 Elo points while being over 60 times more expensive and significantly slower.
Elo vs Cost
Elo vs Speed
Competitors
7 models, ranked by EloGLM-Image
Playground coming soonHighlighted Battles
The most competitive head-to-head matchups, selected by closeness and vote count.