ARENA Leaderboard
See how AI image models stack up against each other. How it works
Which model creates the best videos from text?
Ranked by blind votes in side-by-side matchups. Voters watch the videos, not the model names.
Best AI Models for Text To Video
| # | Model | Elo |
|---|---|---|
| 1 | 1185 | |
| 2 | 1180 | |
| 3 | 1136 | |
| 4 | 1120 | |
| 5 | 1112 | |
| 6 | 1107 |
As of June 2026, Seedance 2.0 holds a narrow lead with 1185 Elo and a dominant 72.7% win rate, maintaining a slim 4-point margin over Sora 2 Pro (1181 Elo). While the top tier is characterized by premium pricing and slow generation speeds, Alibaba’s Wan 2.6 (1112 Elo) secures the fifth position at a significant discount, costing 83% less per generation than the top-ranked model. Despite the performance gap at the summit, the race for efficiency is tightening as P-Video (1110 Elo) delivers a top-six performance with the fastest relative speeds in the tier.
Elo vs Cost
Elo vs Speed
Challenges
Neon Rain Reverie Text-to-Video
The Will Smith Spaghetti Challenge Text-to-Video
The Soul Gauntlet Text-to-Video
The Rubik's Gauntlet Text-to-Video
FAQ
What is the best AI text to video model?
Based on blind community voting, Seedance 2.0 is currently the #1 ranked AI text to video model with an Elo rating of 1185. Rankings update in real time as new votes come in.
How are AI text to video models ranked on Lumenfall?
Lumenfall Arena ranks AI models through blind community voting. In each matchup, two models generate from the same prompt and voters pick the better result without seeing model names. Votes are processed using TrueSkill, a Bayesian rating algorithm developed by Microsoft Research, that produces a single Elo score reflecting each model's relative quality.
What is an Elo rating for AI models?
An Elo rating is a numerical score representing a model's skill relative to other models. Under the hood, Lumenfall uses TrueSkill, which tracks two values per model: mu (estimated skill) and sigma (uncertainty). The displayed Elo is calculated as 1000 + 10 x (mu - 3*sigma), a conservative lower bound. A model must prove itself consistently across many matchups to earn a high rating.