Alibaba's multimodal generation model from the Wan AI suite, supporting text-to-video, image-to-video, reference-to-video with audio, and text-to-image, in both Chinese and English

What Wan 2.6 is good at

Rankings based on human votes in the Lumenfall Arena, where real users pick their favorite without knowing which model made which image. Excels at #8 Photorealism (Text-to-Image) , placing in the top 19% of all competing models.

Wan 2.6 Strengths

Wan 2.6 outperforms most competing models here

Photorealism

Text-to-Image
Ranked #8 of 38 models · 70.0% win rate Leaderboard
Wan 2.6
Same prompt, other models

Other categories

Wan 2.6 competes but doesn't rank as highly

Product, Branding & Commercial

Text-to-Image
Ranked #20 of 29 models · 40.0% win rate Leaderboard
Wan 2.6
Same prompt, other models

Text Rendering

Text-to-Image
Ranked #29 of 39 models · 17.9% win rate Leaderboard
Wan 2.6
Same prompt, other models