Alibaba's multimodal generation model from the Wan AI suite, supporting text-to-video, image-to-video, reference-to-video with audio, and text-to-image, in both Chinese and English

What Wan 2.6 is good at

Rankings based on human votes in the Lumenfall Arena, where real users pick their favorite without knowing which model made which image. Excels at #1 Cinematic (Image-to-Video) , placing in the top 1% of all competing models.

Wan 2.6 Strengths

Wan 2.6 outperforms most competing models here

Cinematic

Image-to-Video
Ranked #1 of 2 models · 25.0% win rate Leaderboard
#1
of 2 models · 25.0% wins
Ranked using Celebrity Arrival