The Soul Gauntlet

Text-to-Video

6 models were given the same prompt, and the community voted blind on which outputs looked best.

This challenge targets one of the hardest remaining frontiers in 2026 video generation: whether models can convey genuine human emotion through subtle facial acting, realistic tear physics, and micro-expressions. Many models can create beautiful faces, but very few deliver emotionally convincing performances without looking uncanny or robotic.

Prompt
Extreme cinematic close-up of a beautiful young woman experiencing deep, raw emotion. Her expression slowly shifts from quiet sorrow to intense cathartic crying — realistic skin texture with visible pores, subtle muscle twitches, glistening tears forming in her eyes and rolling down her cheeks, red-rimmed eyes with natural blinking and micro-expressions of pain and release. Soft dramatic side lighting with gentle rim light highlighting the tears, very shallow depth of field, slight emotional camera push-in during the emotional peak, photorealistic, 8K, intricate skin and eye details, filmic color grading, subtle film grain.
Voters were asked to judge by: Facial Micro-Expressions & Emotion; Skin, Eye & Tear Realism; Emotional Arc & Timing.

Challenge Rankings

6 models
# Model Elo
1 Kling V3 Omni Pro 1187
2 Kling V3 Pro 1150
3 1134
4 P-Video 1116
5 Sora 2 Pro 1065
6 984

Kling V3 Omni Pro dominates the emotional rendering challenge with a 100% win rate and an 1187 Elo, establishing a 122-point lead over Sora 2 Pro. Despite costing significantly less per generation, the mid-tier P-Video (1116 Elo) outperforms Sora 2 Pro in facial nuance while maintaining an 8x speed advantage.
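To give the Elo gaps above a rough intuition, here is a minimal sketch that converts a rating difference into an expected head-to-head preference rate. It assumes the textbook logistic Elo model with a 400-point scale; the page does not say which Elo variant or scale it uses, so `expected_score` and its `scale` parameter are illustrative, not the site's actual method.

```python
def expected_score(rating_a: float, rating_b: float, scale: float = 400.0) -> float:
    """Expected share of votes for A over B under the standard Elo logistic model.

    Assumption: a 400-point scale, as in classic Elo. The leaderboard's real
    formula is not documented in this section.
    """
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / scale))


# Ratings quoted in the summary above.
kling_v3_omni_pro = 1187
p_video = 1116
sora_2_pro = kling_v3_omni_pro - 122  # the 122-point lead quoted above

for name, rating in [("P-Video", p_video), ("Sora 2 Pro", sora_2_pro)]:
    share = expected_score(kling_v3_omni_pro, rating)
    print(f"Kling V3 Omni Pro vs {name}: expected preference {share:.0%}")
```

Under these assumptions, a 122-point lead corresponds to the leader being preferred in roughly two out of three matchups, and a 71-point lead to about three out of five.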

Elo vs Speed

4 models waiting for enough speed data

Competitors

6 models, ranked by Elo
1 Kling V3 Omni Pro
2 Kling V3 Pro
Grok Imagine Video