OpenAI's state-of-the-art image generation model with arbitrary resolution up to 4K and strong instruction following
These two have not faced off in a shared challenge yet. Here is how their skills stack up, side by side.
GPT Image 2
28.4
arena score
#2 of 48 in Text-to-Image
Top 2 in Text-to-Image
Top 3 in Image Editing
Skill signature
· Text-to-Image
Grok Imagine Video
13.7
arena score
#6 of 7 in Text-to-Video
Top 3 in Image-to-Video
Not yet settled
GPT Image 2 and Grok Imagine Video have not faced off in a shared challenge yet.
The skill signature above is the honest read for now. Cast a vote in the arena to start putting them head to head.
Next steps
Explore each model
xAI's video generation model based on the Aurora architecture, supporting text-to-video, image-to-video, and video editing with native audio-visual synthesis at up to 720p