GPT Image 1 is OpenAI's previous-generation image model. It accepts both text and image inputs and produces image outputs.
These two have not faced off in a shared challenge yet. Here is how their skills stack up, side by side.
GPT Image 1
#19 of 23 in Image Editing
Not enough comparable category data
The chart appears once both models have ratings across at least three shared arena categories.
Wan 2.6
#23 of 44 in Text-to-Image
The skill signature above is the honest read for now. Cast a vote in the arena to start putting them head to head.
Explore each model
Wan 2.6 is Alibaba's multimodal generation model from the Wan AI suite. It supports text-to-video, image-to-video, reference-to-video with audio, and text-to-image generation, in both Chinese and English.