OpenAI's previous image generation model that accepts both text and image inputs and produces image outputs
Settled by community votes on 1 shared challenge, with an AI judge weighing in.
GPT Image 1
#19 of 23 in Image Editing
Not enough comparable category data
The chart appears once both models have ratings across at least three shared arena categories.
Wan 2.7 Pro
#29 of 44 in Text-to-Image
Where the votes landed
GPT Image 1: 0% win rate
Ties: 0%
Wan 2.7 Pro: 0% win rate
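The page does not state how these percentages are derived. As a minimal sketch, assuming each share is that outcome's votes divided by all votes cast, and assuming an empty tally is displayed as 0% across the board (one possible reading of the three 0% figures above), the calculation might look like this. The function name and its inputs are hypothetical, not part of the site.

```python
def vote_shares(wins_a: int, wins_b: int, ties: int) -> dict[str, float]:
    """Return percentage shares for model A wins, ties, and model B wins.

    Assumption: with no votes recorded, every share is reported as 0.0
    rather than raising a division-by-zero error.
    """
    total = wins_a + wins_b + ties
    if total == 0:
        return {"a": 0.0, "ties": 0.0, "b": 0.0}
    return {
        "a": round(100 * wins_a / total, 1),
        "ties": round(100 * ties / total, 1),
        "b": round(100 * wins_b / total, 1),
    }


# Hypothetical tallies, not data from the page.
print(vote_shares(0, 0, 0))
print(vote_shares(3, 1, 1))
```

Under this reading, a win rate of 0% for both models alongside 0% ties would simply mean that no community votes have been counted yet, which is separate from the AI judge's verdict.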
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Outfit Transfer Challenge
Editing: “Use Image 1 as the base person. Dress them in the exact elaborate outfit from Image 2 (including all layers, accessories, jewelry, and shoes). Carefully adapt the clothing to the body shape and pose in Image 1 while maintaining realistic fabric behavior, correct proportions, and perfect lighting/shadow matching. Keep the person’s exact face, hair, and background completely unchanged.”
AI Judge Analysis
GPT Image 1
- + Perfectly replicates the specific clothing, scarf, and watch from Image 2
- + Maintains the person's facial features and distinctive vitiligo patterns with high accuracy
- + Seamlessly integrates the clothing into the lighting and pose of the scene
Wan 2.7 Pro
- + Preserves the full-body composition and background of the original image
- + Maintains the vitiligo markings on both the face and hands consistently
- − Completely failed to use the outfit from Image 2, generating a generic gold-patterned jacket instead
- − The scale of the person relative to the beach structure is slightly altered
Verdict: GPT Image 1 followed the instructions nearly perfectly, accurately transplanting the exact coat, plaid scarf, and watch from Image 2 onto the subject while keeping the subject's face and hair unchanged. Wan 2.7 Pro failed the core task of the prompt by generating an entirely different outfit that was not present in any of the source images.
Explore each model
Alibaba's Wan 2.7 Pro image generation and editing model with higher-quality outputs and support for 4K image generation