OpenAI's previous image generation model that accepts both text and image inputs and produces image outputs
Settled by community votes across 1 shared challenge, with an AI judge weighing in on each.
GPT Image 1
#19 of 23 in Image Editing
Not enough comparable category data
The chart appears once both models have ratings across at least three shared arena categories.
Grok Imagine Image
#19 of 44 in Text-to-Image
Where the votes landed
GPT Image 1
0%
win rate
Ties
0%
Grok Imagine Image
0%
win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Outfit Transfer Challenge
Editing“Use Image 1 as the base person. Dress them in the exact elaborate outfit from Image 2 (including all layers, accessories, jewelry, and shoes). Carefully adapt the clothing to the body shape and pose in Image 1 while maintaining realistic fabric behavior, correct proportions, and perfect lighting/shadow matching. Keep the person’s exact face, hair, and background completely unchanged.”
AI Judge Analysis
GPT Image 1
- + Matches the outfit from Image 2 accurately
- + Preserves the face and general background elements
- − The lighting on the person is too moody compared to the bright beach background
- − Subtle changes were made to the hair texture and facial proportions
Grok Imagine Image
- + Successfully preserves the original subject's exact face, hair, and lighting
- + Maintains the bright, original beach background perfectly
- − Completely ignored the reference outfit in Image 2
- − Generated a random royal costume instead of the requested attire
Verdict: GPT Image 1 correctly followed the core instruction to use the clothing from Image 2, though it slightly altered the subject's face and lighting. Grok Imagine Image failed the task by creating a generic elaborate outfit unrelated to the reference image, despite preserving the person better.
Explore each model
An image generation model by xAI designed to generate highly aesthetic images from text descriptions.