Grok Imagine Image vs Qwen Image Edit 2511
Head-to-head across 6 challenges
Grok Imagine Image
44.0%
win rate
Ties
8.0%
Qwen Image Edit 2511
48.0%
win rate
Challenge Results
Man and Car in California
Editing“Make a photo of the man driving the car down the California coastline”
AI Judge Analysis
Grok Imagine Image
- + Excellent preservation of the white Rolls-Royce Phantom Drophead Coupe's external design and details.
- + Highly realistic integration of the car onto a coastal road with motion blur.
- + Accurate depiction of the California coastline environment.
- − Completely failed to use the specific man provided in the second source image, substituting a generic older white male driver.
Qwen Image Edit 2511
- + Successfully preserved the identity, hairstyle, and clothing of the man from the source image.
- + Accurately places the subjects in a California coastline setting as requested.
- + Good composition that focuses on the specific characters provided in the task.
- − Failed to preserve the specific car model, changing the luxury Rolls-Royce to a different convertible with a brown wood interior.
- − The steering wheel and dashboard have some warped geometry/perspective.
Verdict: This comparison highlights a classic trade-off in image editing: Grok Imagine Image perfectly preserved the car but ignored the specific person provided, while Qwen Image Edit 2511 perfectly preserved the person but changed the car. Qwen is the preferred model for this specific task because identifying and placing the person from the second source image is a more complex multi-modal requirement than simply keeping the car, even though it struggled with the interior details of the vehicle.
Bald man challenge
Editing“Give the person a full, thick head of natural hair with realistic texture, density, and a natural hairline. Preserve facial features and lighting.”
AI Judge Analysis
Grok Imagine Image
- + Excellent source preservation, keeping the face and lighting identical
- + Very realistic and natural-looking hair texture and density
- + Perfectly integrated hairline that matches the sideburns
- − The hair is quite short, bordering on a buzz cut rather than 'full'
Qwen Image Edit 2511
- + Successfully added a very thick and full head of hair
- + Maintained the original facial features and background correctly
- − Hair texture and curls look oily or plastic-like
- − The lighting on the hair doesn't perfectly match the environment
- − The hair volume is slightly exaggerated, looking like a wig
Verdict: Grok Imagine produced a much more realistic result by seamlessly integrating the new hair with the existing beard and lighting, though the style is shorter than expected. Qwen Image Edit 2511 followed the 'full and thick' instruction more literally, but the hair texture looks artificial and poorly blended with the rest of the image.
Night Sky Transformation
Editing“Change the scene to night: a deep, dark sky with subtle, glistening stars visible behind the mountain.”
AI Judge Analysis
Grok Imagine Image
- + Excellent source preservation, maintaining the layout and structure of the original town and mountain.
- + Photorealistic star field that looks natural and atmospheric.
- + Seamless lighting transition from sunset to deep night across the landscape.
- − None observed.
Qwen Image Edit 2511
- + Successfully darkened the landscape while keeping the town lights bright.
- + Preserved the composition of the original mountain and town well.
- − The stars appear as a repetitive, artificial grid of white dots that lacks realism.
- − The star patterns show unnatural, straight-line artifacts in the upper sky.
Verdict: Grok Imagine Image provides a superior result, creating a realistic night sky with believable star density and natural lighting that integrates perfectly with the preserved mountain landscape. Qwen Image Edit 2511 succeeds in preserving the source image but fails on the prompt's specific request for stars, instead generating a highly artificial-looking grid of white pixels that ruins the visual quality.
Studio Ghibli Anime Style
Editing“Transform this photo into a Studio Ghibli–inspired illustration. Use soft pastel colors, hand-painted textures, gentle lighting, dreamy backgrounds, and a warm, nostalgic mood”
AI Judge Analysis
Grok Imagine Image
- + Excellent capture of the Ghibli art style with hand-painted textures and soft line work.
- + Preserves the composition and poses of the iconic source image perfectly.
- + Creates a beautiful, dreamy watercolor-style background that fits the nostalgic theme.
- − The facial features are simplified significantly compared to the original subjects.
Qwen Image Edit 2511
- + Successfully translates the image into a clean anime illustration style.
- + Retains more recognizable facial likeness to the original people in the photo.
- + Maintains high clarity and clean linework throughout the composition.
- − The style leans more toward generic modern digital anime rather than the specific 'Ghibli' hand-painted look requested.
- − The lighting is flat compared to the soft, atmospheric glow in Model A.
Verdict: Both models successfully interpreted the scene into an illustration, but Grok Imagine Image (Model A) followed the specific 'Studio Ghibli' and 'hand-painted' instructions much more accurately with its watercolor textures and soft edges. While Qwen Image Edit 2511 (Model B) preserved the likenesses of the subjects better, it lacked the specific artistic charm and nostalgic mood requested in the prompt.
Neutral Expression to Genuine Smile
Editing{
"action": "image_edit",
"reference": "uploaded neutral portrait",
"change": "Warm genuine Duchenne smile: lips curved up, slight natural teeth, soft eye crinkles, subtle cheek raise",
"details": "Realistic smiling skin (dimples if present, soft cheek shadows), slightly brighter eyes; keep exact eye shape/color/iris",
"preserve_exact": "Face identity/structure, eyes/nose/lips/eyebrows, hair, skin texture/pores/freckles, makeup, clothing, head pose, background, lighting, shadows, framing",
"no_changes": "No face shape change, no new features, no gaze shift, no hair/clothing/lighting/background edits",
"style": "Ultra-photorealistic 8K portrait, sharp face focus, natural soft lighting, realistic skin glow"
}
AI Judge Analysis
Grok Imagine Image
- + Excellent preservation of identity and background elements.
- + The smile is very natural and subtle, maintaining the lip shape well.
- + Highly realistic skin texture remains consistent with the source.
- − The smile is a bit reserved and lacks the requested 'eye crinkles' for a full Duchenne effect.
- − Teeth are very barely visible compared to the prompt requirements.
Qwen Image Edit 2511
- + Successfully produces a warmer, more expressive Duchenne smile with eye crinkles.
- + Excellent rendering of teeth and natural folding around the mouth.
- + Maintains very high source preservation across the rest of the image.
- − The eyes are shut slightly more than the original to achieve the crinkle, which slightly alters the eye shape.
- − Slightly more smoothing on the cheeks compared to the source.
Verdict: Both models performed exceptionally well at preserving the identity and composition of the original image. Qwen Image Edit 2511 is the winner because it more accurately captured the 'Duchenne' aspect of the smile, including the characteristic eye crinkles and visible teeth, whereas Grok Imagine Image produced a smile that was a bit too subtle.
Golden Hour Stroll
Editing“Add dynamic motion to this photo: make hair blow in the wind, add leaves flying, energetic and lively feel.”
AI Judge Analysis
Grok Imagine Image
- + Successfully added a large volume of falling leaves
- + Effectively modified the hair and dog's ears to show wind direction
- + High source preservation with minimal facial distortion
- − The wind effect on the hair is slightly less dramatic than the competitor
- − Added leaves appear slightly flat and repetitive in texture
Qwen Image Edit 2511
- + Excellent dynamic hair movement with a more realistic 'blown' appearance
- + Leaves show varied colors (green and orange) and motion blur for energy
- + Perfect preservation of the original subjects and background
- − Fewer leaves overall compared to Model A
- − A few digital artifacts where the hair strands meet the background
Verdict: Both models followed the instructions well, preserving the source image while adding motion. Qwen Image Edit 2511 is the winner due to the superior quality of the 'hair blowing' effect and the use of motion blur on the leaves, which creates a much more energetic feel than Grok Imagine Image's static-looking leaves.
Grok Imagine Image
An image generation model by xAI designed to generate highly aesthetic images from text descriptions.
Qwen Image Edit 2511
Alibaba's Qwen image editing model for instruction-based image modifications and transformations