Grok Imagine Image Pro vs Wan 2.7 Pro

Head-to-head across 4 challenges

Grok Imagine Image Pro

0.0%

win rate

Ties

50.0%

Wan 2.7 Pro

50.0%

win rate

0.0% 50.0% ties 50.0%

Challenge Results

Chalkboard Menu

Text-to-Image

“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”

Grok Imagine Image Pro

Wan 2.7 Pro

AI Judge Analysis

Grok Imagine Image Pro

+ Excellent chalk texture with realistic grit and smudge marks.
+ Highly accurate rendering of the text without repetitions.
+ Natural variation in handwriting that feels authentic to a person writing on a board.

− The 'Today's Specials' title is in a blocky style rather than the requested elegant cursive.

Wan 2.7 Pro

+ Beautiful background composition and lighting in a cafe setting.
+ Clean, legible text layout.
+ Successful execution of the cursive style for the title.

− Text rendering is too clean and lacks the realistic texture of chalk.
− Significant repetition error: 'with Lemon & Herbs -$28' is printed twice for the octopus item.
− The text looks more like a digital font than actual human handwriting.

Verdict: Grok Imagine Image Pro follows the spirit of the prompt much better by providing a realistic, textured chalk feel and accurate text without the repetition errors found in the competitor. While Wan 2.7 Pro has a more aesthetically pleasing background, its failure to handle the menu list correctly and its overly-clean 'digital' font look makes it less successful for this specific task.

Pose & Character Mashup

Editing

Edit instruction

“Use Image 1 as the exact pose reference and Image 2 as the character reference. Recreate the person/character from Image 2 in the exact dynamic pose and body position from Image 1. Keep the exact face, hair, clothing style/details, and expression from Image 2. Match the lighting and environment of Image 1. The final image must show the character from Image 2 performing the precise action/pose from Image 1 with perfect anatomy and natural integration.”

Source

Grok Imagine Image Pro

Wan 2.7 Pro

0% wins 50% ties 50% wins

AI Judge Analysis

Grok Imagine Image Pro

+ Excellent preservation of the pose from Image 1.
+ High visual clarity and resolution.

− Failed the character reference task completely by generating a different woman instead of the man from Image 2.
− Ignored all clothing details from Image 2, keeping the red hoodie from Image 1 instead.

Wan 2.7 Pro

+ Perfectly preserves the layout and pose of Image 1.
+ Maintains consistent lighting and background from the source.

− Failed to incorporate any elements from the character reference in Image 2.
− Only returned a slightly cropped version of Image 1 with no character or clothing changes.

Verdict: Both models failed the specific character transfer task, entirely ignoring the instruction to use the man from Image 2 as the character reference. Grok Imagine Image Pro replaced the woman in Image 1 with a different woman, while Wan 2.7 Pro essentially returned Image 1 with no significant changes to the character or clothing.

Outfit Transfer Challenge

Editing

Edit instruction

“Use Image 1 as the base person. Dress them in the exact elaborate outfit from Image 2 (including all layers, accessories, jewelry, and shoes). Carefully adapt the clothing to the body shape and pose in Image 1 while maintaining realistic fabric behavior, correct proportions, and perfect lighting/shadow matching. Keep the person’s exact face, hair, and background completely unchanged.”

Source

Grok Imagine Image Pro

Wan 2.7 Pro

AI Judge Analysis

Grok Imagine Image Pro

+ Excellent preservation of the subject's face, hair, and vitiligo patterns
+ High-quality rendering of the garment fabrics and embroidery
+ Maintains the exact background and wooden structure from the source

− Completely failed to use the clothing from Image 2, generating a generic royal outfit instead
− Hands are poorly rendered with unnatural fingering and artifacts

Wan 2.7 Pro

+ Preserved the subject's face, hair, and unique skin patterns well
+ Successfully adapted the pose while maintaining background consistency

− Failed to use the clothing from Image 2, instead generating a busy patterned suit
− The scale of the subject relative to the background changed slightly
− Distortion in the hands and shoe rendering

Verdict: Both models failed the specific instruction to use the 'exact' outfit from Image 2, which featured a navy peacoat, plaid scarf, and jeans. Grok Image Imagine Pro generated a regal purple and gold robe, while Wan 2.1 Pro generated a patterned brown suit; Grok is slightly better due to superior facial and skin texture preservation and better integration with the wooden post.

The Capybara Taxi Driver

Text-to-Image

“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”

Grok Imagine Image Pro

Wan 2.7 Pro

AI Judge Analysis

Grok Imagine Image Pro

+ Excellent composition with a wide-angle view showing both the driver and passenger clearly.
+ Superior lighting and photorealism in the Manhattan background.
+ Very high level of detail on the capybara's fur and the taxi dashboard.

− The capybara's hands look slightly more like monkey hands than capybara paws.
− The passenger's expression is more focused/sad than 'bored'.

Wan 2.7 Pro

+ Good adherence to the 'taxi driver cap' style specifically.
+ Captured the bored/disinterested expression of the passenger perfectly.

− Perspective error: the passenger appears to be in the front seat next to the driver rather than the back seat.
− The capybara's head looks pasted onto the body with a visible seam around the neck.
− The hands/paws on the steering wheel are anatomically messy and unnatural.

Verdict: Grok Imagine Image Pro is the clear winner as it correctly places the passenger in the back seat and maintains a high level of photorealistic detail throughout the scene. Wan 2.7 Pro fails the spatial arrangement by putting the passenger in the front seat and has significant issues with the blending of the capybara's head to its body.

Grok Imagine Image Pro

xAI's premium image generation model offering higher fidelity output and stronger performance on single-image editing benchmarks compared to the standard Grok Imagine model

View Model Arena

Wan 2.7 Pro

Alibaba's Wan 2.7 Pro image generation and editing model with higher-quality outputs and support for 4K image generation

View Model Arena