Grok Imagine Image vs Wan 2.7 Pro

Head-to-head across 5 challenges

Grok Imagine Image

50.0%

win rate

Ties

50.0%

Wan 2.7 Pro

0.0%

win rate

50.0% 50.0% ties 0.0%

Challenge Results

Magic Burger Explosion: Fiery Photorealism Challenge

Text-to-Image

“Ad for 'Magic Burger'. Dynamic, exploded burger with all components (bun, patty, cheese, lettuce, tomato, sauce) suspended in mid-air. Emphasize photorealistic detail and a sense of motion. Dark, fiery background with glowing embers. Integrate text: 'MAGIC BURGER' as a prominent title, 'LIMITED TIME ONLY' as a secondary message, and '€6.99' in a starburst, all rendered with a fiery, glowing effect.”

Grok Imagine Image
Wan 2.7 Pro
100% wins 0% ties 0% wins

AI Judge Analysis

Grok Imagine Image

  • + Excellent typography rendering with perfect spelling of all three requested text elements.
  • + High-quality textures on the patty and vegetables enhance the 'photorealistic' feel.
  • + Dynamic composition with splashes of sauce and flying embers creates a strong sense of motion.
  • The pricing starburst looks a bit like a flat clip-art element compared to the 3D burger components.

Wan 2.7 Pro

  • + Strong 'fiery' glow effect on the main title text.
  • + Clean ingredient separation with good lighting consistency.
  • + Effective use of negative space to make the burger components pop.
  • Completely failed to include the secondary 'LIMITED TIME ONLY' text and '€6.99' price starburst.
  • The bottom bun looks slightly flat and lacks the same texture detail as the top bun.
  • The background effect is less integrated with the foreground elements compared to Model A.

Verdict: Grok Imagine is the clear winner as it successfully followed all instructions, including the rendering of multiple specific text fields and the price starburst. Wan 2.7 Pro produced an aesthetically pleasing image but omitted more than half of the text requirements, making it fail as a complete advertisement.

Chalkboard Menu

Text-to-Image

“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”

Grok Imagine Image
Wan 2.7 Pro

AI Judge Analysis

Grok Imagine Image

  • + Excellent chalk texture with realistic smudges and strokes
  • + Accurate spelling and completion of the truncated prompt text
  • + Highly realistic natural variations in handwriting styles
  • The date text at the top is not in cursive as requested

Wan 2.7 Pro

  • + Clean layout with decorative elements inside a wooden frame
  • + Good spatial arrangement of the text
  • Text looks like a digital font rather than natural chalk handwriting
  • Significant repetition error on the second menu item
  • Failed to use cursive for the title

Verdict: Grok Imagine followed the prompt instructions much better, providing realistic chalk textures and finishing the cut-off text for the final menu item flawlessly. Wan 2.7 Pro struggled with text generation, producing a duplicate line for the second item and utilizing a style that looked like a digital font rather than the requested handwritten chalk.

Pose & Character Mashup

Editing
Edit instruction

“Use Image 1 as the exact pose reference and Image 2 as the character reference. Recreate the person/character from Image 2 in the exact dynamic pose and body position from Image 1. Keep the exact face, hair, clothing style/details, and expression from Image 2. Match the lighting and environment of Image 1. The final image must show the character from Image 2 performing the precise action/pose from Image 1 with perfect anatomy and natural integration.”

Source
Grok Imagine Image
Wan 2.7 Pro
0% wins 100% ties 0% wins

AI Judge Analysis

Grok Imagine Image

  • + Excellent preservation of the source background and lighting.
  • Completely failed to perform the edit, returning the original Image 1.
  • Zero adherence to the character reference from Image 2.

Wan 2.7 Pro

  • + Successfully preserved the original Image 1 without degradation.
  • Totally ignored the request to swap characters.
  • Failed to incorporate any elements from the character reference (sunglasses, scarf, male face).

Verdict: Both Grok Imagine and Wan 2.7 Pro completely failed this image editing challenge. They both returned a copy of the source pose image (Image 1) without attempting to integrate the character, facial features, or clothing from Image 2.

Outfit Transfer Challenge

Editing
Edit instruction

“Use Image 1 as the base person. Dress them in the exact elaborate outfit from Image 2 (including all layers, accessories, jewelry, and shoes). Carefully adapt the clothing to the body shape and pose in Image 1 while maintaining realistic fabric behavior, correct proportions, and perfect lighting/shadow matching. Keep the person’s exact face, hair, and background completely unchanged.”

Source
Grok Imagine Image
Wan 2.7 Pro

AI Judge Analysis

Grok Imagine Image

  • + Successfully preserved the subject's face and unique skin patterns.
  • + Excellent lighting and shadow integration on the new garment.
  • + High-quality rendering of fabric texture and ornate jewelry.
  • Completely failed to use the clothing from Image 2, substituting an unrelated royal tunic.
  • The background wood structure's lower sections were altered/removed.

Wan 2.7 Pro

  • + Successfully preserved the subject's face and skin patterns.
  • + Accurately captured the full-body pose of the person in the original environment.
  • + Provided a complete head-to-toe outfit including trousers and shoes.
  • Failed to use the clothing from Image 2, providing a patterned blazer instead of the coat and scarf.
  • Substantially changed the wooden lifeguard tower structure in the background.

Verdict: Both models failed significantly on the primary instruction to use the 'exact elaborate outfit from Image 2' (a navy coat and plaid scarf), instead hallucinating different 'elaborate' outfits. Grok Imagine (Image A) provided much better lighting and blending of the garment, while Wan 2.7 Pro (Image B) captured the full body and pose more effectively, despite both models altering the background structure.

The Capybara Taxi Driver

Text-to-Image

“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”

Grok Imagine Image
Wan 2.7 Pro

AI Judge Analysis

Grok Imagine Image

  • + Excellent photorealism in the skin and fur textures
  • + Accurately places the passenger in the back seat as requested
  • + Strong adherence to the 'bored expression' and phone usage details
  • The capybaras paws look more like bird talons or sharp claws than actual capybara feet
  • The passenger is sitting in the front passenger seat instead of the back seat

Wan 2.7 Pro

  • + Naturalistic side profile composition that captures the vehicle interior well
  • + Higher quality rendering of the capybara's paws on the steering wheel
  • + Very professional-looking chauffeur cap design
  • The passenger is sitting in the front passenger seat, failing the 'back seat' instruction
  • The passenger is not looking at a phone as requested in the prompt
  • The capybara's head has a slightly strange anatomical merger with its neck/jacket

Verdict: Grok Imagine Image followed the prompt much more closely, successfully including the businesswoman looking at her phone even though she was placed in the front seat. Wan 2.7 Pro delivered a high-quality cinematic angle but failed to include the phone and presented the passenger with a gaze that did not match the instruction. Grok is the winner for superior prompt adherence and realistic texture work.

Grok Imagine Image

An image generation model by xAI designed to generate highly aesthetic images from text descriptions.

Wan 2.7 Pro

Alibaba's Wan 2.7 Pro image generation and editing model with higher-quality outputs and support for 4K image generation