GPT Image 1 Mini vs Wan 2.7 Pro

Head-to-head across 4 challenges

GPT Image 1 Mini: 100.0% win rate

Ties: 0.0%

Wan 2.7 Pro: 0.0% win rate

Challenge Results

Chalkboard Menu

Text-to-Image

“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”

[Images: GPT Image 1 Mini output, Wan 2.7 Pro output]

GPT Image 1 Mini: 100% wins · Ties: 0% · Wan 2.7 Pro: 0% wins

AI Judge Analysis

GPT Image 1 Mini

+ Excellent chalk texture throughout the lettering
+ Very realistic and natural handwriting variations
+ Perfect text accuracy and spelling

− The title is in print-style block letters rather than the requested elegant cursive
− The background context of a 'cozy café' is barely visible beyond the board frame

Wan 2.7 Pro

+ Beautiful 'cozy café' background with nice lighting and depth
+ Highly consistent font/handwriting style
+ Good composition with decorative flourishes

− Rendered the phrase '& Herbs - $28' twice on the second item
− The text looks more like a digital font overlay than real chalk on a board
− Failed to render the title in cursive as requested

Verdict: GPT Image 1 Mini wins on realism and technical accuracy, with a convincing chalk texture and perfect spelling. Wan 2.7 Pro creates a much more appealing 'café' environment, but its text duplicates a phrase and lacks the grainy texture of real chalk that the prompt specifies.

Pose & Character Mashup

Editing

Edit instruction

“Use Image 1 as the exact pose reference and Image 2 as the character reference. Recreate the person/character from Image 2 in the exact dynamic pose and body position from Image 1. Keep the exact face, hair, clothing style/details, and expression from Image 2. Match the lighting and environment of Image 1. The final image must show the character from Image 2 performing the precise action/pose from Image 1 with perfect anatomy and natural integration.”

[Images: Source, GPT Image 1 Mini output, Wan 2.7 Pro output]

AI Judge Analysis

GPT Image 1 Mini

+ Successfully transferred the person, sunglasses, scarf, and clothing from Image 2.
+ Matches the yellow background and red stool environment accurately.
+ Good facial recognition and resemblance to the character reference.

− Failed to match the 'exact' complex pose, simplifying the leg cross into a basic step.
− The left foot (on the stool) is anatomically distorted with too many toes.
− Changed the character's expression to a slight smirk instead of keeping the neutral look from Image 2.

Wan 2.7 Pro

+ Perfectly preserved the original background, pose, and composition from Image 1.

− Completely failed to perform the edit, ignoring the character reference (Image 2) entirely.
− Just returned a cropped/modified version of the source Image 1.

Verdict: GPT Image 1 Mini followed the complex instructions, combining the character's appearance from Image 2 with the environment of Image 1, though it struggled with anatomy and simplified the dancer's pose. Wan 2.7 Pro failed the task entirely: it returned a cropped or modified version of the source Image 1 without incorporating the character reference.

Outfit Transfer Challenge

Editing

Edit instruction

“Use Image 1 as the base person. Dress them in the exact elaborate outfit from Image 2 (including all layers, accessories, jewelry, and shoes). Carefully adapt the clothing to the body shape and pose in Image 1 while maintaining realistic fabric behavior, correct proportions, and perfect lighting/shadow matching. Keep the person’s exact face, hair, and background completely unchanged.”

[Images: Source, GPT Image 1 Mini output, Wan 2.7 Pro output]

AI Judge Analysis

GPT Image 1 Mini

+ Excellent adherence to the specific outfit in Image 2, including the exact scarf pattern and coat style.
+ Higher image quality and resolution with realistic fabric textures.
+ Very accurate lighting and shadow integration that matches the beach environment.

− Significantly changed the person's facial structure and hair, making them look older and different from the source image.
− Lost the 'Inersy' branding details on the waistband and altered the specific vitiligo patterns.

Wan 2.7 Pro

+ Perfectly preserved the person's exact face, hair, and original vitiligo patterns on the skin.
+ Maintained the full composition of the source image including the wooden structure and background 1:1.

− Completely failed to use the outfit from Image 2, instead generating a random patterned robe/suit.
− The hands have minor anatomical distortions and the clothing fit looks slightly unnatural at the waist.

Verdict: This is a trade-off between outfit accuracy and identity preservation. GPT Image 1 Mini correctly identified and adapted the complex outfit from Image 2 but failed to keep the base person's face and hair unchanged. Wan 2.7 Pro perfectly preserved the person and the environment but completely ignored the instruction to use the specific outfit from Image 2. Because the outfit transfer is the core of this edit task, GPT Image 1 Mini is the better choice.

The Capybara Taxi Driver

Text-to-Image

“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”

[Images: GPT Image 1 Mini output, Wan 2.7 Pro output]

AI Judge Analysis

GPT Image 1 Mini

+ Excellent atmospheric lighting and photorealism
+ Very accurate rendering of the capybara's fur and expression
+ Stronger focus on the requested background element of the woman looking at her phone

− Only one of the capybara's paws is visible on the wheel, although the prompt asked for both
− The woman's hand holding the phone looks slightly distorted

Wan 2.7 Pro

+ Better adherence to the 'both front paws on the steering wheel' instruction
+ Clearer depiction of the New York City street through the window
+ Shows more of the taxi's exterior and interior for context

− The woman in the back is not looking at a phone, contrary to the prompt
− The scale of the capybara and the interior layout feels slightly unnatural

Verdict: GPT Image 1 Mini creates a moodier, more cinematically realistic image that captures the specific 'bored' interaction with the phone, though it fails to place both paws on the wheel. Wan 2.7 Pro succeeds in the physical positioning of the capybara but misses the key prompt detail of the passenger looking at her phone. GPT Image 1 Mini is the narrow winner due to its superior lighting, texture, and adherence to the prompt's tone.
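
The four verdicts above can be tallied into the headline win rates with a short sketch. The page does not say how its win rate is computed, so this assumes one equally weighted result per challenge; the `results` mapping, the `"tie"` marker, and the `win_rates` helper are illustrative names, not part of the arena.

```python
from collections import Counter

# Winner of each challenge, taken from the verdicts on this page.
# A hypothetical "tie" value would mark a drawn challenge; none occurs here.
results = {
    "Chalkboard Menu": "GPT Image 1 Mini",
    "Pose & Character Mashup": "GPT Image 1 Mini",
    "Outfit Transfer Challenge": "GPT Image 1 Mini",
    "The Capybara Taxi Driver": "GPT Image 1 Mini",
}


def win_rates(results, models):
    """Return the percentage of challenges won by each model, plus ties.

    Assumes every challenge counts equally (an assumption, not something
    the page states).
    """
    counts = Counter(results.values())
    total = len(results)
    rates = {model: 100.0 * counts.get(model, 0) / total for model in models}
    rates["Ties"] = 100.0 * counts.get("tie", 0) / total
    return rates


rates = win_rates(results, ["GPT Image 1 Mini", "Wan 2.7 Pro"])
for name, rate in rates.items():
    print(f"{name}: {rate:.1f}%")
```

Under that assumption, four wins out of four challenges gives GPT Image 1 Mini 100.0%, with 0.0% for ties and for Wan 2.7 Pro, which matches the summary at the top of the page.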

GPT Image 1 Mini

OpenAI's cost-effective image generation model for when image quality isn't the top priority


Wan 2.7 Pro

Alibaba's Wan 2.7 Pro image generation and editing model with higher-quality outputs and support for 4K image generation
