Gemini 3 Pro with image generation capabilities. Combines advanced reasoning with the ability to generate and edit images.
Settled by community votes across 19 shared challenges, with an AI judge weighing in on each.
Nano Banana Pro
#3 of 48 in Text-to-Image
Wan 2.6
#24 of 48 in Text-to-Image
Where the votes landed
Nano Banana Pro
58.8%
win rate
Ties
5.9%
Wan 2.6
35.3%
win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Geometric Composition
Text-to-Image“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”
AI Judge Analysis
Nano Banana Pro
- + Excellent photorealism with a film-like quality and an authentic play of light and shadow.
- + Perfect adherence to spatial instructions with the plant clearly visible through the glass panels.
- + Highly detailed wood texture and convincing glass refraction.
- − The glass cube looks more like a small aquarium or five-sided box than a solid-walled geometric cube.
Wan 2.6
- + Strong colors and sharp contrast.
- + Good interpretation of a solid glass cube with thick walls.
- − The sphere is disproportionately large compared to the prompt for a 'small' sphere.
- − The plant is positioned next to the cube rather than behind it, missing the 'visible through the glass' instruction.
- − The scale of the items makes the wooden table look like a miniature or macro shot.
Verdict: Nano Banana Pro followed all spatial instructions perfectly, including the specific requirement to see the plant through the glass. While Wan 2.6 has vibrant colors, it failed to place the plant correctly and ignored the size descriptor for the small sphere. Nano Banana Pro also features superior lighting and texture detail.
Man and Car in California
Editing“Make a photo of the man driving the car down the California coastline”
AI Judge Analysis
Nano Banana Pro
- + Perfectly preserves the specific plaid coat and thick black scarf from the source.
- + High identity retention for the man, including his unique hairstyle.
- + Excellent lighting integration with the golden hour coastal scene.
- − The steering wheel placement looks slightly awkward relative to his hands.
Wan 2.6
- + Strong dynamic composition with a wider view of the coastline.
- + Good preservation of the car's exterior details.
- + Successful interpretation of the California coastal setting.
- − Significant loss of clothing detail; the scarf is smaller and the plaid pattern is different.
- − The man's facial features and hair texture differ more from the source than in Nano Banana Pro's output.
Verdict: Gemini 3 Pro Image Preview is the clear winner due to its exceptional preservation of source details, accurately maintaining the man's specific plaid coat, heavy knitted scarf, and facial features. While Wan 2.6 captures the spirit of the prompt well, it loses too much identity and clothing detail compared to Gemini 3 Pro Image Preview.
Candid Street Photography
Text-to-Image“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”
AI Judge Analysis
Nano Banana Pro
- + Excellent photographic composition that feels like a genuine candid street photo.
- + Highly realistic skin textures and clothing details without over-sharpening.
- + The background traffic and rain intensity look very natural and accurately reflect the requested 50mm look.
- − The red bicycle is slightly cut off at the bottom and right, though this could be argued as 'imperfect framing'.
Wan 2.6
- + Strong inclusion of rain droplets on the subject's jacket which adds to the atmosphere.
- + Good execution of the shallow depth of field with cinematic lighting.
- − The hands and the tool being used show significant anatomical and structural glitches.
- − The rain drops on the jacket look like frozen glass beads rather than liquid, appearing slightly unnatural.
- − The bicycle's front wheel is missing its tire/rim structure in a way that defies physics.
Verdict: Nano Banana Pro produces a much more coherent and realistic image that genuinely looks like a 50mm street photograph. While Wan 2.6 captures the lighting well, it suffers from significant AI artifacts in the man's hands and the bicycle's anatomy, whereas Nano Banana Pro maintains high physical integrity and natural textures.
Fantasy Warrior
Text-to-Image“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”
AI Judge Analysis
Nano Banana Pro
- + Excellent depiction of ornate engraved plate armor with realistic scratches
- + High-fidelity hair texture with intricate braids and beads
- + Authentic lighting with warm highlights and bokeh sparks
- − The scars appear more like surface dirt than healed or open tissue
Wan 2.6
- + Very realistic dirt and grimy skin texture on the face
- + Great rendering of torn fabric layers and rough leather stitching
- + More emotive, lifelike eye rendering and expression
- − The beads in the hair look a bit like modern glass ornaments
- − The armor engraving is slightly less crisp than Nano Banana Pro's
Verdict: Both models performed exceptionally well, but Nano Banana Pro captures the 'ornate engraved plate armor' and braided hair beads with superior clarity and composition. Wan 2.6 provides a slightly more gritty and realistic skin texture, but Nano Banana Pro feels more like a cohesive, high-budget cinematic portrait.
Modern Clean Menu
Text-to-Image“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”
AI Judge Analysis
Nano Banana Pro
- + Perfect adherence to the requested category sections (Appetizers, Pizza, Mains).
- + Excellent visual hierarchy and clean, professional typography.
- + High-quality, distinct food photos that match the grid layout perfectly.
- + Realistic presentation on a wooden background context.
- − Nonsense filler text in descriptions (though common in AI).
- − Repeated dish names ('Bruschetta' and 'Margherita Pizza' are used for all items in their sections).
Wan 2.6
- + Modern use of vibrant color accents and geometric blocks.
- + Includes a clear 'Restaur Menu' header.
- + Clean minimalist aesthetic with good white space.
- − Failed to include 'Mains' as a distinct section from 'Pizza' (labeled Pizza headings under Appetizers layout).
- − Food photos are repetitive (mostly pizza) and do not represent the variety requested.
- − Graphic elements like prices are messy ($0,09 or $36£2,280) and text is less legible than in Nano Banana Pro's output.
Verdict: Gemini 3 Pro Image Preview is the clear winner as it accurately followed the instructions for specific menu sections (Appetizers, Pizza, Mains) and provided a diverse grid of high-quality food photos. Wan 2.6 struggled with the categorization, filling most of the layout with pizzas despite the prompt for variety, and the text rendering was significantly more distorted.
Magic Burger Explosion: Fiery Photorealism Challenge
Text-to-Image“Ad for 'Magic Burger'. Dynamic, exploded burger with all components (bun, patty, cheese, lettuce, tomato, sauce) suspended in mid-air. Emphasize photorealistic detail and a sense of motion. Dark, fiery background with glowing embers. Integrate text: 'MAGIC BURGER' as a prominent title, 'LIMITED TIME ONLY' as a secondary message, and '€6.99' in a starburst, all rendered with a fiery, glowing effect.”
AI Judge Analysis
Nano Banana Pro
- + Excellent typography with clean, glowing neon-style text
- + Superior rendering of food textures including the char on the patty and juice in the tomato
- + Clean, professional composition that feels like a real commercial advertisement
- − The 'exploded' effect is slightly less dynamic than the other model as the components are more stacked than scattered
Wan 2.6
- + Highly dynamic sense of motion with sauce drips and floating toppings
- + Strong integration of the fiery background with realistic smoke and flame effects
- + The 'Magic Burger' title has an impressive 3D burned-texture effect
- − The pricing starburst looks a bit like a flat sticker, clashing with the 3D scene
- − Minor anatomical issues with how the sauce connects to the top bun
Verdict: Nano Banana Pro produces a cleaner, more legible advertisement with superior text rendering and food photography quality. While Wan 2.6 offers more creative energy and a more 'exploded' composition, Nano Banana Pro is preferred for its professional layout and consistent high-quality details.
Chalkboard Menu
Text-to-Image“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”
AI Judge Analysis
Nano Banana Pro
- + Excellent text legibility and accuracy
- + Highly realistic chalk texture and board scuff marks
- + Great atmospheric lighting and background café scene
- − Handwriting looks slightly too uniform, bordering on a digital font style
Wan 2.6
- + Perfect chalk texture with realistic thick and thin strokes
- + Authentic handwritten variation in sizing and slant
- + Includes realistic chalk dust at the base of the board
- − Unnecessary repetition of prices on separate lines
- − Slightly less clarity in the 'TODAY's' cursive rendering compared to Nano Banana Pro
Verdict: Nano Banana Pro produces a very clean and professionally framed image with perfect spelling, but the text feels a bit like a digital 'chalk' font. Wan 2.6 captures the messy, authentic soul of a hand-drawn chalkboard much better, including realistic dust and variable pressure in the strokes, despite some repetitive price formatting.
The Reversed Rodeo
Text-to-Image“Horse riding astronaut in space — horse on top, not vice versa. Surreal, highly detailed, cinematic.”
AI Judge Analysis
Nano Banana Pro
- + Perfectly follows the specific instruction of 'horse on top'
- + Creative interpretation featuring a majestic cape
- + Strong sense of surrealism and cinematic lighting
- − The astronaut's pose looks more like he is carrying the horse than being ridden by it
Wan 2.6
- + High visual quality with dynamic lighting and textures
- + Excellent detail on the spacesuit and horse tack
- + Beautifully rendered nebula and celestial background
- − Completely fails the negative constraint to put the horse on top
- − Common AI trope of an astronaut on a horse
Verdict: Nano Banana Pro is the winner because it successfully navigated the prompt's difficult logic constraint of having the horse riding the astronaut. While Wan 2.6 produced a visually stunning and polished image, it fell back on a standard cliché and ignored the specific 'horse on top' instruction.
Outfit Transfer Challenge
Editing“Use Image 1 as the base person. Dress them in the exact elaborate outfit from Image 2 (including all layers, accessories, jewelry, and shoes). Carefully adapt the clothing to the body shape and pose in Image 1 while maintaining realistic fabric behavior, correct proportions, and perfect lighting/shadow matching. Keep the person’s exact face, hair, and background completely unchanged.”
AI Judge Analysis
Nano Banana Pro
- + Successfully preserved the exact face, hair, and vitiligo patterns without distortion
- + Transferred almost every element of the outfit including the peacoat, plaid scarf, gold watch, and jeans
- + Excellent lighting integration and shadow matching on the clothing
- − Missed the sunglasses from Image 2
Wan 2.6
- + Included the sunglasses from the reference image
- + Good preservation of the background and wooden structure
- − Failed to include major outfit components like the gold watch and bottom half of the clothing
- − Altered the facial structure slightly, making the nose and jawline less accurate to Image 1
- − The scarf pattern and color are less accurate to Image 2 than Nano Banana Pro's
Verdict: Nano Banana Pro is the clear winner as it successfully transferred nearly the entire complex outfit while perfectly preserving the identity, hair, and skin markings of the original subject. Wan 2.6 struggled with the full outfit transfer, omitting the jewelry and bottom half of the attire, and slightly altered the subject's facial features in the process.
The Capybara Taxi Driver
Text-to-Image“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”
AI Judge Analysis
Nano Banana Pro
- + Excellent photorealism in various textures like the capybara fur and raindrops.
- + Great use of depth from the dashboard perspective including the taxi meter.
- + Captures the bored, nonplussed expression of the passenger perfectly.
- − The capybara's 'hands' are a bit mangled and fuse into the steering wheel.
- − The composition feels slightly cramped by including the exterior car roof at the top.
Wan 2.6
- + Features a more authentic-looking chauffeur/taxi cap with an emblem.
- + Clearer side-profile view that highlights the taxi's exterior details alongside the interior.
- + The capybara's posture and jacket fit look more natural and professional.
- − The passenger's face has a slight digital 'gloss' that appears less realistic than Nano Banana Pro's.
- − The background bokeh and light streaks are a bit generic compared to the high-detail street view in Nano Banana Pro's image.
Verdict: Nano Banana Pro excels in photorealistic textures and atmosphere, creating a gritty and believable New York night scene. However, Wan 2.6 provides a better character design for the capybara, with a cleaner driver's uniform and a more readable side-profile composition. Nano Banana Pro is the likely winner due to its superior realism and more accurate depiction of the passenger's expression as requested.
The Halloween Invitation
Text-to-Image“Vintage gothic Halloween party invitation. Dark parchment poster, spooky border with webs and thorns, central glowing jack-o-lantern, bats, twisted trees, moody night sky. Add elegant gothic title text saying "Halloween Party Invitation", a small scroll banner saying "You are invited to a night of frights", and event details at the bottom: Date: 30.10.2026 Time: 7pm Location: The Arches, NYC Spooky but polished, cinematic lighting, square format.”
AI judge analysis unavailable for this challenge.
Bald man challenge
Image Editing“Give the person a full, thick head of natural hair with realistic texture, density, and a natural hairline. Preserve facial features and lighting.”
AI Judge Analysis
Nano Banana Pro
- + Perfect preservation of original facial features and glasses
- + Extremely realistic hair texture and natural integration with the sideburns
- + Maintains the exact lighting and background from the original image
- − None notable
Wan 2.6
- + Successfully adds a thick head of hair
- + Maintains original background and clothing
- − Significantly alters facial features, especially the nose and eyes
- − The hair looks slightly like a wig and doesn't blend as naturally with the sideburns as Nano Banana Pro's
Verdict: Gemini 3 Pro Image Preview performed a flawless edit, seamlessly adding realistic hair while keeping every other detail of the source image identical. Wan 2.6 struggled with source preservation, noticeably changing the subject's face and eyes during the hair addition process.
Isometric Miniature Diorama Scenes
Text-to-Image“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”
AI Judge Analysis
Nano Banana Pro
- + Excellent rendering of materials, especially the wood grain and ceramic textures.
- + Higher level of detail and variety in the sushi types and garnishes.
- + Very clean and legible typography that follows the requested layout perfectly.
- − The flag icon is slightly off-center compared to the text arrangement.
- − The background is slightly more saturated than the 'light blue' requested.
Wan 2.6
- + Achieves a very distinct 'miniature diorama' look with the double-layered base.
- + Bold and clear typography with a large, accurate flag icon.
- + Soft, clean lighting that matches the '3D cartoon' aesthetic well.
- − The rice texture looks a bit like large white pebbles rather than refined sushi rice.
- − The composition feels slightly bottom-heavy with a lot of empty space in the diorama base.
- − The transition between the sushi and the wooden board is a bit flat.
Verdict: Gemini 3 Pro Image Preview is the winner due to its superior material rendering and overall detail; the wood, ceramics, and sushi textures feel more 'refined' and high-quality. While Wan 2.6 captures the diorama base concept well and has great typography, its actual sushi models are much simpler and the rice texture is less realistic.
Over-the-top cartoon caricature
Editing“Create a caricature of me and my job. Make it exaggerated and humorous, incorporating my profession as a tv show anchor and my love for dogs and hockey.”
AI Judge Analysis
Nano Banana Pro
- + Excellent preservation of the subject's facial features in caricature form.
- + Extremely creative and dense composition with many humorous details.
- + Perfectly integrates all requested elements: anchor desk, multiple dogs, and hockey jerseys/gear.
- − The character's hair is rendered as somewhat messy/disordered compared to the source.
- − High visual density might feel a bit cluttered to some viewers.
Wan 2.6
- + Clean, vibrant cartoon art style.
- + Successfully includes a news studio, dogs, and hockey equipment.
- + Good use of vertical space with a full-body character.
- − The caricature's face bears very little resemblance to the woman in the source image.
- − The hockey stick is oddly thin and elongated, resembling a cane or golf club more than a hockey stick.
- − The anchor role is interpreted more as a field reporter due to the handheld mic.
Verdict: Gemini 3 Pro Image Preview is the clear winner because it successfully creates a caricature that actually looks like the person in the source photo while cleverly combining every requested element into a cohesive 'Dog & Hockey News' set. Wan 2.6 provides a generic cartoon character that lacks the likeness required for an effective caricature and has some odd scaling with the hockey stick.
Adorable Baby Animals in Sunny Meadow
Text-to-Image“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”
AI Judge Analysis
Nano Banana Pro
- + Excellent character rendering with very clean, distinct features for each animal.
- + Rich, vibrant colors and sharp focus on the subjects.
- + Strong adherence to the 'god rays' and 'wildflower' elements of the prompt.
- − The lighting feels slightly artificial or like an illustration rather than a photograph.
- − The butterfly scale is a bit large relative to the animals.
Wan 2.6
- + High degree of photorealism with natural-looking fur textures and atmospheric lighting.
- + The 'tumbling together' action feels more dynamic and integrated than in Nano Banana Pro's image.
- + Beautiful lens flare and dew droplet effects create a soft, cinematic feel.
- − The fox's front paw looks anatomically distorted where it meets the ground.
- − Some floating seed/dust particles appear a bit messy or cluttered across the frame.
Verdict: Nano Banana Pro produces a very clean and joyful image that leans toward a high-end digital illustration style, with excellent clarity on all four animals. Wan 2.6 achieves a much higher level of photorealism and better captures the requested 'tumbling' interaction and natural sunrise atmosphere, making it the more representative output for a 'hyper-photorealistic' request.
Studio Ghibli Anime Style
Editing“Transform this photo into a Studio Ghibli–inspired illustration. Use soft pastel colors, hand-painted textures, gentle lighting, dreamy backgrounds, and a warm, nostalgic mood”
AI Judge Analysis
Nano Banana Pro
- + Perfectly captures the Studio Ghibli 'line and wash' art style.
- + Excellent preservation of poses, clothing patterns, and expressions from the source image.
- + Creative addition of a yellow tram and European-style architecture which fits the Ghibli aesthetic.
- − The character in the red dress has a slightly more defined 'anime' face compared to the more grounded Ghibli style.
Wan 2.6
- + Captures a beautiful watercolor/fresco texture that feels hand-painted.
- + Maintains high fidelity to the original source image elements.
- + Uses soft, dreamy lighting with light particles to enhance the mood.
- − The facial features are a bit generic and lean towards modern manga rather than specific Ghibli character designs.
- − The background is washed out and lacks the detailed environmental charm typical of Ghibli films.
Verdict: Both models did an excellent job of translating the 'Distracted Boyfriend' meme into an illustration. Gemini 3 Pro Image Preview is the winner because its environmental design—including the tram and detailed buildings—much more accurately reflects the world-building and artistic DNA of Studio Ghibli. While Wan 2.6 has lovely watercolor textures, its background is sparsely detailed compared to the rich, curated world seen in Gemini's output.
Golden Hour Stroll
Image Editing“Add dynamic motion to this photo: make hair blow in the wind, add leaves flying, energetic and lively feel.”
AI Judge Analysis
Nano Banana Pro
- + Excellent depiction of blowing hair with realistic lift and strands.
- + Dynamic variety of colorful autumn leaves that feel integrated into the scene.
- + Effectively added whitecaps/motion to the water in the background for consistency.
- − The leaf colors (orange/red) contrast slightly with the very green trees in the background.
- − Some minor blurring around the dog's ear looks accidental rather than intentional motion.
Wan 2.6
- + Successfully added wind effect to the hair while keeping the face clear.
- + Green leaf choice matches the existing foliage of the source image better.
- + Excellent preservation of the woman and dog's original features.
- − The flying leaves are sparse and less dynamic than requested.
- − The wind effect on the hair is slightly less energetic compared to the other model.
Verdict: Gemini 3 Pro Image Preview provided a much more 'energetic and lively' edit as requested, with a significant amount of wind in the hair and a flurry of leaves. While Wan 2.6 did a great job preserving the scene and choosing color-accurate green leaves, its interpretation of 'dynamic motion' was more subtle. Gemini's addition of movement to the water in the background makes the overall environment feel more cohesive.
Vintage Cafe Logo
Text-to-Image“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”
AI Judge Analysis
Nano Banana Pro
- + Perfect text rendering for both the name and the date banner.
- + Excellent composition with the ribbon centered over the cloche.
- + Beautiful illustrative cross-hatching and stylized steam patterns.
- − The 'f's in Caffè touch slightly, which is minor for a vintage look but less 'minimalist'.
Wan 2.6
- + Higher contrast vector style that feels very clean.
- + Subtle and elegant steam design.
- + Good adherence to the requested color palette.
- − The 'Est. 1720' banner is awkwardly placed to the side rather than integrated into the emblem.
- − The 'Est. 1720' text is slightly distorted and less legible than Nano Banana Pro's.
Verdict: Gemini 3 Pro Image Preview captures the vintage emblem style much better by centering the banner and using sophisticated illustrative textures. Wan 2.6 provides a clean vector look but fails on the composition of the secondary text, placing the banner in a way that feels like an afterthought.
Apollo 11: Journey to Tranquility
Text-to-Image“Create a clean, modern vector infographic poster about the Apollo 11 mission. NASA-inspired palette (navy, white, muted red, light gray). Flat-vector style, crisp lines, consistent iconography, subtle gradients only. Steps (stop at landing): 1. Launch (Saturn V icon) 2. Earth Orbit (Earth + orbit ring icon) 3. Translunar (trajectory arc icon) 4. Lunar Orbit (Moon + orbit ring icon) 5. Descent (lunar module descending icon) 6. Landing (lunar module on the surface icon) Small supporting elements (minimal text): • Crew strip: three silhouette icons with only last names: Armstrong, Aldrin, Collins. • Landing site marker: Moon pin labeled "Tranquility" only. Layout constraints: generous margins, large readable labels, clean background with subtle stars. Vector-only, print-poster look, high resolution.”
AI Judge Analysis
Nano Banana Pro
- + Perfect adherence to all six requested steps and iconography.
- + Excellent text rendering for headings and labels.
- + Sophisticated, professional layout that matches the 'modern infographic' request.
- − None notable; matches the prompt requirements exceptionally well.
Wan 2.6
- + Clean palette and minimalist style.
- + Legible text for names and title.
- − Fails to include any of the six requested infographic steps.
- − Large amount of empty space results in a poor composition for an infographic.
- − Lacks the Saturn V and mission-specific iconography requested.
Verdict: Gemini 3 Pro Image Preview perfectly followed the complex multi-step prompt, delivering a detailed and logically organized infographic with correct icons and text labels. In contrast, Wan 2.6 failed to include the infographic steps entirely, providing only a title and crew silhouettes with significant empty space.
Explore each model
Alibaba's multimodal generation model from the Wan AI suite, supporting text-to-video, image-to-video, reference-to-video with audio, and text-to-image, in both Chinese and English.