Nano Banana 2 vs GPT Image 1.5
Head-to-head across 12 challenges
Nano Banana 2
42.9%
win rate
Ties
14.3%
GPT Image 1.5
42.9%
win rate
Challenge Results
Geometric Composition
Text-to-Image“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”
AI Judge Analysis
Nano Banana 2
- + Excellent text rendering on the book spine.
- + Highly realistic lighting and depth of field.
- + Perfect adherence to all spatial instructions, including the plant being visible through the glass.
- − The glass cube lacks a top pane, making it more of an open tank or frame.
- − The blue sphere is quite large relative to the cube, pushing the definition of 'small'.
GPT Image 1.5
- + Strong geometric coherence for the glass cube with clear definition of all sides.
- + Bright, vibrant colors and clean surfaces.
- + Accurate placement of all requested elements.
- − The book appears to be floating slightly above the glass top rather than resting on it.
- − The glass refraction and the reflection of the sphere on the bottom are slightly less realistic than Model A.
Verdict: Both models followed the complex spatial instructions perfectly. Nano Banana 2 has superior photographic quality and impressive text rendering on the book, though the cube is missing its top lid. GPT Image 1.5 provides a more geometrically solid cube, but suffers from a 'floating' book artifact and slightly less natural lighting.
Man and Car in California
Editing“Make a photo of the man driving the car down the California coastline”
AI Judge Analysis
Nano Banana 2
- + Perfectly preserves the specific car model (white Rolls-Royce Phantom Drophead Coupé) from the source image.
- + Accurately places the man inside the car with his signature hairstyle and scarf visible.
- + Creates a high-quality, professional-looking action shot with motion blur on the wheels and road.
GPT Image 1.5
- + Maintains the man's likeness and clothing details very well.
- + The background scenery feels very authentic to the California coastline (Big Sur style).
- − The car is rendered as a generic white convertible, losing the iconic Rolls-Royce grille and hood ornament from the source.
- − The composition is uncomfortably tight, cutting off the front of the vehicle.
Verdict: Nano Banana 2 is the clear winner because it successfully integrated the man into the exact car provided in the source image, whereas GPT Image 1.5 replaced the Rolls-Royce with a generic vehicle. Nano Banana 2 also displayed superior composition by showing the full car in motion along the coastline.
Candid Street Photography
Text-to-Image“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”
AI Judge Analysis
Nano Banana 2
- + Excellent environmental storytelling with authentic Japanese street signage.
- + Highly realistic wet pavement reflections and color palette.
- + Includes 'motion blur from passing cars' as requested in the prompt.
- − The bike chain is disconnected from the bike in a physically impossible way.
- − The wrench being held is slightly distorted and floating near the hand.
GPT Image 1.5
- + Stronger shallow depth of field effect that emphasizes the subject.
- + Excellent natural skin texture and more convincing interaction with the bike parts.
- + Visible rain drops on the man's jacket and the bike frame heighten the realism.
- − Lacks the requested 'motion blur from passing cars' as the vehicle in the background is static or frozen.
- − The environment feels less distinctly Japanese compared to the signage in the other image.
Verdict: Nano Banana 2 captures the 'motion blur' and atmospheric signage of a Japanese street much better, but suffers from significant anatomical and mechanical glitches with the hands and bicycle chain. GPT Image 1.5 produces a much more coherent subject and higher quality textures, though it fails to incorporate the specific motion blur request. GPT Image 1.5 is the winner for its superior visual clarity and physical logic.
Fantasy Warrior
Text-to-Image“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”
AI Judge Analysis
Nano Banana 2
- + Excellent depiction of multiple braids with small beads as requested.
- + Highly detailed engraved plate armor with clear runic-style symbols.
- + Strong character expression that effectively conveys a 'battle-worn' state.
- − The hand holding the sword is anatomically distorted with messy finger merging.
- − The scale of the character feels slightly smaller compared to the armor bulk.
GPT Image 1.5
- + Beautiful warm lighting and bokeh sparks that create a cinematic atmosphere.
- + Exceptional skin texture showing fine pores, sweat, and subtle scarring.
- + Superior eyes with lifelike reflections and depth.
- − Armor engravings are a bit softer and less defined than in Model A.
- − Fewer braids visible compared to the detailed braiding in Model A.
Verdict: Both models followed the prompt exceptionally well, but GPT Image 1.5 is the winner due to its superior photographic quality and lifelike facial details. While Nano Banana 2 did a better job with the specific request for braided hair and intricate armor runes, it suffered from significant anatomical errors in the hand, whereas GPT Image 1.5 felt more cohesive and visually stunning.
Modern Clean Menu
Text-to-Image“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”
AI Judge Analysis
Nano Banana 2
- + Excellent text rendering with very few spelling errors.
- + Highly professional layout with sophisticated grid for images.
- + Accurately includes all three requested sections (Appetizers, Pizza, Mains) with clear headers.
- − Small fine print descriptions contain some minor gibberish characters.
- − The layout feels slightly crowded due to the high density of photos.
GPT Image 1.5
- + Clean, minimalist aesthetic that is easy to read at a glance.
- + Excellent image quality for the food photographs.
- + Good use of color accents on the section headers.
- − The image grid is less structured than requested, with variable sizing.
- − The bottom section is cut off at the edge of the frame.
- − Only includes two categories in the menu text list (Appetizers/Pizza) before the cutoff.
Verdict: Nano Banana 2 is the superior design as it provides a complete, professional menu layout with all requested prompt elements, including three distinct categories and a structured photo grid. GPT Image 1.5 has high-quality visuals but fails on the composition, as the bottom of the menu is cut off and it lacks the level of detail found in the first image's text and branding.
Bald man challenge
Editing“Give the person a full, thick head of natural hair with realistic texture, density, and a natural hairline. Preserve facial features and lighting.”
AI Judge Analysis
Nano Banana 2
- + Perfect preservation of original facial features and expression
- + Seamless hair integration with the original lighting and texture
- + High realism in the salt-and-pepper hair coloring
- − The forehead height makes the hairline feel slightly pushed back, though still within natural limits
GPT Image 1.5
- + Provides a full, thick head of curly hair
- + Correctly identifies the texture requested in the prompt
- − Significantly alters the subject's facial features, making him look like a different person
- − The ear and glasses frame on the left side are poorly rendered and merged with the hair
- − The image has a softer, lower-resolution appearance compared to the original
Verdict: Nano Banana 2 is the clear winner as it successfully adds natural-looking hair while perfectly preserving the identity, lighting, and details of the original photograph. GPT Image 1.5 fails the editing task by fundamentally changing the subject's face and introducing artifacts around the ears and glasses.
Night Sky Transformation
Editing“Change the scene to night: a deep, dark sky with subtle, glistening stars visible behind the mountain.”
AI Judge Analysis
Nano Banana 2
- + Excellent source preservation, keeping all foreground village details and mountain structures identical to the original.
- + Realistic transition to night while maintaining visibility of the landscape.
- + Subtle and natural-looking star field.
- − The lighting on the mountain peaks is slightly brighter than a 'deep dark sky' might suggest, though it remains cohesive.
GPT Image 1.5
- + Very effective 'deep, dark sky' as requested in the prompt.
- + Atmospheric and high-contrast night scene.
- − Loss of fine detail in the mountain shadows compared to the source.
- − Slightly less 'subtle' star field, leaning towards a very dense Milky Way effect.
- − Foreground village lighting lost some of the warm glow and clarity found in the original.
Verdict: Nano Banana 2 is the preferred choice because it successfully changed the sky to a starry night while perfectly preserving every detail of the mountain and village from the source image. GPT Image 1.5 followed the 'dark' instruction well but at the cost of losing significant visual information in the landscape and shadows.
Over-the-top cartoon caricature
Editing“Create a caricature of me and my job. Make it exaggerated and humorous, incorporating my profession as a tv show anchor and my love for dogs and hockey.”
AI Judge Analysis
Nano Banana 2
- + Perfectly captures the hand-drawn colored pencil aesthetic of a classic boardwalk caricature.
- + Integrated clever details like the 'W-K9 News' microphone and hockey sticks held by dogs.
- + Excellent text rendering with humorous, relevant headlines that fit the theme.
- − The character's facial resemblance to the original woman is slightly less accurate than Model B.
- − The hockey stick held by the character has some structural issues near the hand.
GPT Image 1.5
- + Maintains a high level of facial resemblance to the subject in the source image despite the caricature style.
- + Higher visual fidelity with vibrant colors and complex background elements like the hockey rink monitor.
- + Creative depiction of a dog wearing a hockey helmet and holding a stick.
- − The style feels more like a 3D digital illustration than a traditional 'caricature'.
- − Anatomical errors in the hand holding the microphone (odd finger placement and size).
Verdict: Both models followed the instructions well, but Nano Banana 2 better captured the specific 'caricature' art style requested, presenting it as a physical drawing with clever puns like 'W-K9'. GPT Image 1.5 achieved a better facial likeness of the source subject and high-quality rendering, but its execution feels more like a digital painting than a caricature and suffers from significant hand artifacts.
Adorable Baby Animals in Sunny Meadow
Text-to-Image“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”
AI Judge Analysis
Nano Banana 2
- + Excellent anatomical accuracy for all four animals.
- + Very clean details in the fur textures and individual blades of grass.
- + Clear inclusion of all requested elements with a dynamic 'chasing' composition.
- − The lighting feels slightly more artificial and layered compared to the warmer glow of the competitor.
GPT Image 1.5
- + Beautiful warm golden hour lighting with soft 'god rays' as requested.
- + Adorable and expressive facial expressions on the animals.
- + Rich, painterly texture that enhances the 'wholesome' vibe.
- − Anatomical issues with the kitten, particularly the paws and the way it is 'tumbling'.
- − The fox appears a bit more like a domestic dog hybrid in its facial structure.
Verdict: Nano Banana 2 produces a much more coherent and anatomically correct image, with distinct and well-rendered animals that are clearly chasing butterflies. While GPT Image 1.5 captures the warm, hazy 'wholesome' lighting slightly better, it suffers from significant anatomical warping in the kitten's legs and paws, making Nano Banana 2 the superior technical choice.
Studio Ghibli Anime Style
Editing“Transform this photo into a Studio Ghibli–inspired illustration. Use soft pastel colors, hand-painted textures, gentle lighting, dreamy backgrounds, and a warm, nostalgic mood”
AI Judge Analysis
Nano Banana 2
- + Excellent structural preservation of the original image composition and character poses.
- + Effective use of watercolor line-and-wash texture common in Ghibli backgrounds.
- + Captures the character expressions and clothing patterns with high fidelity to the source.
- − The colors are slightly more saturated than the requested 'soft pastel' palette.
GPT Image 1.5
- + Strong 'dreamy' atmosphere with soft, hazy lighting and warm tones.
- + Very successful application of soft pastel colors.
- + Art style feels reminiscent of Ghibli's softer, more painterly moments.
- − Loses some of the distinct facial expressions from the original meme.
- − The hand of the man is somewhat malformed compared to Model A.
Verdict: Nano Banana 2 is the better edit for this specific challenge as it maintains the iconic character expressions and structural integrity of the 'Distracted Boyfriend' meme while perfectly applying a Ghibli-esque watercolor style. GPT Image 1.5 succeeds in creating a dreamy atmosphere and a beautiful pastel palette, but it loses some of the character personality and has minor issues with hand anatomy.
Vintage Cafe Logo
Text-to-Image“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”
AI Judge Analysis
Nano Banana 2
- + Perfect text rendering for both the name and the date banner.
- + Excellently follows the light background and subtle texture request.
- + Clean vector emblem style with cohesive design elements.
- − The accent on 'Caffè' is slightly oversized.
GPT Image 1.5
- + Attractive lighting and shading on the cloche dome.
- + Nice textured brushwork on the illustration.
- − Failed the background instruction by using solid black instead of a light textured background.
- − Text rendering on 'FLORIAN' has inconsistent letter heights and kerning.
- − The banner is disconnected and lacks the circular 'emblem' feel requested.
Verdict: Nano Banana 2 followed the prompt instructions perfectly, particularly the requirement for a light background with subtle texture and a vector emblem style. In contrast, GPT Image 1.5 ignored the background color instruction and produced text with minor inconsistencies in the lettering. Nano Banana 2 is the clear winner for its superior typography and layout coherence.
Apollo 11: Journey to Tranquility
Text-to-Image“Create a clean, modern vector infographic poster about the Apollo 11 mission. NASA-inspired palette (navy, white, muted red, light gray). Flat-vector style, crisp lines, consistent iconography, subtle gradients only. Steps (stop at landing): 1. Launch (Saturn Vicon) 2. Earth Orbit (Earth + orbit ring icon) 3. Translunar (trajectory arc icon) 4. Lunar Orbit (Moon + orbit ring icon) 5. Descent (lunar module descending icon) 6. Landing (lunar module on the surface icon) Small supporting elements (minimal text): • Crew strip: three silhouette icons with only last names: Armstrong, Aldrin, Collins. • Landing site marker: Moon pin labeled "Tranquility" only. Layout constraints: generous margins, large readable labels, clean background with subtle stars. Vector-only, print-poster look, high resolution.”
AI Judge Analysis
Nano Banana 2
- + Excellent adherence to the color palette and clean vector aesthetic.
- + Perfectly rendered, legible text for all mission steps and crew names.
- + Strong icon-based composition that feels like a professional infographic.
- − Includes a fourth, unnamed silhouette in the crew section.
- − Icons are a bit small compared to the overall canvas size.
GPT Image 1.5
- + Dynamic and engaging vertical layout.
- + Bold, clear typography that aligns well with the graphic elements.
- + Good illustrative detail on the Saturn V and Lunar Module.
- − The top of the text is cropped out of the frame.
- − The Earth orbit icon is placed next to the 'Launch' text, creating a slight misalignment in the process flow.
- − Includes greenery (green landmasses) on Earth which deviates from the requested 'NASA-inspired' muted palette.
Verdict: Nano Banana 2 followed the prompt's stylistic and color instructions more accurately, producing a clean and professional-looking infographic with excellent text rendering. GPT Image 1.5 created a more visually dynamic poster, but it suffered from framing issues at the top and slight inaccuracies in the sequence of icons relative to the text labels.
Nano Banana 2
Gemini 3.1 Flash with image generation capabilities. High-efficiency image generation model with support for text rendering, reference images, search grounding, and thinking mode. The efficient counterpart to Gemini 3 Pro Image.
GPT Image 1.5
OpenAI's state-of-the-art image generation model with better instruction following and adherence to prompts