Gemini 3 Pro with image generation capabilities. Combines advanced reasoning with the ability to generate and edit images.
Settled by community votes across 15 shared challenges, with an AI judge weighing in on each.
Nano Banana Pro
#3 of 48 in Text-to-Image
GPT Image 2
#2 of 48 in Text-to-Image
Where the votes landed
Nano Banana Pro
100.0%
win rate
Ties
0.0%
GPT Image 2
0.0%
win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Geometric Composition
Text-to-Image“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”
AI Judge Analysis
Nano Banana Pro
- + Excellent photographic realism with natural window light and realistic shadows.
- + Highly detailed wood grain and book texture that feels authentic.
- + Precise adherence to the spatial arrangement of elements.
- − The plant pot is quite dominant, though it follows the 'behind the cube' instruction.
GPT Image 2
- + Clean, vibrant colors and sharp focus on the central objects.
- + Good understanding of the prompt's requirements for object placement.
- + The blue sphere is solid and well-defined.
- − The lighting feels more synthetic and less like natural window light compared to the other model.
- − The glass cube edges appear slightly inconsistent in thickness.
Verdict: Nano Banana Pro produces a significantly more realistic and cinematic image with fantastic attention to light, shadow, and texture. While GPT Image 2 follows the prompt accurately, its output has a flatter, more digital appearance compared to the high-fidelity photographic quality of Nano Banana Pro.
Candid Street Photography
Text-to-Image“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”
AI Judge Analysis
Nano Banana Pro
- + Exceptional photographic realism with film-like grain.
- + Highly accurate rain effects including visible droplets and wet surface reflections.
- + Successfully captures motion blur from passing cars while maintaining focus on the subject.
- − The man's right hand has some slight structural weirdness near the bike seat.
GPT Image 2
- + Good inclusion of a toolbox as a logical detail for a repair scene.
- + Sharp focus on the subject with a shallow depth of field.
- + Accurate Japanese signage in the background.
- − The bicycle rendering is physically incoherent with the frame passing through the rear tire/spokes.
- − Lacks the requested 'rain' atmospheric effect, appearing mostly dry with only minor ground reflections.
- − The lighting feels flat and less cinematic than requested.
Verdict: Nano Banana Pro significantly outperforms GPT Image 2 by delivering a truly cinematic and realistic 'film' aesthetic that perfectly captures the requested rainy atmosphere. While GPT Image 2 includes nice environmental details, it fails on the basic physics of the bicycle and fails to produce the 'light rain' requested in the prompt. Nano Banana Pro feels like a genuine candid street photograph.
Fantasy Warrior
Text-to-Image“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”
AI Judge Analysis
Nano Banana Pro
- + Excellent adherence to the 'battle-worn' descriptor with visible scars and grit.
- + Superior engraving details on the plate armor and realistic leather strap texture.
- + Very strong implementation of bokeh sparks and warm torchlight reflections.
- − The torch in the corner is a bit distracting in its proximity to the face.
- − Symmetry in the facial scars feels slightly artificial.
GPT Image 2
- + Very naturalistic and beautiful skin texture with realistic freckles and subtle dirt.
- + High-quality hair rendering with intricate braids and beads.
- + Photorealistic eyes with great depth and clarity.
- − The armor looks more like metal-detailed cloth or rusted iron rather than 'ornate engraved plate'.
- − Missing the prominent 'battle-worn' scars requested in the prompt.
- − Lacks the 'bokeh sparks' which were a specific requirement.
Verdict: Nano Banana Pro followed the prompt more closely, delivering on the specific details of ornate plate armor, scars, and bokeh sparks. While GPT Image 2 produced a very beautiful and realistic portrait, it leaned toward a cleaner aesthetic and missed several key technical elements requested in the prompt like the specific scars and sparks.
Modern Clean Menu
Text-to-Image“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”
AI Judge Analysis
Nano Banana Pro
- + Features very clear, bold headers that establish a strong hierarchy
- + Adheres strictly to the requested three sections (Appetizers, Pizza, Mains)
- + Maintains clean, vibrant color coding for each section
- − Text contains significant gibberish and misspellings (e.g., 'Barked Pizza', 'Graoy Salad')
- − The food images are repetitive, with pasta being shown in the Pizza section
GPT Image 2
- + Excellent text rendering with real ingredients and coherent descriptions
- + Highly professional layout with sophisticated graphic design elements like icons and social handles
- + Food photos are diverse and accurately correspond to their labels
- − The layout is significantly more cluttered than the requested minimalist style
- − The grid of circles creates a busier aesthetic compared to a simple minimalist grid
Verdict: Nano Banana Pro provides a simpler, more minimalist structure but is severely hindered by nonsensical text and incorrect food associations (pasta labeled as pizza). GPT Image 2 creates a much more functional and professional menu with readable text and high-quality imagery, despite being slightly more complex than the 'minimalist' prompt requested. GPT Image 2 is the clear winner for its real-world usability and technical execution.
Magic Burger Explosion: Fiery Photorealism Challenge
Text-to-Image“Ad for 'Magic Burger'. Dynamic, exploded burger with all components (bun, patty, cheese, lettuce, tomato, sauce) suspended in mid-air. Emphasize photorealistic detail and a sense of motion. Dark, fiery background with glowing embers. Integrate text: 'MAGIC BURGER' as a prominent title, 'LIMITED TIME ONLY' as a secondary message, and '€6.99' in a starburst, all rendered with a fiery, glowing effect.”
AI Judge Analysis
Nano Banana Pro
- + Excellent typography with a clean, professional neon-glow finish.
- + High level of photorealism in the food textures, especially the tomato and bun.
- + Clean composition with a well-defined starburst element.
- − The 'exploded' effect is more restrained compared to the other model.
- − The background embers are somewhat generic and lack the intense 'fiery' feel requested.
GPT Image 2
- + Superior 'exploded' dynamics with sauce splashes and intense motion.
- + Text is perfectly integrated with an impressive fire and flame effect.
- + The background is more dramatic and better matches the 'fire and glowing embers' prompt.
- − The placement of the lettuce under the top bun makes the burger feel a bit lopsided.
- − The starburst design is a bit chaotic and slightly distracting from the product.
Verdict: Both models followed the prompt exceptionally well, but GPT Image 2 captured the 'fiery' and 'dynamic' spirit of the request more effectively with its flame-textured typography and intense motion. Nano Banana Pro produced a cleaner, more traditional commercial look with better food textures, but it lacked the dramatic impact seen in the competing image. GPT Image 2 is the winner for its superior creative integration of the text and background elements.
Chalkboard Menu
Text-to-Image“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”
AI Judge Analysis
Nano Banana Pro
- + Excellent text legibility and accuracy
- + Effective use of negative space on the chalkboard surface
- + Composition creates a nice atmospheric depth with the café background
- − The handwriting style looks slightly more like a digital font than natural chalk
- − Minimal 'chalk' texture within the letter strokes
GPT Image 2
- + Authentic chalk texture and natural handwriting variations
- + Excellent adherence to the 'elegant cursive' request for the title
- + Higher detail in the chalkboard surface grain and surrounding props
- − The alignment of the bottom text is slightly slanted upwards compared to the rest of the board
Verdict: Both models followed the prompt instructions perfectly, including the specific date and menu items. GPT Image 2 is the winner because its rendering of the chalk texture and handwriting felt much more authentic to a real hand-drawn board, whereas Nano Banana Pro produced text that looked a bit too much like a clean digital font overlay.
Pose & Character Mashup
Editing“Use Image 1 as the exact pose reference and Image 2 as the character reference. Recreate the person/character from Image 2 in the exact dynamic pose and body position from Image 1. Keep the exact face, hair, clothing style/details, and expression from Image 2. Match the lighting and environment of Image 1. The final image must show the character from Image 2 performing the precise action/pose from Image 1 with perfect anatomy and natural integration.”
AI Judge Analysis
Nano Banana Pro
- + Excellent adherence to the exact pose and body position from Image 1.
- + High fidelity to the character's face, sunglasses, and scarf from Image 2.
- + Very clean integration of the character's feet onto the red ottoman.
- − The scarf's tassels are slightly simplified compared to the source.
GPT Image 2
- + Successfully replicates the complex pose and environment.
- + Maintains the core identity and accessories of the character from Image 2.
- + Good rendering of the hands and overall lighting.
- − The character's right foot (on the left) is poorly rendered with merging toes.
- − The face looks slightly more 'painterly' and less photorealistic than Nano Banana Pro.
Verdict: Nano Banana Pro is the winner because it achieves nearly perfect pose replication while maintaining higher anatomical accuracy, particularly in the feet and facial features. GPT Image 2 performs well on the overall composition but suffers from visible artifacts in the toes and a slightly less consistent facial likeness compared to the character reference.
The Reversed Rodeo
Text-to-Image“Horse riding astronaut in space — horse on top, not vice versa. Surreal, highly detailed, cinematic.”
AI Judge Analysis
Nano Banana Pro
- + Exhibits a beautiful, cinematic space background with nebulae and planetary details.
- + Captures the surreal nature of the prompt with a whimsical cape and floating elements.
- + Highly detailed rendering of both the astronaut suit and the horse's fur.
- − The astronaut is holding the horse up rather than the horse riding the astronaut.
- − Anatomical merge where the horse's front hoof becomes the astronaut's hand.
GPT Image 2
- + Perfectly adheres to the specific 'riding' instruction with the horse seated on a saddle on the astronaut.
- + High realism in textures, particularly the lunar surface and the astronaut's suit fabric.
- + Correctly interprets the 'horse on top' spatial relationship.
- − The astronaut's hands have too many fingers (6 on each hand).
- − The horse's front legs are truncated and end abruptly in stirrups/loops in a confusing way.
Verdict: GPT Image 2 followed the specific instruction of 'horse riding astronaut' much more literally by positioning the horse in a saddle on the astronaut's back. Nano Banana Pro failed the core logic of the prompt by having the astronaut lift the horse instead of being ridden, though it produced a more colorful and 'cinematic' space aesthetic. Despite the extra fingers, GPT Image 2 is the winner for its superior prompt adherence.
Outfit Transfer Challenge
Editing“Use Image 1 as the base person. Dress them in the exact elaborate outfit from Image 2 (including all layers, accessories, jewelry, and shoes). Carefully adapt the clothing to the body shape and pose in Image 1 while maintaining realistic fabric behavior, correct proportions, and perfect lighting/shadow matching. Keep the person’s exact face, hair, and background completely unchanged.”
AI Judge Analysis
Nano Banana Pro
- + Excellent preservation of the person's exact facial features, hair, and sand markings.
- + High fidelity to the original background's details and lighting.
- + Accurate replication of the coat, scarf, and watch from Image 2.
- − The hands inside the pockets look slightly poorly defined or merged with the fabric.
- − The right side of the background (the horizon) is slightly cropped compared to the original.
GPT Image 2
- + Successfully transfers the outfit, including the specific scarf pattern and peacoat.
- + Maintains the characteristic hair and vitiligo markings of the subject.
- + Good integration of the pose with the pockets of the new jacket.
- − Noticeable changes to the facial features, making him look slightly older and different from the source person.
- − The background beach area is significantly altered and simplified compared to Image 1.
- − The skin tone and texture have been smoothed out, losing original character.
Verdict: Nano Banana Pro is the clear winner as it successfully performs the edit while keeping the person's identity and the original background almost perfectly intact. GPT Image 2 applies the clothing well, but it fails the 'source preservation' requirement by altering the subject's face and completely replacing the background elements with a similar but new environment.
The Capybara Taxi Driver
Text-to-Image“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”
AI Judge Analysis
Nano Banana Pro
- + Excellent photorealism with gritty, cinematic lighting
- + Includes realistic interior taxi details like the fare meter and dashboard
- + Strong composition from a perspective that captures both the driver and passenger clearly
- − The capybara's paws on the steering wheel look slightly like human fingers
GPT Image 2
- + Features a more authentic-looking 'NYC' taxi driver cap
- + Good depiction of the bored expression on the passenger
- + The capybara's fur texture is very detailed and realistic
- − The perspective is awkward, cutting off part of the car's exterior frame
- − Lacks the interior details like the taxi meter that add to the requested realism
Verdict: Nano Banana Pro is the winner because it provides a more complete and realistic scene, including detailed taxi instrumentation and a balanced composition that feels like a still from a high-budget movie. While GPT Image 2 has a more official-looking hat for the capybara, its composition is less engaging and lacks the environmental storytelling found in the first image.
The Halloween Invitation
Text-to-Image“Vintage gothic Halloween party invitation. Dark parchment poster, spooky border with webs and thorns, central glowing jack-o-lantern, bats, twisted trees, moody night sky. Add elegant gothic title text saying "Halloween Party Invitation", a small scroll banner saying "You are invited to a night of frights", and event details at the bottom: Date: 30.10.2026 Time: 7pm Location: The Arches, NYC Spooky but polished, cinematic lighting, square format.”
AI Judge Analysis
Nano Banana Pro
- + Excellent typography with a warm glow effect
- + Very clean and readable layout
- + Perfect adherence to all text elements in the prompt
- − The parchment texture is a bit generic
- − The central background illustration is less detailed than Model B
GPT Image 2
- + Highly atmospheric and detailed background scenery including a cathedral and bridge
- + Intricate gothic border design
- + Moodier and more cinematic lighting
- − Text is slightly less legible due to the complex background
- − The fonts used are inconsistent with each other
Verdict: Nano Banana Pro produces a cleaner, more practical invitation with excellent legibility and a classic layout. GPT Image 2 offers a more artistic and immersive gothic world with superior background detail, though it sacrifices a bit of the 'official invitation' clarity found in Nano Banana Pro.
Isometric Miniature Diorama Scenes
Text-to-Image“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”
AI Judge Analysis
Nano Banana Pro
- + Excellent typography with clean, professional alignment
- + Beautiful soft 3D cartoon lighting and refined textures
- + Adheres perfectly to the isometric perspective and minimal diorama request
- − The flag icon is slightly Stylized/Rounded and placed to the right of 'SUSHI' rather than being strictly centered under the text
GPT Image 2
- + Highly realistic PBR material textures, especially on the salmon
- + Clever use of the diorama base to include stone and a small lantern
- + Strong text rendering with a bold 3D effect
- − The perspective is a bit lower than a true 45° isometric view
- − The dish is slightly off-center toward the top left
- − The text and flag take up more vertical space relative to the illustration
Verdict: Nano Banana Pro followed the visual style requirements perfectly, producing a balanced, clean, and genuinely isometric miniature that looks like a professional asset. GPT Image 2 has impressive individual material textures, but the overall composition feels more crowded and less centered than Nano Banana Pro, which nailed the 'ultra-clean' aesthetic requested.
Adorable Baby Animals in Sunny Meadow
Text-to-Image“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”
AI Judge Analysis
Nano Banana Pro
- + Excellent character expressions with very clear, emotive eyes.
- + Strong lighting effects with distinct god rays and sparkling dew details.
- + Vibrant and well-balanced composition where each animal has its own space.
- − The aesthetic leans slightly more towards digital illustration than true photorealism.
- − The butterfly scale is a bit large compared to the animals.
GPT Image 2
- + Higher level of photorealism in the texture of the fur and grass.
- + Better sense of movement and 'tumbling' as requested in the prompt.
- + Natural back-lighting and atmospheric bokeh effects.
- − The kitten's facial structure is slightly distorted and the eyes are less expressive.
- − The rabbit is partially obscured and lacks the distinct 'joyful' expression of the other animals.
Verdict: Both models followed the prompt exceptionally well, including all four specific animals and the sunrise setting. Nano Banana Pro created a more stylized, 'masterpiece' look with perfect expressions, while GPT Image 2 achieved a more believable photorealistic texture and sense of action in the meadow. Nano Banana Pro is preferred for its superior clarity and the charm of its characters' expressions.
Vintage Cafe Logo
Text-to-Image“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”
AI Judge Analysis
Nano Banana Pro
- + Excellent typography rendering with modern clean lines
- + High contrast and well-defined vector style shapes
- + Accurate adherence to the cloche and banner elements
- − The 'Est. 1720' text is slightly off-center within the banner
- − The steam effect is a bit chunky and less sophisticated than traditional engraving
GPT Image 2
- + Beautifully intricate vintage engraving style on the cloche and banner
- + Stronger overall composition with an elegant border frame
- + Perfectly centered and professional typography
- − Included a large border frame that wasn't specifically requested in the prompt
- − Texture is a bit heavier on the background, leaning more into 'aged paper' than 'subtle texture'
Verdict: Both models followed the prompt exceptionally well, but GPT Image 2 stands out for its superior 'vintage' execution, feeling like a genuine 18th-century engraving. While Nano Banana Pro produces a very clean and usable vector-style logo, GPT Image 2's attention to detail in the cross-hatching and composition makes it the more visually impressive choice.
Apollo 11: Journey to Tranquility
Text-to-Image“Create a clean, modern vector infographic poster about the Apollo 11 mission. NASA-inspired palette (navy, white, muted red, light gray). Flat-vector style, crisp lines, consistent iconography, subtle gradients only. Steps (stop at landing): 1. Launch (Saturn Vicon) 2. Earth Orbit (Earth + orbit ring icon) 3. Translunar (trajectory arc icon) 4. Lunar Orbit (Moon + orbit ring icon) 5. Descent (lunar module descending icon) 6. Landing (lunar module on the surface icon) Small supporting elements (minimal text): • Crew strip: three silhouette icons with only last names: Armstrong, Aldrin, Collins. • Landing site marker: Moon pin labeled "Tranquility" only. Layout constraints: generous margins, large readable labels, clean background with subtle stars. Vector-only, print-poster look, high resolution.”
AI Judge Analysis
Nano Banana Pro
- + Excellent typography and logical flow of the mission path.
- + Very clean, minimalist flat-vector aesthetic that perfectly matches the 'modern vector' request.
- + Accurate and legible text including crew names and specific mission details.
- − The transition from step 2 to 3 is a bit visually cluttered with overlapping lines.
GPT Image 2
- + Highly organized grid layout that is very easy to read as an infographic.
- + Includes the iconic NASA logo and Apollo 11 mission patch (re-imagined) for added authenticity.
- + Strong adherence to the color palette and structured icon requirements.
- − The illustrations use detailed shading and 3D-like effects rather than the requested flat-vector style.
- − The crew section uses generic silhouettes rather than the more descriptive icons seen in the other model.
Verdict: Nano Banana Pro better captures the 'flat-vector' and 'modern' style requested, featuring a more creative and fluid layout for the mission path. While GPT Image 2 is very well-organized and includes brand elements like the NASA meatball, its use of heavy 3D shading on the lunar module deviates from the flat-vector style requirement. Nano Banana Pro feels like a cohesive graphic design piece, whereas GPT Image 2 feels slightly more like a collage of assets.
Explore each model
OpenAI's state-of-the-art image generation model with arbitrary resolution up to 4K and strong instruction following