Nano Banana Google GPT Image 2 OpenAI

Settled by community votes across 8 shared challenges, with an AI judge weighing in on each.

Nano Banana

24.0 arena score

#20 of 44 in Text-to-Image

Skill signature

Not enough comparable category data

The chart appears once both models have ratings across at least three shared arena categories.

GPT Image 2

28.2 arena score

#3 of 44 in Text-to-Image

Top 3 in Text-to-Image

Vote tally

Where the votes landed

Nano Banana

0.0%

win rate

Ties

0.0%

GPT Image 2

100.0%

win rate

0.0% 0.0% ties 100.0%

Shared challenges 8

Challenge by challenge

The strongest take from each model on every shared challenge, with the AI judge's read.

Modern Clean Menu

Text-to-Image

“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”

Nano Banana

GPT Image 2

AI Judge Analysis

Nano Banana

+ Features a very clean, centered minimalist layout
+ Excellent text readability for the menu items
+ High consistency in the photography grid style

− Several spelling errors like 'APPETIERS' and 'Margheiita'
− The food photos in the grid don't always correspond to the categories listed below

GPT Image 2

+ Excellent typography and brand identity design
+ The food images directly correspond to the specific menu item descriptions
+ High level of detail with descriptions and prices for every item

− The layout is slightly more crowded than a 'minimalist' prompt usually implies
− Minor text rendering artifacts on smaller secondary fonts

Verdict: Nano Banana provides a basic minimalist template with good structure but suffers from significant spelling errors and a lack of coherence between the photos and the text. GPT Image 2 delivers a much more functional and professional design where every photo matches the detailed menu descriptions, feeling like a complete brand identity. GPT Image 2 is the clear winner for its superior execution of the professional casual dining theme.

Magic Burger Explosion: Fiery Photorealism Challenge

Text-to-Image

“Ad for 'Magic Burger'. Dynamic, exploded burger with all components (bun, patty, cheese, lettuce, tomato, sauce) suspended in mid-air. Emphasize photorealistic detail and a sense of motion. Dark, fiery background with glowing embers. Integrate text: 'MAGIC BURGER' as a prominent title, 'LIMITED TIME ONLY' as a secondary message, and '€6.99' in a starburst, all rendered with a fiery, glowing effect.”

Nano Banana

GPT Image 2

AI Judge Analysis

Nano Banana

+ Excellent typography rendering with clean, glowing edges.
+ Highly accurate starburst design for the price.
+ Clean, professional composition with good use of negative space.

− The 'exploded' effect is less dynamic than the competitor.
− Lighting on the burger feels slightly disconnected from the fiery environment.

GPT Image 2

+ Energetic and highly dynamic 'exploded' effect with realistic sauce splashes.
+ Vibrant, high-contrast colors and rich textures on the food components.
+ Strong intensity in the fiery background elements.

− The starburst shape is slightly irregular and messy.
− The composition is a bit crowded, making the text feel forced into the side.

Verdict: Both models followed the prompt exceptionally well, but GPT Image 2 creates a more appetizing and dynamic food shot with superior textures and motion. Nano Banana features cleaner graphic design and better text placement, but its burger looks static in comparison to the 'exploded' energy of GPT Image 2.

Chalkboard Menu

Text-to-Image

“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”

Nano Banana

GPT Image 2

AI Judge Analysis

Nano Banana

+ Excellent text legibility and accuracy
+ Clean, professional composition
+ Authentic café background bokeh

− The text looks slightly digital and uniform, lacking enough 'chalky' texture variations
− Letters appear too perfectly aligned for authentic handwriting

GPT Image 2

+ Superior chalk texture with realistic grit and pressure variations
+ More authentic handwriting slant and letter size inconsistencies
+ Excellent framing with the wooden board and physical chalk visible

− Slightly less crisp text rendered at the bottom edge
− The $9 price formatting on the last item is a bit thin

Verdict: Both models followed the prompt perfectly regarding text content. GPT Image 2 wins because it captured the specific 'chalk' texture requested—it looks like physical chalk on a board with varying pressure and grain. Nano Banana's text, while clear, appears a bit too much like a digital 'handwriting' font rather than real hand-drawn chalk.

Pose & Character Mashup

Editing

Edit instruction

“Use Image 1 as the exact pose reference and Image 2 as the character reference. Recreate the person/character from Image 2 in the exact dynamic pose and body position from Image 1. Keep the exact face, hair, clothing style/details, and expression from Image 2. Match the lighting and environment of Image 1. The final image must show the character from Image 2 performing the precise action/pose from Image 1 with perfect anatomy and natural integration.”

Source

Nano Banana

GPT Image 2

AI Judge Analysis

Nano Banana

+ Successfully applied the black sweatshirt and scarf accessories to the person from Image 1.
+ Maintained the exact environment and background of Image 1.
+ The blending of the scarf and hair is relatively natural.

− Failed the primary task of swapping the character; it kept the woman from Image 1 instead of the man from Image 2.
− The sunglasses are poorly integrated, appearing flat and misaligned with the face.

GPT Image 2

+ Successfully swapped the character's face, hair, and facial hair to match the person from Image 2.
+ Accurately replicated the pose and environment from Image 1 with the new character.
+ Perfectly captured all clothing details, including the specific scarf pattern and black sweatshirt.

− Minor anatomical distortion in the hands, particularly the left hand (upper right).
− The skin tone on the feet and legs is slightly light compared to the face.

Verdict: Nano Banana failed to follow the core instruction of swapping the character, essentially just adding clothes and glasses to the woman in Image 1. GPT Image 2 successfully performed a complex character swap, maintaining the man's identity from Image 2 while perfectly replicating the challenging pose and lighting from Image 1.

Outfit Transfer Challenge

Editing

Edit instruction

“Use Image 1 as the base person. Dress them in the exact elaborate outfit from Image 2 (including all layers, accessories, jewelry, and shoes). Carefully adapt the clothing to the body shape and pose in Image 1 while maintaining realistic fabric behavior, correct proportions, and perfect lighting/shadow matching. Keep the person’s exact face, hair, and background completely unchanged.”

Source

Nano Banana

GPT Image 2

0% wins 0% ties 100% wins

AI Judge Analysis

Nano Banana

+ Excellent replication of the specific scarf and coat textures from Image 2
+ Includes minor details like the gold watch and rings from Image 2

− Significantly alters the person's face into a shorter, wider shape compared to the source
− Poorly blended neck and jawline area results in an unnatural appearance

GPT Image 2

+ Successfully preserves the person's original facial structure, expression, and skin vitiligo patterns
+ Better overall proportions and integration of the clothing onto the body

− The scarf pattern is slightly less accurate to the source texture than Model A's version
− Adds generic jeans that were only partially visible in the source

Verdict: GPT Image 2 is the preferred choice as it preserves the identity of the person in the source image, whereas Nano Banana noticeably distorts the face shape. Although Nano Banana captured the fabric patterns of the coat and scarf with higher fidelity, GPT Image 2's superior blending and anatomical consistency make for a much more believable edit.

The Capybara Taxi Driver

Text-to-Image

“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”

Nano Banana

GPT Image 2

0% wins 0% ties 100% wins

AI Judge Analysis

Nano Banana

+ Excellent fur texture and lighting on the capybara.
+ The capybara's expression is very calm and fits the prompt well.
+ The reflection on the window adds a layer of realism.

− The passenger is sitting in the front seat instead of the back seat as requested.
− The scale of the capybara relative to the woman feels slightly off.
− Only one paw is clearly on the steering wheel.

GPT Image 2

+ Correctly followed the spatial instruction, placing the businesswoman in the back seat.
+ Accurately depicted both front paws on the steering wheel.
+ The taxi driver cap includes a relevant 'T NYC' logo, enhancing the theme.

− The passenger's face is a bit blurry and less detailed than the driver.
− The lighting on the capybara's face is slightly less natural than in the other image.

Verdict: GPT Image 2 followed the prompt much more accurately, correctly placing the businesswoman in the back seat and showing both paws on the steering wheel. While Nano Banana has slightly higher aesthetic quality in the capybara's textures, its failure to follow the composition and seating instructions makes it less successful overall.

The Halloween Invitation

Text-to-Image

“Vintage gothic Halloween party invitation. Dark parchment poster, spooky border with webs and thorns, central glowing jack-o-lantern, bats, twisted trees, moody night sky. Add elegant gothic title text saying "Halloween Party Invitation", a small scroll banner saying "You are invited to a night of frights", and event details at the bottom: Date: 30.10.2026 Time: 7pm Location: The Arches, NYC Spooky but polished, cinematic lighting, square format.”

Nano Banana

GPT Image 2

AI Judge Analysis

Nano Banana

+ Excellent typography rendering with almost no errors.
+ Very clean and polished composition that feels like a professional digital graphic.
+ The thorn and web border is intricate and well-defined.

− The transition between the central scene and the parchment background feels a bit digital and less integrated.
− The aesthetic is slightly more generic 'modern digital' rather than truly 'vintage'.

GPT Image 2

+ Stronger adherence to the 'vintage' and 'gothic' aesthetic with textured, moody details.
+ Excellent integration of the NYC skyline and arches which references the location prompt.
+ Sophisticated layout with high artistic merit and atmospheric lighting.

− Minor text artifacts, specifically the overlapping 'H' and 'a' in 'Halloween'.
− The border is slightly more cluttered, though it matches the gothic theme.

Verdict: Nano Banana produces a very clean, legible invitation with perfect typography, but GPT Image 2 better captures the 'vintage gothic' mood requested in the prompt. GPT Image 2 also creatively incorporates 'The Arches' and an 'NYC' skyline into the background, whereas Nano Banana uses a generic forest, making the former a more comprehensive interpretation of the prompt despite minor text overlaps.

Vintage Cafe Logo

Text-to-Image

“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”

Nano Banana

GPT Image 2

AI Judge Analysis

Nano Banana

+ Excellent typography with a curved path and correct accents.
+ Clean, minimalist vector feel that suits a modern brand identity.
+ Includes the banner and established date correctly.

− Steam is loosely interpreted as abstract swirls rather than distinct vapor.
− The cloche is very simplified, bordering on two-dimensional.

GPT Image 2

+ Beautifully detailed shading and etching that enhances the vintage aesthetic.
+ Explicitly renders the 'steam' rising from the cloche as requested.
+ Excellent layout with a sophisticated frame and clear hierarchy.

− The font 'FLORIAN' is slightly less 'classic' in character than the script in Model A.
− Texture is a bit heavier, moving away from a 'minimalist' vector style.

Verdict: GPT Image 2 is the stronger output because it captures all prompt elements with high visual fidelity, particularly the steam and the vintage banner. Nano Banana provides a very clean vector logo with superior typography, but GPT Image 2's sophisticated composition and adherence to the 'vintage' texture make it a more evocative and complete response.

Next steps

Explore each model

Nano Banana

Google

Gemini 2.5 Flash Image is optimized for image understanding and generation, offering a balance of price and performance with fast and efficient image generation and editing capabilities.

Vote this model in the arena

Arena profile Lumenfall catalog

GPT Image 2

OpenAI

OpenAI's state-of-the-art image generation model with arbitrary resolution up to 4K and strong instruction following

Vote this model in the arena

Arena profile Lumenfall catalog