Head to head
Esc

Models · slot A

to navigate to pick

Nano Banana 2 Google Stable Diffusion 3.5 Large Stability AI

Settled by community votes across 8 shared challenges, with an AI judge weighing in on each.

Nano Banana 2

29.0 arena score

#1 of 44 in Text-to-Image

Best Text-to-Image right now Top 2 in Image Editing
Skill signature

Not enough comparable category data

The chart appears once both models have ratings across at least three shared arena categories.

Stable Diffusion 3.5 Large

22.9 arena score

#25 of 44 in Text-to-Image

Vote tally

Where the votes landed

Nano Banana 2

66.7%

win rate

Ties

0.0%

Stable Diffusion 3.5 Large

33.3%

win rate

66.7% 0.0% ties 33.3%
Shared challenges 8

Challenge by challenge

The strongest take from each model on every shared challenge, with the AI judge's read.

Geometric Composition

Text-to-Image

“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”

Nano Banana 2
Stable Diffusion 3.5 Large

AI Judge Analysis

Nano Banana 2

  • + Perfect adherence to spatial instructions with the book on top and sphere inside.
  • + Highly realistic textures, especially in the wood grain and glass reflections.
  • + Excellent lighting that correctly illuminates the plant through the glass.

Stable Diffusion 3.5 Large

  • + Crisp and clean modern aesthetic.
  • + Very clear glass rendering with sharp edges.
  • + Good lighting and shadows.
  • Failed the spatial task by putting the book inside the cube and the sphere on top of the book.
  • Plant is mostly behind the cube but doesn't show clearly through the glass as requested.
  • The sphere is light blue/cyan rather than the requested blue.

Verdict: Nano Banana 2 followed every detail of the prompt perfectly, correctly placing the red book on top of the glass cube and the blue sphere inside it. In contrast, Stable Diffusion 3.5 Large failed the spatial logic of the prompt by placing the sphere on top of a book that was inside the cube. Nano Banana 2 also achieved a much higher level of photographic realism and complex light interaction.

Candid Street Photography

Text-to-Image

“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”

Nano Banana 2
Stable Diffusion 3.5 Large

AI Judge Analysis

Nano Banana 2

  • + Excellent photorealism with gritty, natural skin textures and greasy hands.
  • + Rich background detail with authentic-looking Japanese signage and atmospheric lighting.
  • + Perfectly captures the motion blur of passing cars as requested.
  • The bike anatomy is slightly warped, specifically the frame connecting to the rear wheel.
  • The framing is a bit centered despite the request for 'imperfect framing'.

Stable Diffusion 3.5 Large

  • + Strong composition with a clear shallow depth of field.
  • + Accurately represents light rain with visible streaks and surface impact.
  • + Good adherence to the 'imperfect framing' prompt by placing the subject off-center.
  • The subject's hands and the bicycle handlebar area are physically incoherent and merged.
  • The background car lacks the specific 'motion blur' requested, appearing mostly static.
  • Significant anatomical issues where the man's arm meets the bicycle.

Verdict: Nano Banana 2 produces a significantly more realistic and detailed image, specifically in the rendering of the man's skin, clothing, and the surrounding city environment. Stable Diffusion 3.5 Large struggles with the interaction between the man and the bicycle, resulting in merged limbs and structural artifacts, though it handles the 'rain' and 'framing' aspects of the prompt well.

Fantasy Warrior

Text-to-Image

“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”

Nano Banana 2
Stable Diffusion 3.5 Large

AI Judge Analysis

Nano Banana 2

  • + Excellent adherence to all prompt details including beads in hair and ornate engraving.
  • + Superior texture rendering on leather straps, cloth, and skin.
  • + Dynamic lighting with clear bokeh sparks and warm torchlight reflections.
  • The hand holding the sword has slightly merged fingers/anatomical confusion.

Stable Diffusion 3.5 Large

  • + Ornate engraving on the armor is intricate and well-defined.
  • + Strong cinematic focus with good skin textures.
  • Missed the 'small beads' in the hair braid requirement.
  • The lighting feels more like daylight than the requested warm torchlight.
  • Lacks the depth of detail in leather straps and cloth underlayers seen in the other model.

Verdict: Nano Banana 2 followed the prompt much more closely, capturing specific details like the beads in the braids and the rich textures of the leather and cloth underlayers. While Stable Diffusion 3.5 Large produced a high-quality cinematic image, it failed to include the requested beads and opted for a much brighter, less atmospheric lighting setup than requested.

Modern Clean Menu

Text-to-Image

“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”

Nano Banana 2
Stable Diffusion 3.5 Large
100% wins 0% ties 0% wins

AI Judge Analysis

Nano Banana 2

  • + Excellent text legibility with nearly perfect spelling
  • + Professional grid layout that effectively uses vibrant accents for categorization
  • + Highly realistic food photography that matches the menu descriptions
  • Small minor typos in the secondary fine print (e.g., 'somon', 'metrons')

Stable Diffusion 3.5 Large

  • + Strong creative interpretation of a grid using side panels
  • + Bold, modern aesthetic with a focus on high-impact typography
  • Very poor text rendering and spelling across all sections
  • Does not follow the request for a white background with a specific sections-based grid effectively
  • Illegible menu items and prices

Verdict: Nano Banana 2 is much better for this task as it produces a functional, professional, and legible menu design that follows all prompt instructions perfectly. Stable Diffusion 3.5 Large fails significantly on text legibility and structural coherence, resulting in an unusable design for a restaurant.

Isometric Miniature Diorama Scenes

Text-to-Image

“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”

Nano Banana 2
Stable Diffusion 3.5 Large

AI Judge Analysis

Nano Banana 2

  • + Excellent photo-realistic PBR materials and textures
  • + Clean and perfectly rendered typography and flag icon
  • + High visual clarity and sophisticated lighting
  • Lean more toward food photography than a 'cartoon scene'
  • The diorama base is quite flat rather than a playful miniature block

Stable Diffusion 3.5 Large

  • + Stronger adherence to the '3D cartoon' and 'miniature' aesthetic
  • + Good diorama-style raised base
  • + Creative interpretation of garnish as plastic-like 3D assets
  • Failed to place text at top-center as requested, placing it on a flag instead
  • Lower fidelity on textures compared to model A
  • Minor artifacts in the sushi rolls and floating elements

Verdict: Nano Banana 2 produces a high-fidelity image with perfect text rendering and realistic materials, though it leans more toward realism than the 'cartoon' style requested. Stable Diffusion 3.5 Large nails the 'miniature 3D cartoon' look and diorama base perfectly, but it fails the specific layout instructions regarding the text placement and has lower overall texture quality. Nano Banana 2 is the winner for its professional polish and perfect adherence to the textual and icon requirements.

Adorable Baby Animals in Sunny Meadow

Text-to-Image

“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”

Nano Banana 2
Stable Diffusion 3.5 Large

AI Judge Analysis

Nano Banana 2

  • + Excellent adherence to the prompt, including all four specific animals and environmental details.
  • + Very high visual clarity with crisp textures on both the animals and the meadow flowers.
  • + Beautiful lighting with visible god rays and a clear golden hour atmosphere.
  • The kitten's tail looks slightly detached or oddly positioned relative to its body.
  • The butterflies are somewhat static and lack the motion blur seen in the animals.

Stable Diffusion 3.5 Large

  • + Captures a very joyful, expressive 'tumbling' energy with good use of depth of field.
  • + Whimsical lighting and soft bokeh create a dreamlike, masterfully composed scene.
  • Failed to correctly render a tabby kitten, instead generating a generic ginger/fox-like feline.
  • The fox in the background is blurry and less defined compared to the other animals.
  • Missing some of the requested '8K' sharpness, looking more like a digital painting in areas.

Verdict: Nano Banana 2 is the clear winner as it successfully included all four requested animals (puppy, tabby kitten, bunny, and fox) with high detail and fidelity. Stable Diffusion 3.5 Large struggled with the specific animal types, failing to make the kitten a tabby and losing detail on the fox, though it captured a very cute and dynamic sense of movement.

Vintage Cafe Logo

Text-to-Image

“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”

Nano Banana 2
Stable Diffusion 3.5 Large
50% wins 0% ties 50% wins

AI Judge Analysis

Nano Banana 2

  • + Excellent typography with correct spelling and beautiful classic fonts.
  • + Highly professional emblem composition with clean vector styling.
  • + Accurate and cohesive color palette that feels authentic to the vintage theme.
  • The layout is slightly more complex/detailed than a strictly 'minimalist' prompt might suggest.

Stable Diffusion 3.5 Large

  • + Successfully captures the requested subtle texture on a light background.
  • + Clean, minimalist iconography with the cloche and steam elements.
  • + Good use of warm brown and cream tones.
  • Misspells 'Caffè' as 'Cafféé' in the primary text.
  • The cloche graphic is oddly split, creating a strange visual gap between the lid and the base.
  • Composition is vertically stretched and lacks the unified feel of a professional logo.

Verdict: Nano Banana 2 is the clear winner as it provides a professional-grade logo with perfect spelling and a sophisticated vintage aesthetic. Stable Diffusion 3.5 Large struggles with text accuracy and an awkward, disconnected cloche illustration that disrupts the minimalist design.

Apollo 11: Journey to Tranquility

Text-to-Image

“Create a clean, modern vector infographic poster about the Apollo 11 mission. NASA-inspired palette (navy, white, muted red, light gray). Flat-vector style, crisp lines, consistent iconography, subtle gradients only. Steps (stop at landing): 1. Launch (Saturn Vicon) 2. Earth Orbit (Earth + orbit ring icon) 3. Translunar (trajectory arc icon) 4. Lunar Orbit (Moon + orbit ring icon) 5. Descent (lunar module descending icon) 6. Landing (lunar module on the surface icon) Small supporting elements (minimal text): • Crew strip: three silhouette icons with only last names: Armstrong, Aldrin, Collins. • Landing site marker: Moon pin labeled "Tranquility" only. Layout constraints: generous margins, large readable labels, clean background with subtle stars. Vector-only, print-poster look, high resolution.”

Nano Banana 2
Stable Diffusion 3.5 Large

AI Judge Analysis

Nano Banana 2

  • + Excellent text rendering with legible and accurate labels.
  • + Perfect adherence to the requested six mission steps with corresponding icons.
  • + Clean, professional vector aesthetic that matches the requested NASA palette.
  • The icon for the fourth astronaut silhouette is unnecessary/redundant.

Stable Diffusion 3.5 Large

  • + Atmospheric design with a grand sense of scale.
  • + Sophisticated use of the color palette.
  • Fails to follow the specific six-step mission sequence requested.
  • Text and labels are largely gibberish or heavily distorted.
  • Incorrectly includes a space shuttle-style vehicle instead of a Saturn V rocket.

Verdict: Nano Banana 2 followed the prompt instructions near-perfectly, creating a logical, legible, and aesthetically pleasing infographic that accurately depicted all six mission phases. In contrast, Stable Diffusion 3.5 Large failed on almost every technical requirement, producing illegible text, an incorrect rocket type, and a disorganized layout that did not follow the requested steps.

Next steps

Explore each model