Head to head
Esc

Models · slot A

to navigate to pick

Nano Banana 2 Lite Google Stable Diffusion 3.5 Large Stability AI

Settled by community votes across 13 shared challenges, with an AI judge weighing in on each.

Nano Banana 2 Lite

22.3 arena score

#27 of 48 in Text-to-Image

Skill signature · Text-to-Image

Stable Diffusion 3.5 Large

22.9 arena score

#25 of 48 in Text-to-Image

Vote tally

Where the votes landed

Nano Banana 2 Lite

0%

win rate

Ties

0%

Stable Diffusion 3.5 Large

0%

win rate

Shared challenges 13

Challenge by challenge

The strongest take from each model on every shared challenge, with the AI judge's read.

Geometric Composition

Text-to-Image

“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”

Nano Banana 2 Lite
Stable Diffusion 3.5 Large

AI Judge Analysis

Nano Banana 2 Lite

  • + Perfectly adheres to all spatial instructions in the prompt
  • + Excellent realism with natural textures and convincing refraction
  • + Cleverly includes thematic text on the book spine that relates to the prompt
  • The blue sphere appears to float mid-air within the cube without support

Stable Diffusion 3.5 Large

  • + High resolution with very sharp edges and clear glass reflections
  • + Natural lighting and shadows are well-rendered
  • Failed the spatial instruction to place the book on top of the cube
  • The cube appears to be a five-sided lid rather than a solid or enclosed cube
  • The blue sphere is quite large for being described as a small sphere

Verdict: Nano Banana 2 Lite followed every detail of the prompt, including the specific placement of the red book on top of the cube and the plant behind it. Stable Diffusion 3.5 Large produced a high-quality image but failed the core spatial reasoning by placing the cube on top of the book and making the sphere too large.

Candid Street Photography

Text-to-Image

“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”

Nano Banana 2 Lite
Stable Diffusion 3.5 Large

AI Judge Analysis

Nano Banana 2 Lite

  • + Excellent adherence to all prompt details including tools and rain environment
  • + Highly realistic skin textures and lighting
  • + Captures the 'candid' and 'imperfect framing' requested through a busy background
  • The hands on the wrench are slightly jumbled and anatomically messy
  • The background text contains some nonsensical characters

Stable Diffusion 3.5 Large

  • + Strong composition with a clear focus on the subject
  • + Good depiction of the red bicycle and wet pavement reflections
  • Missed the 'repairing' aspect as no tools are present and the pose is ambiguous
  • Lacks the requested 'motion blur' on the background car
  • The rain effect looks like a static overlay rather than part of the scene physics

Verdict: Nano Banana 2 Lite produced a much more faithful interpretation of the prompt, including the specific tools for the repair and the requested motion blur on passing vehicles. While Stable Diffusion 3.5 Large has a clean aesthetic, it missed several technical constraints of the prompt and felt less like a 'candid street photo' and more like a posed digital creation.

Fantasy Warrior

Text-to-Image

“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”

Nano Banana 2 Lite
Stable Diffusion 3.5 Large

AI Judge Analysis

Nano Banana 2 Lite

  • + Excellent facial textures and realistic skin imperfections including grime and scars
  • + Superior handling of lighting with warm torchlight glints on the metal and deep shadows
  • + Accurate depiction of hair beads and braided hair as requested
  • Image is framed with unrelated white/blurred bars on the sides
  • Armor engraving is slightly less detailed than in the competitor's image

Stable Diffusion 3.5 Large

  • + Intricate and high-contrast engraving on the plate armor
  • + Good adherence to the 'bokeh sparks' and 'battle-worn' aspects of the prompt
  • + Solid composition and sharp focus on the subject's face
  • Missed the request for small beads in the braided hair
  • The lighting feels a bit more generic and less like natural torchlight compared to Model A
  • Leather strap detail is less prominent

Verdict: Nano Banana 2 Lite produced a more atmospheric and lifelike portrait with superior skin textures and lighting, despite the odd framing artifacts. Stable Diffusion 3.5 Large delivered impressive armor detail and clean composition but failed to include the requested hair beads and lacked the same depth of facial character.

Modern Clean Menu

Text-to-Image

“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”

Nano Banana 2 Lite
Stable Diffusion 3.5 Large

AI Judge Analysis

Nano Banana 2 Lite

  • + Excellent adherence to the menu structure with distinct sections for Appetizers, Pizza, and Mains.
  • + The layout is highly professional and ready for casual dining use.
  • + Text rendering is remarkably clear and mostly legible for a graphic design prompt.
  • Small minor spelling errors in descriptions (e.g., 'Cremy', 'Tomctoes').

Stable Diffusion 3.5 Large

  • + High-quality food photography with vibrant colors.
  • + Strong, bold typography for the main title.
  • The layout is cluttered and unconventional, more like a collage than a menu.
  • Poor spelling and nonsensical header text (e.g., 'APPETIZRS FHOPEADRE', 'MAIMAES').
  • The grid of photos obscures the actual menu content.

Verdict: Nano Banana 2 Lite significantly outperforms Stable Diffusion 3.5 Large by understanding the functional requirements of a menu design, providing a logical hierarchy and readable text. While Stable Diffusion 3.5 Large has high-quality food photography, its layout is chaotic and fails to present the information in a professional, usable format.

Magic Burger Explosion: Fiery Photorealism Challenge

Text-to-Image

“Ad for 'Magic Burger'. Dynamic, exploded burger with all components (bun, patty, cheese, lettuce, tomato, sauce) suspended in mid-air. Emphasize photorealistic detail and a sense of motion. Dark, fiery background with glowing embers. Integrate text: 'MAGIC BURGER' as a prominent title, 'LIMITED TIME ONLY' as a secondary message, and '€6.99' in a starburst, all rendered with a fiery, glowing effect.”

Nano Banana 2 Lite
Stable Diffusion 3.5 Large

AI Judge Analysis

Nano Banana 2 Lite

  • + Excellent text integration with accurate spelling and fiery effects.
  • + Perfect adherence to the 'exploded' burger layout with clear separation of ingredients.
  • + Commercial-ready composition with a distinct starburst and branding elements.
  • Texture on the tomato and lettuce looks slightly more digital than organic.
  • The sauce droplets are a bit repetitive in shape.

Stable Diffusion 3.5 Large

  • + High textural realism on the patties and the charcoal in the background.
  • + Strong sense of heat and lighting reflection on the bottom bun.
  • Completely failed to include the requested text.
  • Ignored the 'exploded' instruction, providing a stacked burger instead of one with suspended components.
  • Lacks the starburst element mentioned in the prompt.

Verdict: Nano Banana 2 Lite is the clear winner as it followed every instruction in the prompt, including the complex text rendering and the 'exploded' layout. Stable Diffusion 3.5 Large produced a high-quality image of a burger on fire, but it failed to include any of the requested text or the specific structural composition requested.

Chalkboard Menu

Text-to-Image

“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”

Nano Banana 2 Lite
Stable Diffusion 3.5 Large

AI Judge Analysis

Nano Banana 2 Lite

  • + Excellent text rendering with perfect spelling of all requested items.
  • + The chalk texture is highly realistic with believable variations in stroke weight.
  • + Good composition with a clear focus on the menu board and a warm café atmosphere in the background.
  • The small text at the bottom appears a bit too clean and uniform compared to the hand-drawn headers.

Stable Diffusion 3.5 Large

  • + Successfully captures the 'cozy café' environment with furniture and plants.
  • + The layout of the chalkboard feels design-oriented with structured boxes.
  • Numerous spelling errors including 'Todaay', 'Ottpups', and 'Cholcalte'.
  • Failed the date requirement, showing 2024 instead of 2026.
  • The handwriting looks more like a digital font than natural chalk strokes.

Verdict: Nano Banana 2 Lite significantly outperformed Stable Diffusion 3.5 Large by following the prompt's text requirements exactly, providing perfectly spelled items and the correct year. Stable Diffusion 3.5 Large struggled with many spelling hallucinations and failed to replicate a convincing hand-drawn chalk texture.

The Reversed Rodeo

Text-to-Image

“Horse riding astronaut in space — horse on top, not vice versa. Surreal, highly detailed, cinematic.”

Nano Banana 2 Lite
Stable Diffusion 3.5 Large

AI Judge Analysis

Nano Banana 2 Lite

  • + Excellent adherence to the 'horse on top' spatial instruction
  • + Vibrant, high-contrast cinematic lighting and color palette
  • + Clever interpretation of a saddle/platform for the astronaut
  • The horse's rear right leg is anatomically distorted/severed
  • Multiple floating planets and stars create a slightly cluttered background

Stable Diffusion 3.5 Large

  • + Natural composition and atmospheric blending with space dust/clouds
  • + Consistent lighting and high-quality textures on the spacesuit and horse
  • Failed the negative constraint; the astronaut is riding the horse, not vice versa
  • The horse's muzzle/head area has minor artifacts and extra straps

Verdict: Nano Banana 2 Lite successfully followed the specific and difficult instruction to have the horse on top of the astronaut, whereas Stable Diffusion 3.5 Large defaulted to the common trope of an astronaut riding a horse. Although Nano Banana 2 Lite has some anatomical issues with the horse's leg, its prompt adherence makes it the clear winner for this specific challenge.

The Capybara Taxi Driver

Text-to-Image

“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”

Nano Banana 2 Lite
Stable Diffusion 3.5 Large

AI Judge Analysis

Nano Banana 2 Lite

  • + Excellent adherence to the complex prompt including the businesswoman in the back seat.
  • + Very realistic lighting and textures for a rainy New York night.
  • + The capybara's expression and posture perfectly match the 'professional' requirement.
  • The cap is more of a police-style hat than a traditional yellow taxi cap.
  • The interior of the taxi looks a bit worn and dirty.

Stable Diffusion 3.5 Large

  • + Vibrant colors with a clear yellow taxi theme.
  • + The capybara's clothing is stylish and high-quality in texture.
  • Completely failed to include the human businesswoman in the back seat.
  • The capybara's hands are not on the steering wheel as requested; they are resting on its lap.
  • The composition is a side-profile that misses most of the taxi interior.

Verdict: Nano Banana 2 Lite followed every detail of the prompt, successfully capturing the surreal narrative of a capybara driving a human passenger who is bored on her phone. Stable Diffusion 3.5 Large failed to include the passenger entirely and missed the specific positioning of the paws on the wheel, resulting in a much simpler character portrait rather than a scene.

The Halloween Invitation

Text-to-Image

“Vintage gothic Halloween party invitation. Dark parchment poster, spooky border with webs and thorns, central glowing jack-o-lantern, bats, twisted trees, moody night sky. Add elegant gothic title text saying "Halloween Party Invitation", a small scroll banner saying "You are invited to a night of frights", and event details at the bottom: Date: 30.10.2026 Time: 7pm Location: The Arches, NYC Spooky but polished, cinematic lighting, square format.”

Nano Banana 2 Lite
Stable Diffusion 3.5 Large

AI Judge Analysis

Nano Banana 2 Lite

  • + Flawless text rendering for all requested information including the date, time, and location.
  • + Highly detailed and cohesive gothic aesthetic with intricate borders and cinematic lighting.
  • + Strong adherence to the 'central jack-o-lantern' instruction with a prominent and well-lit subject.
  • The layout is a bit crowded with many competing elements (skulls, witches, graves, etc.).

Stable Diffusion 3.5 Large

  • + Good use of negative space and atmospheric moonlight.
  • + The scroll banner and parchment texture are well-rendered.
  • Failed to include the required event details (Date, Time, Location) at the bottom.
  • Text in the banner contains minor artifacts and spelling distortions ('nigit', 'A koss').
  • Missing the 'central' jack-o-lantern, opting for several smaller ones on the sides instead.

Verdict: Nano Banana 2 Lite is the clear winner as it followed every instruction in the prompt, especially the complex task of rendering specific event details without errors. Stable Diffusion 3.5 Large failed to include the event details and struggled with the 'central' placement of the jack-o-lantern, resulting in a less functional invitation.

Isometric Miniature Diorama Scenes

Text-to-Image

“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”

Nano Banana 2 Lite
Stable Diffusion 3.5 Large

AI Judge Analysis

Nano Banana 2 Lite

  • + Excellent typography rendering that is perfectly integrated into the graphic design.
  • + Accurate sushi anatomy and textures including realistic rice grains and fish sheen.
  • + Perfectly adheres to the 'clean' and 'minimal' aesthetic requested.
  • The diorama base is a bit large compared to the plate size.

Stable Diffusion 3.5 Large

  • + Rich, vibrant colors and detailed food textures.
  • + Follows the isometric layout and diorama base instruction well.
  • Failed to render the text as a graphic overlay, instead placing it on a sign within the scene.
  • The rice grains look slightly gelatinous or plastic in some areas.
  • Includes excessive garnish that clashes with the 'minimal' requirement.

Verdict: Nano Banana 2 Lite is the clear winner as it perfectly captured the graphic design elements of the prompt, including the floating text and flag icon, while maintaining a clean, professional aesthetic. Stable Diffusion 3.5 Large struggled with the text placement and the 'minimal' constraint, resulting in a more cluttered scene.

Adorable Baby Animals in Sunny Meadow

Text-to-Image

“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”

Nano Banana 2 Lite
Stable Diffusion 3.5 Large

AI Judge Analysis

Nano Banana 2 Lite

  • + Excellent depiction of all four distinct animals in a single frame.
  • + Beautiful backlighting with clearly defined god rays and dew sparkles.
  • + Highly detailed fur textures and sharp focus across all subjects.
  • The fox's anatomy is slightly strange where it touches the dog.
  • The kitten has an oddly long, striped tail that looks a bit disjointed.

Stable Diffusion 3.5 Large

  • + Captures the 'playfully chasing' aspect of the prompt very well.
  • + Soft, dreamy lighting that fits the 'wholesome vibe' perfectly.
  • + The centered golden retriever puppy has a very expressive, joyful face.
  • Missed the 'tabby' requirement for the kitten, rendering it a solid ginger color.
  • Lower overall sharpness and more background noise compared to Model A.
  • Lighting is a bit washed out with less 'hyper-photorealistic' detail.

Verdict: Nano Banana 2 Lite is the winner as it accurately rendered all four requested animals with distinct characteristics, including the tabby markings and the fox kit, which Stable Diffusion 3.5 Large struggled to differentiate clearly. Nano Banana 2 Lite also provided a higher level of detail in the fur and environment, better matching the '8K masterpiece' requirement.

Vintage Cafe Logo

Text-to-Image

“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”

Nano Banana 2 Lite
Stable Diffusion 3.5 Large

AI Judge Analysis

Nano Banana 2 Lite

  • + Excellent typography including the requested accent on 'Caffè'.
  • + Clean vector emblem style with balanced circular composition.
  • + Perfect adherence to the requested 'Est. 1720' banner and cloche imagery.
  • The steam curls are a bit more ornate than 'minimalist' usually suggests.

Stable Diffusion 3.5 Large

  • + Good use of subtle textures on the background as requested.
  • + Creative use of a banner for the main brand name.
  • Spelling error in the brand name ('Cafféé' instead of 'Caffè').
  • The cloche is detached and floating in a way that looks awkward.
  • The steam/flame element inside the cloche is messy and lacks vector clarity.

Verdict: Nano Banana 2 Lite followed all prompt instructions perfectly, including accurate spelling and a professional vector aesthetic. Stable Diffusion 3.5 Large struggled with text rendering and the structural integrity of the cloche graphic, resulting in a less polished logo.

Apollo 11: Journey to Tranquility

Text-to-Image

“Create a clean, modern vector infographic poster about the Apollo 11 mission. NASA-inspired palette (navy, white, muted red, light gray). Flat-vector style, crisp lines, consistent iconography, subtle gradients only. Steps (stop at landing): 1. Launch (Saturn Vicon) 2. Earth Orbit (Earth + orbit ring icon) 3. Translunar (trajectory arc icon) 4. Lunar Orbit (Moon + orbit ring icon) 5. Descent (lunar module descending icon) 6. Landing (lunar module on the surface icon) Small supporting elements (minimal text): • Crew strip: three silhouette icons with only last names: Armstrong, Aldrin, Collins. • Landing site marker: Moon pin labeled "Tranquility" only. Layout constraints: generous margins, large readable labels, clean background with subtle stars. Vector-only, print-poster look, high resolution.”

Nano Banana 2 Lite
Stable Diffusion 3.5 Large

AI Judge Analysis

Nano Banana 2 Lite

  • + Excellent text rendering with clear, legible English throughout.
  • + Strict adherence to the requested six-step sequence of events.
  • + Clean, professional flat-vector aesthetic that matches the 'modern infographic' prompt perfectly.
  • The transition line between steps 4 and 5 is a bit abrupt.
  • The rendering of the lunar module in step 5 is slightly more detailed than a simple 'icon' style.

Stable Diffusion 3.5 Large

  • + Atmospheric NASA-inspired color palette with a nice navy background.
  • + Creative use of space and planet textures.
  • Total failure to render legible text or correct spelling.
  • Incorrect iconography (shows a space shuttle instead of a Saturn V rocket).
  • Chaotic layout that fails to follow the requested sequential steps (1-6).

Verdict: Nano Banana 2 Lite produced a high-quality, professional-grade infographic that perfectly followed every instruction in the prompt, including complex text and specific historical steps. In contrast, Stable Diffusion 3.5 Large failed on almost every detail, providing garbled text, the wrong type of spacecraft, and an unreadable layout. Nano Banana 2 Lite is the clear winner for its functional design and perfect prompt adherence.

Next steps

Explore each model