OpenAI's previous generation image model with higher quality than DALL-E 2 and support for larger resolutions
Settled by community votes across 12 shared challenges, with an AI judge weighing in on each.
DALL-E 3
#35 of 44 in Text-to-Image
Not enough comparable category data
The chart appears once both models have ratings across at least three shared arena categories.
Nano Banana
#20 of 44 in Text-to-Image
Where the votes landed
DALL-E 3
0.0%
win rate
Ties
50.0%
Nano Banana
50.0%
win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Geometric Composition
Text-to-Image“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”
AI Judge Analysis
DALL-E 3
- + Excellent texture on the wood and book binding
- + Complex and creative interpretation of the sphere's interior
- + Good use of depth and lighting
- − Failed the spatial prompt: the book is inside the cube rather than on top of it
- − The cube has a wooden frame not mentioned in the prompt
Nano Banana
- + Perfect adherence to spatial instructions including object placement
- + Accurate representation of 'glass cube' without a frame
- + Natural and realistic lighting following the requested direction
- − The blue sphere is floating unnaturally in the center
- − Overall image is slightly softer and less detailed than the competitor
Verdict: Nano Banana followed every spatial instruction perfectly, correctly placing the red book on top and the sphere inside the cube. DALL-E 3 failed the prompt logic by placing the sphere on top of the book and the book inside the cube, despite having higher technical detail.
Candid Street Photography
Text-to-Image“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”
AI Judge Analysis
DALL-E 3
- + Excellent composition with a unique foreground framing element
- + Beautiful, clear reflection on the wet pavement
- + Atmospheric lighting that creates a cinematic mood
- − Anatomical issues with the man's feet and the way he is crouching bare-toed on wet asphalt
- − The man appears more frail/exhausted than simply 'repairing' a bike
Nano Banana
- + Highly realistic skin textures and clothing details
- + Consistent anatomy and a very natural pose
- + Faithful adherence to the 50mm lens look with subtle motion blur on the background car
- − The red bicycle frame is slightly less vibrant than requested
- − Slightly more 'staged' feel compared to the raw candid request
Verdict: Nano Banana is the clear winner as it achieves a level of photorealism that DALL-E 3 cannot match, particularly regarding the human subject's anatomy and skin texture. While DALL-E 3 offers a more artistic and interesting composition, it fails on the 'no stylization' and 'natural skin texture' requirements, whereas Nano Banana delivers a believable, cinematic street photograph.
Fantasy Warrior
Text-to-Image“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”
AI Judge Analysis
DALL-E 3
- + Excellent metallic texture and complex engraving detail
- + Striking lighting with vibrant bokeh sparks
- + High skin texture realism including realistic beard stubble
- − Hair braids are messy and lack the requested beads
- − Anatomical oddity with the visible ear appearing misplaced behind the helmet neck guard
Nano Banana
- + Perfect adherence to the 'braided with small beads' requirement
- + Highly detailed leather straps, buckles, and cloth underlayers
- + Solid cinematic composition with clear torchlight sources
- − Scars look like surface-level scratches rather than deep battle-worn marks
- − Overall image has a slightly more 'rendered' digital look compared to the organic feel of A
Verdict: Both models performed well, but Nano Banana adhered much better to the specific technical details of the prompt, particularly the beaded braids and the requested leather and cloth textures. While DALL-E 3 produced a more high-contrast, visually striking portrait, it omitted the beads and had some minor structural issues with the character's ear.
Modern Clean Menu
Text-to-Image“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”
AI Judge Analysis
DALL-E 3
- + Excellent high-end aesthetic that feels like a professional editorial spread.
- + Richly detailed food photography with varied color palettes.
- + Captures the 'bold sans-serif' and 'modern minimalist' vibes well.
- − Internal text is largely illegible gibberish.
- − Layout is a bit cluttered and less functional as an actual menu.
- − Poor spelling on major headers like 'PIZAS' and 'APPGETIZERS'.
Nano Banana
- + Text is highly legible with accurate spelling in most places.
- + The grid layout is much cleaner and logically organized for a restaurant.
- + Excellent use of the specified sections (Appetizers, Pizza, Mains) and vibrant accents.
- − Food photography is slightly more generic/stock-like compared to Model A.
- − Minor typos present such as 'APPETIERS' and 'REMAIGINED'.
- − The yellow photo borders are a bit thick for a truly minimalist aesthetic.
Verdict: Model B (Nano Banana) is the winner because it successfully creates a functional, readable menu that adheres strictly to all parts of the prompt, including the specific sections requested. While Model A (DALL-E 3) has much more artistic food photography, its chaotic layout and nonsensical text make it unusable as a design template compared to the clear structure of Model B.
Magic Burger Explosion: Fiery Photorealism Challenge
Text-to-Image“Ad for 'Magic Burger'. Dynamic, exploded burger with all components (bun, patty, cheese, lettuce, tomato, sauce) suspended in mid-air. Emphasize photorealistic detail and a sense of motion. Dark, fiery background with glowing embers. Integrate text: 'MAGIC BURGER' as a prominent title, 'LIMITED TIME ONLY' as a secondary message, and '€6.99' in a starburst, all rendered with a fiery, glowing effect.”
AI Judge Analysis
DALL-E 3
- + Excellent photorealistic texture on the burger bun and patty.
- + Highly dynamic lighting with vibrant fire and ember effects.
- + Creative use of flying ingredients beyond the main stack.
- − Several spelling errors in the text including 'AGIC BURGR' and 'Limiited'.
- − The price is not rendered in a starburst as requested.
- − The text layout feels a bit cluttered and overlaps the main subject.
Nano Banana
- + Perfect text rendering with no spelling errors.
- + Strict adherence to the 'starburst' and 'fiery glowing effect' for the text and price.
- + Excellent composition with a clear, dynamic exploded view.
- − The food textures are slightly less detailed/sharp than Model A.
- − The background is more generic compared to the high-intensity ground effects in Model A.
Verdict: Nano Banana is the superior choice for this task due to its perfect adherence to the text requirements and layout instructions, whereas DALL-E 3 struggled significantly with spelling and specific design elements like the starburst. Nano Banana successfully integrated the fiery theme into the typography while maintaining a clean, professional ad layout.
Chalkboard Menu
Text-to-Image“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”
AI Judge Analysis
DALL-E 3
- + Excellent chalk-like texture and artistic blackboard aesthetic.
- + Warm, atmospheric lighting and composition.
- − Numerous spelling errors including 'Trufle', 'Occtus', and 'Grililled'.
- − Failed to render the full text for the final menu item.
- − Incorrect prices shown (e.g., $234).
Nano Banana
- + Perfect text accuracy for all prompted items including the final item completion.
- + Highly realistic chalkboard smudges and authentic chalk handwriting appearance.
- + Clean, legible composition that follows the requested formatting exactly.
- − Background is slightly more generic than Model A.
- − Handwriting is very consistent, appearing almost like a digital font despite the smear effects.
Verdict: Nano Banana is the clear winner as it followed every instruction, including the correct spelling of complex menu items and completing the unfinished 'Brown But...' prompt as 'Brown Butter Chocolate Chip Cookies'. DALL-E 3 suffered from significant hallucinations and spelling errors throughout the menu text.
The Capybara Taxi Driver
Text-to-Image“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”
AI Judge Analysis
DALL-E 3
- + Excellent fur detail and lighting coherence
- + Professional and clean graphic quality on the hat
- + Accurately depicts the bored businesswoman looking at her phone
- − The steering wheel and paws are not clearly visible in the foreground
- − The jacket color is yellow instead of the requested dark jacket
Nano Banana
- + Successfully shows the capybara's paws on the steering wheel as requested
- + Includes the dark jacket and professional driving posture
- + Very realistic 'candid' photographic style
- − The passenger appears to be in the front seat rather than the back seat
- − The hat is very small and sits unnaturally on the head
- − The transition between the capybara's head and the human-like hand is awkward
Verdict: DALL-E 3 produces a more visually polished and 'cinematic' image, but it fails to show the driving action (paws on wheel) and misses the jacket color instruction. Nano Banana adheres much better to the specific actions and attire described in the prompt, despite placing the passenger in the front seat and having slightly less refined textures.
The Halloween Invitation
Text-to-Image“Vintage gothic Halloween party invitation. Dark parchment poster, spooky border with webs and thorns, central glowing jack-o-lantern, bats, twisted trees, moody night sky. Add elegant gothic title text saying "Halloween Party Invitation", a small scroll banner saying "You are invited to a night of frights", and event details at the bottom: Date: 30.10.2026 Time: 7pm Location: The Arches, NYC Spooky but polished, cinematic lighting, square format.”
AI Judge Analysis
DALL-E 3
- + Exquisite visual depth with complex layered textures
- + Highly artistic 3D relief style that feels like a physical object
- + Mood matches the 'dark parchment' and 'gothic' descriptions perfectly
- − Text rendering is largely illegible/gibberish beyond the word 'Halloween'
- − Missed some text components like the specific scroll banner content
- − Layout is crowded with elements overlapping the text regions
Nano Banana
- + Perfect text accuracy for all requested fields including date and location
- + Followed all layout instructions including the scroll banner and specific border elements
- + Clear and readable hierarchy of information suitable for an actual invitation
- − Visual style is somewhat generic and feels like stock clip art rather than 'cinematic'
- − Lighting is flat compared to the atmospheric depth requested
- − The border is a bit repetitive in its texture
Verdict: While DALL-E 3 (Model A) creates a much more atmospheric and visually stunning gothic artwork, it fails significantly on the textual requirements of an invitation. Nano Banana (Model B) followed every specific text prompt perfectly, making it a functional invitation despite having a simpler, more digital illustration style. Nano Banana is the winner for following the complex text and layout instructions that DALL-E 3 ignored.
Isometric Miniature Diorama Scenes
Text-to-Image“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”
AI Judge Analysis
DALL-E 3
- + Excellent 3D toy-like aesthetic with high-quality global illumination.
- + Unique and creative interpretation of 'miniature' with stylized puffy textures.
- + Strong color palette and very clean rendering.
- − Failed to place 'SUSHI' text at the top-center as requested.
- − The text 'JAPAN' is embedded on the side of the diorama rather than at the top.
- − The sushi design is abstract and somewhat repetitive.
Nano Banana
- + Followed all text instructions perfectly, including 'JAPAN', 'SUSHI', and flag placement.
- + Very accurate isometric 45-degree perspective.
- + Varied sushi types (nigiri and maki) which provides better visual variety.
- − Lighting is a bit flat compared to the other model.
- − The 3D materials look slightly more generic and less 'refined' or 'premium'.
- − The rice texture is a bit grainy.
Verdict: Nano Banana is the clear winner for prompt adherence, accurately placing the requested text and flag at the top-center while maintaining the isometric diorama style. While DALL-E 3 produced a more visually striking 3D render with beautiful lighting, it completely failed to follow the specific layout and text requirements of the prompt.
Adorable Baby Animals in Sunny Meadow
Text-to-Image“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”
AI Judge Analysis
DALL-E 3
- + Extremely vibrant colors and lighting with distinct god rays
- + High-detail texture on the fur of each animal
- + Very lush and dense floral environment
- − Failed to include a tabby kitten, instead doubling up on rabbits and foxes
- − Anatomical oddities like the fox's oversized, human-like paw and a kitten-fox hybrid face
- − The composition feels cluttered and less like a natural photo
Nano Banana
- + Included all four requested animals correctly: puppy, kitten, bunny, and fox kit
- + Captures the 'tumbling' and 'playful' action much better with the fox on its back
- + More naturalistic lighting and better spatial composition
- − The butterfly on the puppy's ear is slightly flat/artificial
- − Lower density of flowers compared to the other model
Verdict: Nano Banana is the clear winner as it followed the prompt's specific species list perfectly, including the tabby kitten that DALL-E 3 missed entirely. Furthermore, Nano Banana captured the playful, tumbling interaction between the animals with much more convincing anatomy and a cleaner composition.
Vintage Cafe Logo
Text-to-Image“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”
AI Judge Analysis
DALL-E 3
- + Strong vector aesthetic with high contrast
- + Excellent use of the warm brown and cream color palette
- + Accurate rendering of the cloche dome and steam iconography
- − Completely failed to include the requested brand name 'Caffè Florian', substituting it with generic text
- − The layout is a bit cluttered for a 'minimalist' prompt
Nano Banana
- + Perfect text adherence with accurate spelling of 'Caffè Florian'
- + Clean minimalist composition that fits the vector emblem style well
- + Sophisticated integration of the 'Est. 1720' banner
- − Steam effect is very subtle and looks more like decorative swirls
- − The cloche is slightly simplified compared to the detail in Model A
Verdict: Nano Banana is the clear winner because it correctly followed the most important instruction: the brand name 'Caffè Florian'. While DALL-E 3 produced a high-quality graphic, it substituted the requested name for 'Coffee House', making it unusable for the specific request, whereas Nano Banana provided a professional, minimalist logo that met all prompt criteria including typography and color.
Apollo 11: Journey to Tranquility
Text-to-Image“Create a clean, modern vector infographic poster about the Apollo 11 mission. NASA-inspired palette (navy, white, muted red, light gray). Flat-vector style, crisp lines, consistent iconography, subtle gradients only. Steps (stop at landing): 1. Launch (Saturn Vicon) 2. Earth Orbit (Earth + orbit ring icon) 3. Translunar (trajectory arc icon) 4. Lunar Orbit (Moon + orbit ring icon) 5. Descent (lunar module descending icon) 6. Landing (lunar module on the surface icon) Small supporting elements (minimal text): • Crew strip: three silhouette icons with only last names: Armstrong, Aldrin, Collins. • Landing site marker: Moon pin labeled "Tranquility" only. Layout constraints: generous margins, large readable labels, clean background with subtle stars. Vector-only, print-poster look, high resolution.”
AI Judge Analysis
DALL-E 3
- + Excellent artistic style with a retro-future aesthetic.
- + Captures the NASA-inspired palette effectively across complex layouts.
- + High visual density and interesting textures.
- − Completely fails to follow the logical step-by-step sequence requested.
- − Inaccurate iconography such as space shuttles instead of the Saturn V.
- − Text is mostly illegible gibberish.
Nano Banana
- + Perfect adherence to the requested six-step sequence with matching iconography.
- + Includes accurate and legible text for the mission stages and crew names.
- + Strictly follows the flat-vector style and NASA color palette.
- − The composition is somewhat simplistic with a lot of empty negative space.
- − Iconography is very basic compared to the stylistic potential of the prompt.
Verdict: While DALL-E 3 produces a more visually stunning set of posters, it fails the basic instructional requirements of the prompt by including shuttles and ignoring the step-by-step logic. Nano Banana follows the instructions perfectly, creating a functional, legible, and accurate infographic with all six steps clearly labeled.
Explore each model
Gemini 2.5 Flash Image is optimized for image understanding and generation, offering a balance of price and performance with fast and efficient image generation and editing capabilities.