OpenAI's legacy image generation model supporting generations, edits with masks (inpainting), and variations
Settled by community votes across 13 shared challenges, with an AI judge weighing in on each.
DALL-E 2
#37 of 44 in Text-to-Image
Not enough comparable category data
The chart appears once both models have ratings across at least three shared arena categories.
Nano Banana Pro
#2 of 44 in Text-to-Image
Where the votes landed
DALL-E 2
25.0%
win rate
Ties
0.0%
Nano Banana Pro
75.0%
win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Geometric Composition
Text-to-Image“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”
AI Judge Analysis
DALL-E 2
- + Features a realistic wood texture on the table.
- − Fails significantly on spatial requirements and object placement
- − The cube itself is blue rather than clear with a sphere inside
- − Extremely poor composition with large parts of objects cut off
Nano Banana Pro
- + Follows every specific detail of the prompt accurately.
- + High visual clarity and realistic lighting effects from the window.
- + Excellent text rendering on the book cover and realistic refraction through the glass.
- − The glass cube has a visible vertical seam in the center which might look slightly illogical.
Verdict: Nano Banana Pro successfully rendered all elements of the complex prompt including spatial relationships and transparency, resulting in a cohesive and high-quality image. DALL-E 2 failed on almost every criterion, producing a poorly composed shot where the cube is the wrong color and most objects are out of frame or missing.
Candid Street Photography
Text-to-Image“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”
AI Judge Analysis
DALL-E 2
- + Captured reflections on wet pavement effectively
- + Attempted an imperfect, candid-style artistic framing
- − Extreme blur obscures the main subject making details indistinguishable
- − Light and color are overblown and lack realism
Nano Banana Pro
- + Exceptional realism with natural skin texture and high detail
- + Successfully incorporated all prompt elements including rain, reflections, and motion blur
- + Highly accurate rendering of a Japanese street setting and bicycle
- − The framing is quite centered, missing the 'imperfect framing' request slightly
Verdict: Nano Banana Pro significantly outperformed DALL-E 2 by providing a sharp, high-fidelity image that looks like a real photograph. While DALL-E 2 captured the abstract mood of the prompt, it failed to render a recognizable man or bicycle, whereas Nano Banana Pro mastered the skin textures, rain effects, and complex background elements.
Fantasy Warrior
Text-to-Image“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”
AI Judge Analysis
DALL-E 2
- + Features large bokeh as requested
- − Poor image resolution and clarity
- − Anatomical distortion in the face making it look like a statue
- − Lacks specific details like braided hair with beads and leather texture
Nano Banana Pro
- + Excellent adherence to all prompt details including braided hair, beads, and scars
- + Highly detailed textures on the engraved armor and leather straps
- + Realistic lighting and lifelike eyes
- − The torch in the foreground is slightly out of focus
Verdict: Nano Banana Pro significantly outperforms DALL-E 2 by accurately capturing every complex detail of the prompt, from the beaded braids to the intricate armor engravings. DALL-E 2 produced a low-quality, muddy image that resembles a melting statue rather than a lifelike paladin.
Modern Clean Menu
Text-to-Image“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”
AI Judge Analysis
DALL-E 2
- + Features a bold and artistic interpretation of food imagery.
- + Uses large, high-impact sans-serif typography as requested.
- − Fails to create a functional menu layout, resembling a book or pamphlet instead.
- − Food images are abstract shards rather than clear, appetizing photos.
- − Text is completely garbled and lacks necessary pricing or descriptions.
Nano Banana Pro
- + Perfectly adheres to the grid layout with sections for Appetizers, Pizza, and Mains.
- + Produces clear, high-quality, and appetizing food photography.
- + Exhibits professional graphic design elements with color-coded accents and consistent typography.
- − Contains minor spelling errors in item names and descriptions, though the overall structure is legible.
Verdict: Nano Banana Pro significantly outperforms DALL-E 2 by providing a highly functional and aesthetically pleasing menu design that follows every part of the prompt. DALL-E 2 produced an abstract, non-functional concept that failed to organize the sections properly, whereas Nano Banana Pro delivered a professional layout ready for a casual dining setting.
Magic Burger Explosion: Fiery Photorealism Challenge
Text-to-Image“Ad for 'Magic Burger'. Dynamic, exploded burger with all components (bun, patty, cheese, lettuce, tomato, sauce) suspended in mid-air. Emphasize photorealistic detail and a sense of motion. Dark, fiery background with glowing embers. Integrate text: 'MAGIC BURGER' as a prominent title, 'LIMITED TIME ONLY' as a secondary message, and '€6.99' in a starburst, all rendered with a fiery, glowing effect.”
AI Judge Analysis
DALL-E 2
- + Strong sense of chaotic motion and energy
- + Interesting fiery color palette
- − Text is heavily garbled and contains nonsensical words
- − Low image resolution and lacks photorealistic detail
- − The burger components are messy and difficult to identify
Nano Banana Pro
- + Perfect text rendering for all requested elements
- + High-quality photorealistic texture on the meat, fresh vegetables, and bun
- + Clean exploded composition that balances the burger and promotional text effectively
- − The lighting on the burger feels a bit separate from the dark background environment
Verdict: Nano Banana Pro is the clear winner as it perfectly follows all prompt instructions, including complex text integration and specific price point. DALL-E 2 fails significantly on text legibility and image clarity, resulting in an unpolished and messy advertisement.
Chalkboard Menu
Text-to-Image“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”
AI Judge Analysis
DALL-E 2
- + Captures a messy chalk texture.
- − Text is completely illegible and gibberish.
- − Fails to follow any of the specific textual prompts.
- − Image quality is low resolution and contains significant artifacts.
Nano Banana Pro
- + Excellent text rendering with perfect adherence to the requested phrases.
- + Realistic café environment with good lighting and composition.
- + Chalk texture and handwriting style look authentic and consistent.
- − The date is technically rendered in a neat print-style handwriting rather than the 'elegant cursive' requested for the title.
Verdict: Nano Banana Pro is the clear winner as it successfully rendered all the requested text with high legibility and photographic realism. In contrast, DALL-E 2 produced an abstract mess of chalk-like marks with no readable words or coherent composition.
The Reversed Rodeo
Text-to-Image“Horse riding astronaut in space — horse on top, not vice versa. Surreal, highly detailed, cinematic.”
AI Judge Analysis
DALL-E 2
- + Good atmosphere and lunar-like surface lighting
- + Coherent starry background
- − Failed to follow the core instruction of the horse being on top
- − Low resolution with significant grain and artifacts
- − Poor anatomical detail on the horse especially the legs and tail
Nano Banana Pro
- + Perfectly adhered to the complex instruction of putting the horse on top of the astronaut
- + High visual clarity and vibrant cinematic colors
- + Detailed rendering of the space suit, horse fur, and celestial bodies
- − The horse's back leg merging into the astronaut's shoulder is slightly confusing anatomically
Verdict: Nano Banana Pro successfully followed the difficult counter-intuitive instruction to place the horse on top of the astronaut, whereas DALL-E 2 produced a standard 'astronaut riding a horse' image. Nano Banana Pro also significantly outclassed DALL-E 2 in terms of resolution, detail, and artistic composition.
The Capybara Taxi Driver
Text-to-Image“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”
AI Judge Analysis
DALL-E 2
- + Includes a capybara and a phone.
- − Very poor visual quality with significant artifacts and low resolution.
- − Failed to include the businesswoman passenger in the back seat.
- − The capybara's hand is mutated into a human hand holding the phone.
- − The 'hat' looks like a flat clip-art overlay.
Nano Banana Pro
- + Excellent photorealism with detailed textures on the capybara fur and taxi interior.
- + Perfect adherence to all prompt elements, including the passenger, expression, and lighting.
- + High-quality composition with realistic 'bokeh' city lights in the background.
- − The capybara's paws are somewhat humanoid in their grip on the wheel.
Verdict: DALL-E 2 produced a low-quality image that failed on most prompt requirements, notably missing the passenger and merging the driver and phone into a single distorted figure. Nano Banana Pro delivered a high-fidelity, photorealistic scene that captured the specific mood, the bored passenger, and the professional capybara driver perfectly.
The Halloween Invitation
Text-to-Image“Vintage gothic Halloween party invitation. Dark parchment poster, spooky border with webs and thorns, central glowing jack-o-lantern, bats, twisted trees, moody night sky. Add elegant gothic title text saying "Halloween Party Invitation", a small scroll banner saying "You are invited to a night of frights", and event details at the bottom: Date: 30.10.2026 Time: 7pm Location: The Arches, NYC Spooky but polished, cinematic lighting, square format.”
AI Judge Analysis
DALL-E 2
- + Features a hand-painted, vintage aesthetic.
- + Atmospheric use of warm glowing colors.
- − Very low resolution and blurred image quality.
- − Text is illegible and contains various misspellings.
- − Failed to include the specific event details like date and location.
Nano Banana Pro
- + Excellent text rendering with near-perfect spelling and gothic font choice.
- + High-quality composition that follows all elements of the prompt including borders and jack-o-lantern.
- + Strong adherence to the specific event details like date, time, and location.
- − The parchment texture is limited mainly to the outer border rather than the whole background.
Verdict: Nano Banana Pro significantly outperforms DALL-E 2 by providing legible, accurate text and high-resolution details requested in the prompt. While DALL-E 2 captures a moody atmosphere, it fails to render any of the specific textual information and suffers from poor image clarity, whereas Nano Banana Pro delivers a professional, polished invitation layout.
Isometric Miniature Diorama Scenes
Text-to-Image“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”
AI Judge Analysis
DALL-E 2
- + Matches the requested isometric 45-degree angle.
- + Follows the solid light blue background requirement.
- − Serious visual artifacts and mangled 3D objects that lack clarity.
- − Failed to render the text at the top as requested, instead placing garbled text on the plate.
- − Does not include the flag icon.
Nano Banana Pro
- + Excellent text rendering and layout placement of 'JAPAN', 'SUSHI', and the flag icon.
- + High visual quality with clean textures and a clear diorama base.
- + Superior composition that perfectly balances the 3D assets with the typography.
- − Perspective is slightly flatter than a strict 45-degree iso angle but still fits the diorama theme.
Verdict: Nano Banana Pro followed every instruction perfectly, including complex typography, a flag icon, and high-quality 3D assets on a diorama base. DALL-E 2 struggled significantly, producing distorted objects and failing to generate the requested text header. Nano Banana Pro is the clear winner for its professional rendering and exact prompt adherence.
Adorable Baby Animals in Sunny Meadow
Text-to-Image“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”
AI Judge Analysis
DALL-E 2
- + Features most of the requested animals
- + Natural lighting in the background
- − Low image quality with significant artifacts and blurring
- − Anatomically incorrect and distorted animals, particularly the kitten and rabbit
- − Butterflies appear fragmented and floating unnaturally
Nano Banana Pro
- + Excellent adherence to the prompt with all four animals clearly visible
- + Superior visual quality with ultra-detailed fur and expressive eyes
- + Beautiful lighting with god rays and dew sparkles that match the intended vibe
- − Slightly more stylized than hyper-photorealistic
- − Butterfly scales are a bit large relative to the animals
Verdict: Nano Banana Pro significantly outperforms DALL-E 2 in every category, delivering a high-resolution, coherent image that perfectly captures the joyful, wholesome atmosphere requested. DALL-E 2 produces a distorted and low-quality result where the animals are barely recognizable and the composition is messy.
Vintage Cafe Logo
Text-to-Image“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”
AI Judge Analysis
DALL-E 2
- + Matches the requested color palette
- + Features a simple cloche icon
- − Text is completely illegible and gibberish
- − Missing the establishment date and banner
- − Jagged, low-quality edges on the vector shape
Nano Banana Pro
- + Perfect text rendering for both name and date
- + Excellent vintage illustration style with cross-hatching
- + Exactly follows all prompt requirements including the banner and steam
- − The steam is slightly more ornate than 'minimalist' might suggest
Verdict: Nano Banana Pro significantly outperforms DALL-E 2 by providing accurate text rendering and a professional-grade vintage aesthetic. DALL-E 2 failed to produce legible text or follow the specific layout requirements such as the 'Est. 1720' banner.
Apollo 11: Journey to Tranquility
Text-to-Image“Create a clean, modern vector infographic poster about the Apollo 11 mission. NASA-inspired palette (navy, white, muted red, light gray). Flat-vector style, crisp lines, consistent iconography, subtle gradients only. Steps (stop at landing): 1. Launch (Saturn Vicon) 2. Earth Orbit (Earth + orbit ring icon) 3. Translunar (trajectory arc icon) 4. Lunar Orbit (Moon + orbit ring icon) 5. Descent (lunar module descending icon) 6. Landing (lunar module on the surface icon) Small supporting elements (minimal text): • Crew strip: three silhouette icons with only last names: Armstrong, Aldrin, Collins. • Landing site marker: Moon pin labeled "Tranquility" only. Layout constraints: generous margins, large readable labels, clean background with subtle stars. Vector-only, print-poster look, high resolution.”
AI Judge Analysis
DALL-E 2
- + Features a minimalist color palette.
- − Text is illegible and nonsensical.
- − Icons are abstract blobs and do not follow the specific steps requested.
- − Lacks a cohesive infographic structure and fails on almost all prompt requirements.
Nano Banana Pro
- + Excellent adherence to all six requested steps and iconography.
- + High-quality text rendering and crisp vector design.
- + Perfect application of the requested NASA-inspired color palette.
- − Minor text overlap in the trajectory line near the '3' might decrease readability slightly.
Verdict: Nano Banana Pro delivered a professional, informative, and visually appealing infographic that followed every instruction, including the specific sequence of events and color scheme. DALL-E 2 failed to generate readable text or identifiable icons, producing a chaotic and unusable image that ignored the core prompt requirements.
Explore each model
Gemini 3 Pro with image generation capabilities. Combines advanced reasoning with the ability to generate and edit images.