Head to head

DALL-E 2 (OpenAI) vs. Imagen 4.0 Ultra Generate 001 (Google)

Settled by community votes across 7 shared challenges, with an AI judge weighing in on each.

DALL-E 2

17.7 arena score

#37 of 44 in Text-to-Image

Skill signature

Not enough comparable category data

The chart appears once both models have ratings across at least three shared arena categories.

Imagen 4.0 Ultra Generate 001

22.3 arena score

#28 of 44 in Text-to-Image

Vote tally

Where the votes landed

DALL-E 2

0%

win rate

Ties

0%

Imagen 4.0 Ultra Generate 001

0%

win rate

Shared challenges: 7

Challenge by challenge

The strongest take from each model on every shared challenge, with the AI judge's read.

Geometric Composition

Text-to-Image

“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”

[Generated images: DALL-E 2, Imagen 4.0 Ultra Generate 001]

AI Judge Analysis

DALL-E 2

  • + Features a small glass cube on a wooden surface.
  • Fails almost all prompt requirements, including the blue sphere inside and a red book on top.
  • The blue sphere mentioned in the prompt has been interpreted as a massive blue planter in the background.
  • Poor resolution and blurry details compared to modern standards.

Imagen 4.0 Ultra Generate 001

  • + Perfect adherence to all spatial relationships and object descriptions.
  • + High visual quality with realistic refraction and soft lighting.
  • + Excellent text rendering on the book spine.
  • The blue sphere appears to be floating rather than resting on the bottom of the cube, which might be a minor physics inaccuracy.

Verdict: DALL-E 2 completely fails the complex spatial instructions, misinterpreting the 'blue sphere' as a background planter and omitting the book entirely. Imagen 4.0 Ultra Generate 001 provides a high-fidelity image that follows every detail of the prompt, including the specific positioning of the sphere inside the cube and the plant visible through the glass.

Candid Street Photography

Text-to-Image

“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”

[Generated images: DALL-E 2, Imagen 4.0 Ultra Generate 001]

AI Judge Analysis

DALL-E 2

  • + Successfully captures a 'shallow depth of field' and 'imperfect framing' mentioned in the prompt.
  • The subject is completely out of focus, rendering the 'natural skin texture' and identity of the man impossible to see.
  • The image quality is blurry and lacks discernible details of the bicycle repair.

Imagen 4.0 Ultra Generate 001

  • + Excellent adherence to all prompt details including red bicycle, rain, and elderly Japanese man.
  • + Superior visual quality with highly detailed skin textures, wet fabric, and mechanical bicycle parts.
  • + Captures the cinematic mood perfectly with realistic lighting and bokeh in the background.
  • Misses the 'motion blur from passing cars' as the car in the background is static.
  • The bicycle chain and pedal assembly have minor structural inconsistencies of the kind common in AI-generated images.

Verdict: Imagen 4.0 Ultra is the clear winner as it provides a coherent, high-quality image that captures the essence of the prompt's subject matter. While DALL-E 2 attempted the technical camera settings like focal length and imperfect framing, it produced a blurry and unusable image where the main subject is unrecognizable.

Modern Clean Menu

Text-to-Image

“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”

[Generated images: DALL-E 2, Imagen 4.0 Ultra Generate 001]

AI Judge Analysis

DALL-E 2

  • + Strong bold typography
  • + Artistic and abstract layout
  • Does not follow the grid layout request
  • Food photos are distorted and unrecognizable
  • Lacks the specific requested sections (Appetizers/Pizza/Mains)

Imagen 4.0 Ultra Generate 001

  • + Perfectly follows the grid layout request
  • + Accurately includes section headers for Appetizers, Pizza, and Mains
  • + Clear, high-quality food photography
  • Text contains gibberish/hallucinated characters
  • Some currency symbols and numbers are poorly rendered

Verdict: Imagen 4.0 Ultra follows the prompt instructions with high fidelity, creating a usable menu layout with clear sections and a grid of food photos. DALL-E 2 fails significantly on the layout requirements, producing an abstract design where the food photos are sliced and the requested sections are missing.

Isometric Miniature Diorama Scenes

Text-to-Image

“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”

[Generated images: DALL-E 2, Imagen 4.0 Ultra Generate 001]

AI Judge Analysis

DALL-E 2

  • + Matches the 45-degree isometric angle requested.
  • + Features a clean, solid background color.
  • Fails significantly on text rendering, displaying 'Sush' instead of the requested text.
  • Very poor representation of sushi, looking more like abstract plastic shapes than food.
  • Missing the Japanese flag and 'JAPAN' text.

Imagen 4.0 Ultra Generate 001

  • + Excellent adherence to all text requirements, including 'JAPAN', 'SUSHI', and the flag icon.
  • + High-quality 3D miniature aesthetic with great material textures and lighting.
  • + Accurately renders an appealing variety of sushi on a diorama base.
  • The diorama base is slightly more complex than 'minimal', but fits the theme well.

Verdict: Imagen 4.0 Ultra Generate 001 provides a near-perfect interpretation of the prompt, including accurate text rendering and a high-clarity 3D miniature style. DALL-E 2 fails on almost every specific detail, producing incorrect text, missing elements, and low-quality subjects. Imagen 4.0 Ultra Generate 001 is the clear winner for its superior visual quality and prompt adherence.

Adorable Baby Animals in Sunny Meadow

Text-to-Image

“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”

[Generated images: DALL-E 2, Imagen 4.0 Ultra Generate 001]

AI Judge Analysis

DALL-E 2

  • + Dynamic sense of movement and motion blur
  • Heavy anatomical distortions and artifacts, especially in the secondary animals
  • Low resolution with 'mushy' textures that don't meet the 8K/ultra-detailed requirement
  • Fails to clearly render all four requested animals

Imagen 4.0 Ultra Generate 001

  • + Excellent adherence to the prompt, including all four specific animals and environmental details
  • + Clear, high-quality rendering of fur and lighting effects like god rays and dew sparkles
  • + Strong composition with expressive character faces and a joyful atmosphere
  • Slightly stylized appearance rather than true photography
  • The animals' poses are a bit stiff for a 'playful' scene

Verdict: Imagen 4.0 Ultra is the clear winner as it successfully rendered every element of the prompt, including the specific list of four animals, while maintaining high visual clarity. DALL-E 2 struggled significantly with coherence, producing distorted animal figures and failing to capture the 'hyper-photorealistic' or 'ultra-detailed' quality requested.

Vintage Cafe Logo

Text-to-Image

“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”

[Generated images: DALL-E 2, Imagen 4.0 Ultra Generate 001]

AI Judge Analysis

DALL-E 2

  • + Matches the warm brown and cream color palette
  • + Includes a cloche-style icon
  • Text is nonsensical and fails to spell 'Caffè Florian'
  • Fails to include the 'Est. 1720' banner
  • Visual quality is muddy with unclear shapes

Imagen 4.0 Ultra Generate 001

  • + Perfect text rendering for both the name and the banner
  • + Clean, professional vector emblem style
  • + Accurately represents all prompt elements including the steam and banner
  • Minimalist execution might feel slightly safe, though it fits the prompt perfectly

Verdict: Imagen 4.0 Ultra follows every instruction perfectly, producing a professional-grade logo with accurate text and a clear vector aesthetic. DALL-E 2 fails significantly on spelling and composition, producing a cluttered and illegible design that does not resemble a usable logo.

Apollo 11: Journey to Tranquility

Text-to-Image

“Create a clean, modern vector infographic poster about the Apollo 11 mission. NASA-inspired palette (navy, white, muted red, light gray). Flat-vector style, crisp lines, consistent iconography, subtle gradients only. Steps (stop at landing): 1. Launch (Saturn V icon) 2. Earth Orbit (Earth + orbit ring icon) 3. Translunar (trajectory arc icon) 4. Lunar Orbit (Moon + orbit ring icon) 5. Descent (lunar module descending icon) 6. Landing (lunar module on the surface icon) Small supporting elements (minimal text): • Crew strip: three silhouette icons with only last names: Armstrong, Aldrin, Collins. • Landing site marker: Moon pin labeled "Tranquility" only. Layout constraints: generous margins, large readable labels, clean background with subtle stars. Vector-only, print-poster look, high resolution.”

[Generated images: DALL-E 2, Imagen 4.0 Ultra Generate 001]

AI Judge Analysis

DALL-E 2

  • + Captures a technical blueprint aesthetic.
  • + Uses the requested NASA-inspired color palette effectively.
  • Text is largely illegible, and the main title is badly misspelled.
  • Layout is cluttered and lacks the logical flow of the requested steps.
  • Fails to present clear, consistent iconography for the mission steps.

Imagen 4.0 Ultra Generate 001

  • + Adheres much better to the requested infographic layout and steps.
  • + Text is significantly more legible with recognizable headers like 'Apollo 11'.
  • + Clean, modern vector style with consistent iconography for the mission stages.
  • Some nonsensical placeholder text ('BEOMBERS', 'SFONDRSS') is present.
  • The vertical central graphic is somewhat abstract and does not clearly show the 'Saturn V' or 'Lunar Module'.

Verdict: Imagen 4.0 Ultra Generate 001 follows the prompt much more successfully, providing a clear infographic structure with semi-legible text and a clean vector style. DALL-E 2 produces a chaotic, illegible image with misspelled titles ('ALLPOO APPLOO') and lacks the professional design required for an infographic.