Head to head

DALL-E 3 (OpenAI) vs. Imagen 4.0 Ultra Generate 001 (Google)

Settled by community votes across 7 shared challenges, with an AI judge weighing in on each.

DALL-E 3

18.5 arena score

#35 of 44 in Text-to-Image

Skill signature

Not enough comparable category data

The chart appears once both models have ratings across at least three shared arena categories.

Imagen 4.0 Ultra Generate 001

22.3 arena score

#28 of 44 in Text-to-Image

Vote tally

Where the votes landed

DALL-E 3

50.0%

win rate

Ties

0.0%

Imagen 4.0 Ultra Generate 001

50.0%

win rate

Shared challenges 7
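
One way to read the 50.0% / 0.0% / 50.0% tally is as an average of the per-challenge vote splits. The sketch below is only an illustration under that assumption: it takes the two challenges on this page that display a vote split and weights them equally. The page does not state how the overall win rate is actually computed, and the names `vote_shares` and `overall_shares` are hypothetical.

```python
from statistics import mean

# Per-challenge vote shares as displayed on the page:
# (DALL-E 3 wins, ties, Imagen 4.0 Ultra Generate 001 wins).
# Only challenges that show a vote split are listed.
vote_shares = {
    "Adorable Baby Animals in Sunny Meadow": (0.0, 0.0, 1.0),
    "Apollo 11: Journey to Tranquility": (1.0, 0.0, 0.0),
}


def overall_shares(shares):
    """Average each column across challenges.

    Assumption: every challenge carries equal weight, which matches
    pooled voting only if each challenge received the same number of votes.
    """
    columns = zip(*shares.values())
    return tuple(mean(column) for column in columns)


dalle, ties, imagen = overall_shares(vote_shares)
print(f"DALL-E 3 {dalle:.1%} | ties {ties:.1%} | Imagen 4.0 Ultra {imagen:.1%}")
```

Under this equal-weight assumption, the two opposite 100% results cancel out to an even split, which is consistent with the tally shown above.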

Challenge by challenge

The strongest take from each model on every shared challenge, with the AI judge's read.

Geometric Composition

Text-to-Image

“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”

DALL-E 3
Imagen 4.0 Ultra Generate 001

AI Judge Analysis

DALL-E 3

  • + High artistic detail within the sphere
  • + Excellent use of light and shadow on the wooden table
  • Failed the spatial prompt: the red book is inside the cube instead of on top of it
  • The cube has a thick wooden frame not specified in the prompt
  • The plant is not visible through the glass

Imagen 4.0 Ultra Generate 001

  • + Perfect adherence to all spatial instructions
  • + Realistic refraction and transparency in the glass cube
  • + Clean and professional-grade photographic composition
  • The blue sphere appears to be floating without a visible support structure
  • The lighting on the cube is slightly more muted than the bright window suggests

Verdict: Imagen 4.0 Ultra followed every spatial instruction perfectly, correctly placing the red book on top and showing the plant through the glass. DALL-E 3 failed the prompt significantly by placing the red book inside the cube and adding a thick wooden frame that obscured the plant. Imagen 4.0 Ultra is the clear winner for its superior prompt adherence and realistic rendering.

Candid Street Photography

Text-to-Image

“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”

DALL-E 3
Imagen 4.0 Ultra Generate 001

AI Judge Analysis

DALL-E 3

  • + Excellent atmospheric lighting and puddles with realistic reflections
  • + Strong cinematic composition with creative foreground framing
  • + Accurate handling of passing car lights and overall street environment
  • Anatomical issues: the man's bare feet look distorted
  • The man appears more like a stylized character than a realistic person
  • Bicycle structure is muddy and poorly defined in the shadows

Imagen 4.0 Ultra Generate 001

  • + Exceptional realism in skin texture and clothing materials
  • + Highly accurate mechanical detail of the red bicycle's chain and gears
  • + Subtle and realistic depiction of raindrops on the man's jacket
  • Lacks the requested 'motion blur' from passing cars
  • Composition is a bit more standard/head-on rather than the 'imperfect candid' look requested

Verdict: Imagen 4.0 Ultra Generate 001 provides a much more convincing and realistic depiction of the man and the bicycle, with impressive skin textures and rain details on the clothing. While DALL-E 3 captures the atmospheric street vibe and 'imperfect framing' better, it fails on basic anatomy and realistic rendering, making the scene feel more like digital art than a candid photograph.

Modern Clean Menu

Text-to-Image

“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”

DALL-E 3
Imagen 4.0 Ultra Generate 001

AI Judge Analysis

DALL-E 3

  • + Provides multiple variations of the layout in a grid format.
  • + Excellent food photography with vibrant, appetizing colors.
  • + Good use of color blocking and contemporary graphic elements.
  • Text is largely illegible or gibberish symbols.
  • The presentation feels more like a mood board or catalog than a functional menu.

Imagen 4.0 Ultra Generate 001

  • + Stronger adherence to the 'grid' prompt for food photos.
  • + More functional layout with clear category headers in bold sans-serif fonts.
  • + Cleaner, more professional white space utilization for a minimalist aesthetic.
  • The alignment of the bottom row is inconsistent.
  • Text under the images contains some nonsensical characters.

Verdict: Imagen 4.0 Ultra is the winner, as it produces a more cohesive and functional menu design that follows the structural requirements of the prompt. While DALL-E 3 offers beautiful imagery and variety, its text rendering and layout are less practical for an actual menu design challenge.

Isometric Miniature Diorama Scenes

Text-to-Image

“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”

DALL-E 3
Imagen 4.0 Ultra Generate 001

AI Judge Analysis

DALL-E 3

  • + Excellent 3D miniature 'cartoon' aesthetic with soft shadows
  • + Creative integration of text on the diorama base
  • + Vibrant colors and high-quality PBR-style textures
  • Failed to place text at top-center as requested
  • Only includes one type of sushi (Salmon) which looks repetitive
  • Missing the word 'SUSHI' in the text

Imagen 4.0 Ultra Generate 001

  • + Perfect adherence to text placement and content instructions
  • + Variety of sushi types (Nigiri, Maki, Ikura) adds visual interest
  • + Precise 45-degree isometric projection and clean diorama base
  • The textures are a bit more realistic than the requested 'cartoon' style
  • The plate is floating slightly unnaturally above the base

Verdict: Imagen 4.0 Ultra strictly followed all prompt instructions, including the top-center text placement and the inclusion of both 'JAPAN' and 'SUSHI' along with the flag. While DALL-E 3 captured the 'cartoon' aesthetic much better, it failed several key layout and text instructions, making Imagen 4.0 Ultra the more accurate result.

Adorable Baby Animals in Sunny Meadow

Text-to-Image

“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”

Community votes: DALL-E 3 0% wins, 0% ties, Imagen 4.0 Ultra Generate 001 100% wins

AI Judge Analysis

DALL-E 3

  • + Excellent capture of golden sunrise lighting and god rays
  • + Very soft, expressive textures on the character faces
  • + Magical, whimsical atmosphere that fits the 'joyful' vibe
  • Anatomical errors on the butterflies including strange furry bodies and faces
  • The rabbit is missing its distinctive long ears, looking more like a generic small rodent

Imagen 4.0 Ultra Generate 001

  • + Stronger prompt adherence with distinct and recognizable animal species
  • + Better dynamic composition showing the animals actively standing and playing
  • + Clearer, more realistic butterfly renderings throughout the scene
  • Slightly more artificial, digital-illustration look rather than 'hyper-photorealistic'
  • The kitten's pose is a bit stiff/vertically stretched

Verdict: While DALL-E 3 captures a more beautiful and atmospheric lighting style, Imagen 4.0 Ultra Generate 001 provides much better prompt adherence by correctly depicting all four requested animals, particularly the rabbit and the tabby kitten's pattern. Imagen 4.0 Ultra also avoids the bizarre 'furry-faced' butterfly artifacts found in the DALL-E 3 output, making it the more coherent composition.

Vintage Cafe Logo

Text-to-Image

“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”

DALL-E 3
Imagen 4.0 Ultra Generate 001

AI Judge Analysis

DALL-E 3

  • + Features beautiful, intricate woodcut-style shading on the cloche.
  • + Rich use of warm brown tones and stipple textures that create a high-quality vintage feel.
  • + Strong circular emblem composition with ornate flourishes.
  • Failed to include the requested name 'Caffè Florian', replacing it with generic text.
  • The steam effect is a bit bulky compared to the rest of the fine line work.

Imagen 4.0 Ultra Generate 001

  • + Perfect adherence to the requested text 'Caffè Florian'.
  • + True minimalist aesthetic that better suits a modern-retro vector logo.
  • + Clean, professional typography and accurate banner placement.
  • The cloche is very simple and lacks the 'vintage' character seen in the other model.
  • The overall composition feels a bit sparse with significant empty space.

Verdict: While DALL-E 3 produced a more visually stunning piece of art with superior texture and 'vintage' character, it failed the primary task of including the correct brand name. Imagen 4.0 Ultra followed every instruction perfectly, including the specific name and banner placement, making it the practical winner for a logo design task.

Apollo 11: Journey to Tranquility

Text-to-Image

“Create a clean, modern vector infographic poster about the Apollo 11 mission. NASA-inspired palette (navy, white, muted red, light gray). Flat-vector style, crisp lines, consistent iconography, subtle gradients only. Steps (stop at landing): 1. Launch (Saturn V icon) 2. Earth Orbit (Earth + orbit ring icon) 3. Translunar (trajectory arc icon) 4. Lunar Orbit (Moon + orbit ring icon) 5. Descent (lunar module descending icon) 6. Landing (lunar module on the surface icon) Small supporting elements (minimal text): • Crew strip: three silhouette icons with only last names: Armstrong, Aldrin, Collins. • Landing site marker: Moon pin labeled "Tranquility" only. Layout constraints: generous margins, large readable labels, clean background with subtle stars. Vector-only, print-poster look, high resolution.”

Community votes: DALL-E 3 100% wins, 0% ties, Imagen 4.0 Ultra Generate 001 0% wins

AI Judge Analysis

DALL-E 3

  • + Excellent artistic style with a retro-futuristic space-age aesthetic.
  • + Correctly followed the NASA-inspired color palette.
  • + Great visual complexity and interesting texture-based gradients.
  • Failed the logical sequence of the infographic completely.
  • Included Space Shuttles which were not part of the Apollo missions.
  • Text is entirely illegible and gibberish.

Imagen 4.0 Ultra Generate 001

  • + Followed the specific 6-step infographic structure requested in the prompt.
  • + Crisp, clean flat vector style that matches the 'modern infographic' requirement.
  • + Much better text rendering and general layout coherence.
  • Nonsensical header text (e.g., 'BEOMBERS', 'SPUSTUR') despite readable fonts.
  • Icons are somewhat abstract and certain NASA branding is slightly distorted.
  • The white background block creates a slightly basic composition compared to DALL-E 3.

Verdict: Imagen 4.0 Ultra is the winner because it adhered to the complex structural instructions of the prompt, creating a 6-step process as requested. While DALL-E 3 produced more visually striking art, it ignored the specific steps and incorrectly included Space Shuttles, whereas Imagen 4.0 Ultra correctly attempted the mission stages with better text clarity.
