Head to head

DALL-E 3 (OpenAI) vs. Imagen 4.0 Fast Generate 001 (Google)

Settled by community votes across 3 shared challenges, with an AI judge weighing in on each.

DALL-E 3

18.5 arena score

#35 of 44 in Text-to-Image

Skill signature

Not enough comparable category data

The chart appears once both models have ratings across at least three shared arena categories.

Imagen 4.0 Fast Generate 001

17.1 arena score

#39 of 44 in Text-to-Image

Vote tally

Where the votes landed

DALL-E 3

0%

win rate

Ties

0%

Imagen 4.0 Fast Generate 001

0%

win rate

Shared challenges: 3

Challenge by challenge

The strongest take from each model on every shared challenge, with the AI judge's read.

Candid Street Photography

Text-to-Image

“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”

DALL-E 3
Imagen 4.0 Fast Generate 001

AI Judge Analysis

DALL-E 3

  • + Excellent atmospheric lighting and puddle reflections.
  • + Strong composition with foreground elements creating depth.
  • + Captures a traditional Japanese street vibe well.
  • The man's anatomical details, especially his feet and neck, appear distorted.
  • Missing the requested motion blur on the passing cars.

Imagen 4.0 Fast Generate 001

  • + Realistic skin texture and lifelike clothing fabric.
  • + More natural character posing and interaction with the bicycle.
  • + Better adherence to the 'no stylization' and 'candid' photographic requirements.
  • The thick dark border makes the framing slightly distracting.
  • Reflections are present but less visually striking than in the DALL-E 3 image.

Verdict: Imagen 4.0 delivers a much more convincing, realistic photograph with natural skin textures and a truly candid feel, whereas DALL-E 3 produces a more stylized, cinematic image with significant anatomical errors. While DALL-E 3 has better environmental atmosphere, Imagen 4.0 is the superior choice for a request emphasizing realism and photographic quality.

Fantasy Warrior

Text-to-Image

“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”

DALL-E 3
Imagen 4.0 Fast Generate 001

AI Judge Analysis

DALL-E 3

  • + Excellent adherence to the complex prompt including armor, braids, and lighting.
  • + High level of detail on the engraved plate armor and skin textures.
  • + Dynamic use of bokeh and torchlight to create mood.
  • The eyes, while striking, look slightly more stylized than 'lifelike' in a physiological sense.
  • Some of the sparks overlap the character in a way that feels a bit artificial.

Imagen 4.0 Fast Generate 001

  • + Naturalistic lighting and realistic clothing textures.
  • + Good environmental detail in the garden.
  • Failed completely to follow the prompt instructions regarding paladins, armor, and torchlight.
  • Produced a full-length shot instead of the requested close portrait.
  • Ignored specifically requested features like braided hair and battle-worn scars.

Verdict: DALL-E 3 followed the prompt expertly, delivering a high-quality fantasy portrait with detailed armor and specific lighting effects. Imagen 4.0 Fast Generate 001 failed the task entirely, producing an image of an elderly man in a garden that bears no resemblance to the requested paladin character.

Adorable Baby Animals in Sunny Meadow

Text-to-Image

“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”

DALL-E 3
Imagen 4.0 Fast Generate 001

AI Judge Analysis

DALL-E 3

  • + Excellent adherence to the 'wholesome' and 'expressive eyes' prompt descriptors.
  • + Vibrant, magical lighting with clear god rays as requested.
  • + Includes the butterflies and tumbling action mentioned in the prompt.
  • Has a very stylized, 3D-animation look rather than the requested 'hyper-photorealistic' style.
  • The butterflies have strange furry bodies that look like hybrids.

Imagen 4.0 Fast Generate 001

  • + Achieves a much higher level of photorealism with natural textures.
  • + Includes all four requested animals with more realistic anatomical proportions.
  • + Beautiful soft morning light and realistic bokeh in the meadow.
  • Missed the butterfly element of the prompt entirely.
  • The animals are sitting still rather than 'playfully chasing' or 'tumbling'.

Verdict: DALL-E 3 followed the action and decorative elements of the prompt more closely, creating a whimsical and lively scene, but failed on the request for photorealism. Imagen 4.0 rendered the requested animals and lighting far more realistically, producing a high-quality photo-like image, though it ignored the butterflies and the specific playful movement.
