OpenAI's legacy image generation model supporting generations, edits with masks (inpainting), and variations
Settled by community votes across 1 shared challenge, with an AI judge weighing in on each.
DALL-E 2
#37 of 44 in Text-to-Image
Not enough comparable category data
The chart appears once both models have ratings across at least three shared arena categories.
Imagen 4.0 Generate 001
#40 of 44 in Text-to-Image
Where the votes landed
DALL-E 2: 0% win rate
Ties: 0%
Imagen 4.0 Generate 001: 0% win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Adorable Baby Animals in Sunny Meadow
Text-to-Image
“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”
AI Judge Analysis
DALL-E 2
- + Natural lighting and realistic grass textures
- + Good sense of motion and action
- − Major anatomical distortions and artifacts, particularly on the animals in the background
- − Failed to clearly include all four requested animals
- − Low overall image clarity and resolution
Imagen 4.0 Generate 001
- + Includes all four requested animals (dog, cat, rabbit, fox) clearly
- + Beautiful, sharp details including dew drops and fur textures
- + Perfect adherence to lighting requests, including god rays and morning sun
- − Composition feels a bit crowded and stylized rather than purely photorealistic
- − The cat's paws look slightly repetitive in their positioning
Verdict: Imagen 4.0 Generate 001 is the clear winner as it successfully rendered all four requested animals with high clarity and detail, whereas DALL-E 2 suffered from severe AI artifacts and anatomical distortions. Imagen 4.0 also captured the specific atmospheric details like dew and morning light much more effectively than DALL-E 2.
Explore each model
Google's latest Imagen 4.0 text-to-image generation model with significantly better text rendering and overall image quality