Google's latest Imagen 4.0 text-to-image generation model with significantly better text rendering and overall image quality
Settled by community votes across 1 shared challenge, with an AI judge weighing in on each.
Imagen 4.0 Generate 001
#40 of 44 in Text-to-Image
Category comparison chart: not enough comparable data. The chart appears once both models have ratings across at least three shared arena categories.
Recraft V4
#8 of 44 in Text-to-Image
Where the votes landed
Imagen 4.0 Generate 001: 0.0% win rate
Ties: 0.0%
Recraft V4: 100.0% win rate
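The page does not publish its vote counts or the exact formula behind these percentages. As a rough sketch, assuming each vote is recorded either as a win for one model or as a tie, and that each figure is that outcome's share of all votes cast, the numbers could be derived as follows. The function name `vote_shares` and the tally used here are hypothetical.

```python
def vote_shares(wins_a: int, wins_b: int, ties: int) -> dict[str, float]:
    """Return each outcome's share of all votes as a percentage (one decimal)."""
    total = wins_a + wins_b + ties
    if total == 0:
        # No votes means no meaningful percentages.
        raise ValueError("no votes recorded")
    return {
        "a": round(100 * wins_a / total, 1),
        "ties": round(100 * ties / total, 1),
        "b": round(100 * wins_b / total, 1),
    }


# Hypothetical tally (not from the page): every vote went to the second model.
shares = vote_shares(wins_a=0, wins_b=5, ties=0)
print(shares)
```

Under this assumption the three shares always sum to roughly 100%, which is consistent with the 0.0% / 0.0% / 100.0% split reported for the single shared challenge.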
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Adorable Baby Animals in Sunny Meadow
Text-to-Image: “Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”
AI Judge Analysis
Imagen 4.0 Generate 001
- + Excellent character consistency and focus
- + Beautiful, clear eye reflections and textures
- + Clean composition with distinct lighting effects
- − Leans toward a CGI/animated 3D style rather than the requested 'hyper-photorealistic' look
- − The animals are standing still rather than 'chasing and tumbling'
Recraft V4
- + Successfully captures the action of 'chasing and tumbling'
- + Higher level of realism in fur texture and lighting
- + Detailed environment with a wide variety of flora
- − The bunny appears slightly distorted or awkwardly placed in the background
- − The kitten's pose is a bit stiff compared to the dog and fox
Verdict: Recraft V4 wins by better adhering to the requirement for photorealism and capturing the dynamic action of the animals chasing butterflies. While Imagen 4.0 creates a charming and very clean image, it has a distinct digital illustration/3D render feel that contradicts the 'photorealistic' prompt.
Explore each model
Recraft's latest text-to-image generation model with high-quality output, supporting various aspect ratios and custom color palettes