Imagen 4.0 Generate 001 Google Stable Diffusion 3.5 Large Stability AI

Settled by community votes across 1 shared challenge, with an AI judge weighing in on each.

Imagen 4.0 Generate 001

15.9 arena score

#40 of 44 in Text-to-Image

Skill signature

Not enough comparable category data

The chart appears once both models have ratings across at least three shared arena categories.

Stable Diffusion 3.5 Large

22.9 arena score

#25 of 44 in Text-to-Image

Vote tally

Where the votes landed

Imagen 4.0 Generate 001

50.0%

win rate

Ties

0.0%

Stable Diffusion 3.5 Large

50.0%

win rate

50.0% 0.0% ties 50.0%

Shared challenges 1

Challenge by challenge

The strongest take from each model on every shared challenge, with the AI judge's read.

Adorable Baby Animals in Sunny Meadow

Text-to-Image

“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”

Imagen 4.0 Generate 001

Stable Diffusion 3.5 Large

50% wins 0% ties 50% wins

AI Judge Analysis

Imagen 4.0 Generate 001

+ Perfect adherence to all requested animal types
+ Excellent clarity and sharp focus throughout the frame
+ Detailed depiction of dew drops and diverse wildflower species

− The composition feels a bit crowded and illustrative rather than 'photorealistic'
− The lighting effects such as god rays appear a bit artificial

Stable Diffusion 3.5 Large

+ Achieves a more convincing 'photorealistic' depth of field and soft lighting
+ Captures a more dynamic sense of movement and 'joyful' expression
+ Beautiful bokeh and natural integration of sun glares

− Failed to include a 'tabby' kitten, showing a solid ginger kitten instead
− The back legs of the fox and puppy are somewhat muddled or missing in the grass

Verdict: Imagen 4.0 Generate 001 followed the prompt's specific subject list more accurately by correctly including a tabby kitten, whereas Stable Diffusion 3.5 Large opted for a ginger one. However, Stable Diffusion 3.5 Large produced a much more photorealistic and emotionally resonant image with better lighting and depth, while Imagen 4.0 leaned toward a saturated, illustrative aesthetic.

Next steps

Explore each model

Imagen 4.0 Generate 001

Google

Google's latest Imagen 4.0 text-to-image generation model with significantly better text rendering and overall image quality

Vote this model in the arena

Arena profile Lumenfall catalog

Stable Diffusion 3.5 Large

Stability AI

Stability AI's 8.1-billion parameter Multimodal Diffusion Transformer (MMDiT) text-to-image model featuring improved image quality, typography, complex prompt understanding, and resource-efficiency

Vote this model in the arena

Arena profile Lumenfall catalog