GPT Image 1.5 vs Imagen 4.0 Fast Generate 001

Head-to-head across 3 challenges

GPT Image 1.5

100.0%

win rate

Ties

0.0%

Imagen 4.0 Fast Generate 001

0.0%

win rate

100.0% 0.0% ties 0.0%

Challenge Results

Candid Street Photography

Text-to-Image

“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”

GPT Image 1.5
Imagen 4.0 Fast Generate 001

AI Judge Analysis

GPT Image 1.5

  • + Excellent depiction of rain with visible droplets on the man's jacket and hat.
  • + Realistic skin texture and an authentic, natural-looking pose for the subject.
  • + Strong cinematic composition with beautiful reflections and light bokeh in the background.
  • The motion blur of the passing car is somewhat static and lacks a sense of true speed.

Imagen 4.0 Fast Generate 001

  • + Successfully incorporates the 'imperfect framing' prompt with a window-like border.
  • + Clear, high-quality reflection on the wet pavement.
  • + Good natural skin texture and facial detail.
  • The 'motion blur from passing cars' is completely absent; the cars in the background are sharp.
  • Anatomical issues with the hands, specifically the right hand which looks mangled.
  • The composition feels slightly more staged than 'candid'.

Verdict: GPT Image 1.5 is the clear winner as it successfully balances all technical requirements, particularly the atmospheric effects of the light rain and the candidate feel requested. Imagen 4.0 Fast Generate 001 fails to produce the requested motion blur and has significant artifacts in the subject's hands.

Fantasy Warrior

Text-to-Image

“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”

GPT Image 1.5
Imagen 4.0 Fast Generate 001
100% wins 0% ties 0% wins

AI Judge Analysis

GPT Image 1.5

  • + Perfect adherence to all prompt elements including armor, beads, scars, and lighting.
  • + Exceptional detail in skin texture, armor engravings, and leather straps.
  • + Dynamic composition with realistic bokeh and cinematic torchlight reflections.
  • The hair strands over the eye are slightly blurred compared to the sharp facial features.

Imagen 4.0 Fast Generate 001

  • + High resolution and natural-looking outdoor lighting.
  • Failed to follow the prompt entirely, showing a modern man in a garden instead of a paladin.
  • Missing all requested elements: armor, scars, torchlight, and beads.
  • Distance is a wide shot rather than the requested close portrait.

Verdict: GPT Image 1.5 followed the prompt with extreme precision, delivering a cinematic and highly detailed portrait of a battle-worn warrior that matched every specific requirement. Imagen 4.0 Fast Generate 001 failed the task completely, producing an unrelated image of an elderly man in a garden which suggests a total failure in prompt interpretation or a generation error.

Adorable Baby Animals in Sunny Meadow

Text-to-Image

“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”

GPT Image 1.5
Imagen 4.0 Fast Generate 001
100% wins 0% ties 0% wins

AI Judge Analysis

GPT Image 1.5

  • + Perfect adherence to all requested animal types (Golden Retriever, Tabby, Bunny, Fox).
  • + Excellent dynamic composition with the 'tumbling' and 'chasing' actions clearly depicted.
  • + Strong lighting effects with visible god rays and sparkling dew consistent with a sunrise.
  • The cat's anatomy is slightly distorted, particularly the leg with too many toe pads.
  • The fox's paw in the bottom right looks somewhat muddy and poorly defined.

Imagen 4.0 Fast Generate 001

  • + High level of fur detail and realistic textures on all animals.
  • + Soft, pleasing bokeh in the background with consistent lighting.
  • Failed multiple prompt requirements: no butterflies, no 'tabby' kitten, and no 'golden retriever' (dog is brown/white).
  • The animals are sitting still rather than 'playfully chasing' or 'tumbling' as requested.

Verdict: GPT Image 1.5 followed the complex prompt much more accurately, including all four specific animal breeds and the action of chasing butterflies. While Imagen 4.0 has high textural quality, it missed several key descriptors including the butterfly element and the specific dog and cat breeds requested. GPT Image 1.5 better captured the 'joyful wholesome vibe' through dynamic posing.

GPT Image 1.5

OpenAI's state-of-the-art image generation model with better instruction following and adherence to prompts

Imagen 4.0 Fast Generate 001

Google's Imagen 4.0 Fast model optimized for speed and efficiency, suitable for high-volume image generation tasks