OpenAI's cost-effective image generation model for when image quality isn't the top priority
Settled by community votes across 3 shared challenges, with an AI judge weighing in on each.
GPT Image 1 Mini
#12 of 44 in Text-to-Image
Imagen 4.0 Ultra Generate 001
#28 of 44 in Text-to-Image
Where the votes landed
GPT Image 1 Mini: 0% win rate
Ties: 0%
Imagen 4.0 Ultra Generate 001: 0% win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Geometric Composition
Text-to-Image: “A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”
AI Judge Analysis
GPT Image 1 Mini
- + Excellent adherence to the 'plant behind the cube' instruction with realistic distortion
- + High visual quality with realistic textures on the book and sphere
- + Natural-looking soft window lighting
- − The sphere appears to be floating unnaturally without a clear support
- − The glass cube looks more like an empty frame or thin container than a solid object
Imagen 4.0 Ultra Generate 001
- + Accurate rendering of a solid glass cube with realistic refractive properties
- + Crisp text rendering on the book spine
- + Dynamic lighting and shadows on the wooden table
- − The plant is more to the side than 'behind' the cube, missing the requested visual interaction
- − The blue sphere has a strange double-reflection/ghosting artifact
- − The sphere is floating in the center of a solid glass block, which is physically impossible
Verdict: GPT Image 1 Mini followed the spatial instructions much better, correctly placing the plant behind the cube so it is visible through the glass. While Imagen 4.0 Ultra Generate 001 produced a more beautiful solid glass texture and sharp text, it failed to place the plant behind the cube and included a distracting visual artifact next to the blue sphere.
Candid Street Photography
Text-to-Image: “A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”
AI Judge Analysis
GPT Image 1 Mini
- + Excellent shallow depth of field and bokeh
- + Strong cinematic mood with realistic lighting and reflections
- + Coherent bicycle structure with logical components like the kickstand and basket
Imagen 4.0 Ultra Generate 001
- + Exceptional skin texture and facial detail
- + Dynamic composition with visible tools and better sense of action
- + Captures 'motion blur from passing cars' more effectively
- − The red bicycle frame has structural issues, such as a missing down tube and oddly placed pedals and gears
- − The white flecks on the ground look more like petals than rain reflections
Verdict: GPT Image 1 Mini produces a more coherent and aesthetically pleasing 'cinematic' image with realistic bicycle geometry, though it lacks the fine skin detail of its competitor. Imagen 4.0 Ultra Generate 001 provides incredible detail in the subject's face and hands, but the bicycle is structurally nonsensical and the background blur feels less natural than the 50mm lens look achieved by GPT Image 1 Mini.
Adorable Baby Animals in Sunny Meadow
Text-to-Image: “Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”
AI Judge Analysis
GPT Image 1 Mini
- + Excellent photorealism with natural fur textures and lighting.
- + Dynamic and convincing movement, capturing the animals in a playful romp.
- + Subtle and realistic integration of dew sparkles and god rays.
- − The kitten has an odd fifth leg or tail-like protrusion between its front legs.
- − Butterflies are less varied and fewer in number compared to the other model.
Imagen 4.0 Ultra Generate 001
- + Perfect adherence to all requested animals with distinct, expressive poses.
- + Vibrant and colorful composition with a high variety of butterflies and flowers.
- + Very clear 'dew sparkles' on the grass as requested in the prompt.
- − Leans toward a digital illustration or 'hyper-real' CGI style rather than true photorealism.
- − The fox's anatomy, particularly the front paws, feels a bit stylized and stiff.
Verdict: GPT Image 1 Mini produces a much more convincing photorealistic image with beautiful natural lighting, though it suffers from a minor anatomical glitch on the kitten. Imagen 4.0 Ultra Generate 001 creates a charming, storybook-like scene that perfectly captures every element of the prompt but lacks the realistic depth and texture of the former, appearing more like a high-end 3D render.
Explore each model
Google's Imagen 4.0 Ultra model offering the highest fidelity and resolution for professional-grade image generation