Gemini 3 Pro with image generation capabilities. Combines advanced reasoning with the ability to generate and edit images.
Settled by community votes across 7 shared challenges, with an AI judge weighing in on each.
Nano Banana Pro
#2 of 44 in Text-to-Image
Not enough comparable category data
The chart appears once both models have ratings across at least three shared arena categories.
Imagen 4.0 Ultra Generate 001
#28 of 44 in Text-to-Image
Where the votes landed
Nano Banana Pro
28.6%
win rate
Ties
42.9%
Imagen 4.0 Ultra Generate 001
28.6%
win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Geometric Composition
Text-to-Image“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”
AI Judge Analysis
Nano Banana Pro
- + Excellent photographic realism with natural textures on the wood and book.
- + Accurate representation of the plant as being physically behind the glass cube.
- + Realistic light scattering and reflections on the glass surfaces.
- − The glass cube looks more like a thin-walled aquarium than a solid cube.
Imagen 4.0 Ultra Generate 001
- + Clear text rendering on the book spine.
- + Vibrant colors and high-contrast lighting.
- + Distinct interpretation of a solid glass block.
- − The plant appears to be inside the glass cube rather than behind it, failing the spatial requirement.
- − The blue sphere is floating unnaturally in the center of the cube.
- − The image has a slightly more 'rendered' CGI look compared to Model A.
Verdict: Gemini 3 Pro Image Preview captures the requested spatial relationships perfectly, placing the plant clearly behind the cube and the sphere on the floor of the glass structure. Imagen 4.0 Ultra produces a visually striking image with better text, but it fails the prompt adherence by making the plant appear to be inside the glass and floating the sphere in mid-air.
Candid Street Photography
Text-to-Image“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”
AI Judge Analysis
Nano Banana Pro
- + Excellent photographic quality with a film-like aesthetic and realistic 50mm depth of field.
- + Successfully captures the requested 'imperfect framing' and 'candid' feel with a wide, cinematic street view.
- + Exceptional background details, featuring localized Japanese street signs and authentic vehicle types.
- − The man's hands have significant anatomical errors, appearing fused and lacking clear digit separation.
- − The pavement reflections are good, but the 'light rain' is very subtle almost to the point of being absent.
Imagen 4.0 Ultra Generate 001
- + Features more detailed skin textures and clearer raindrops visible on the man's jacket.
- + The bicycle mechanics are slightly more coherent than in the other image.
- − The composition is a standard tight portrait, failing to capture the 'imperfect framing' or 'candid street photo' atmosphere requested.
- − The white spots on the ground look more like fallen petals or artifacts than reflections on wet pavement.
Verdict: Gemini 3 Pro much better captures the requested 'candid' and 'cinematic' mood with a composition that feels like a real street photograph, despite the typical AI hand artifacts. Imagen 4.0 Ultra produces a high-quality portrait, but the environmental storytelling is weaker and the ground texture is confusing and unrealistic.
Modern Clean Menu
Text-to-Image“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”
AI Judge Analysis
Nano Banana Pro
- + Excellent structure with clearly defined sections for Appetizers, Pizza, and Mains.
- + Effective use of playful geometric accents and bold sans-serif fonts.
- + Consistent and high-quality food photography that fits the grid layout perfectly.
- − Text contains a significant amount of gibberish in the descriptions.
- − Repeating 'Bruschetta' and 'Margherita Pizza' multiple times is a bit lazy in terms of content generation.
Imagen 4.0 Ultra Generate 001
- + Clean, minimalist aesthetic with a very professional whitespace-heavy layout.
- + Good variety of unique dish names and imagery.
- + Accurate text rendering for main headers.
- − The 'Mains' section starts in the middle of a horizontal row, which is unconventional and slightly confusing.
- − Food photos are less 'vibrant' compared to Model A, with a flatter color palette.
Verdict: Gemini 3 Pro Image Preview provides a more cohesive and visually engaging design that feels like a finished marketing asset, thanks to its vibrant food imagery and playful design accents. While Imagen 4.0 Ultra Generate 001 offers a very clean, minimalist look, its layout for the 'Mains' section is awkward and less intuitive for a menu.
Isometric Miniature Diorama Scenes
Text-to-Image“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”
AI Judge Analysis
Nano Banana Pro
- + Excellent texture work with realistic subsurface scattering on the fish and rice grains.
- + Beautiful lighting and shading that creates a high-end 3D render feel.
- + Complex diorama base with organic moss and cherry blossom details.
- − The text is slightly off-center compared to the diorama base.
- − The 'S' in SUSHI is slightly stylized in a way that looks less clean than standard bold fonts.
Imagen 4.0 Ultra Generate 001
- + Perfectly centered composition for both text and diorama.
- + Extremely clean, bold typography and crisp flag icon.
- + The miniature cartoon aesthetic is very consistent and 'toy-like'.
- − Textures are much flatter and less realistic than requested in the PBR material prompt.
- − The 'J' in JAPAN is slightly disproportionate.
Verdict: Gemini 3 Pro Image Preview produces a much more sophisticated rendering with beautiful textures and lighting that hit the 'PBR materials' requirement perfectly. While Imagen 4.0 Ultra Generate 001 provides a cleaner, more graphic layout, its materials look more like basic plastic and lack the refined detail found in the sushi of Gemini 3 Pro.
Adorable Baby Animals in Sunny Meadow
Text-to-Image“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”
AI Judge Analysis
Nano Banana Pro
- + Excellent fur texture with realistic lighting interaction.
- + Highly detailed and natural-looking background with sparkling dew drops.
- + Clean, coherent anatomy for all four subjects.
- − Only includes one butterfly despite the plural prompt.
- − Slightly less dynamic 'tumbling' interaction than the competitor.
Imagen 4.0 Ultra Generate 001
- + Strong adherence to 'chasing butterflies' with multiple colorful insects.
- + Very expressive posing with animals reaching and interacting.
- + Vibrant and diverse floral variety in the foreground.
- − Anatomic issues with the kitten, including an extra leg/paw under its chin.
- − The kitten's tail area is muddled and lacks clear structure.
- − The fox's front paws are strangely rendered as dark, digit-less stubs.
Verdict: Gemini 3 Pro Image Preview produces a much more polished and realistic image with superior fur textures and consistent anatomy, though it only included one butterfly. Imagen 4.0 Ultra Generate 001 captures the requested action and butterfly count better but suffers from significant anatomical errors, including extra limbs on the cat and poorly formed paws on the fox. Gemini 3 Pro is the winner for its professional technical execution and visual coherence.
Vintage Cafe Logo
Text-to-Image“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”
AI Judge Analysis
Nano Banana Pro
- + Perfect text rendering including the 'grave' accent in 'Caffè'.
- + Excellent vintage woodcut-style illustration for the cloche and steam.
- + Warm, cohesive color palette that perfectly matches the 'vintage minimalism' brief.
Imagen 4.0 Ultra Generate 001
- + Clean, minimalist layout with a modern-vintage feel.
- + Accurate text spelling and banner placement.
- + Consistent line-weight and vector-style execution.
- − The accent on 'Caffè' is reversed (acute instead of grave).
- − The steam lines or cloche handle are slightly off-center compared to the main dome.
- − Less 'character' and artistic texture compared to Model A.
Verdict: Gemini 3 Pro Image Preview is the winner because it delivered a superior level of artistic detail that perfectly captured the 'vintage' and 'warm texture' parts of the prompt while maintaining perfect typography. Imagen 4.0 Ultra is a strong minimalist logo, but it used the wrong accent mark on the word 'Caffè' and had slightly less visual appeal in the illustration.
Apollo 11: Journey to Tranquility
Text-to-Image“Create a clean, modern vector infographic poster about the Apollo 11 mission. NASA-inspired palette (navy, white, muted red, light gray). Flat-vector style, crisp lines, consistent iconography, subtle gradients only. Steps (stop at landing): 1. Launch (Saturn Vicon) 2. Earth Orbit (Earth + orbit ring icon) 3. Translunar (trajectory arc icon) 4. Lunar Orbit (Moon + orbit ring icon) 5. Descent (lunar module descending icon) 6. Landing (lunar module on the surface icon) Small supporting elements (minimal text): • Crew strip: three silhouette icons with only last names: Armstrong, Aldrin, Collins. • Landing site marker: Moon pin labeled "Tranquility" only. Layout constraints: generous margins, large readable labels, clean background with subtle stars. Vector-only, print-poster look, high resolution.”
AI Judge Analysis
Nano Banana Pro
- + Perfect text rendering for all steps and names.
- + Exceptional adherence to the logic of the mission sequence.
- + Clean, professional vector style that matches the NASA aesthetic flawlessly.
- − The 'Translunar' text is slightly overlapping the line but remains legible.
Imagen 4.0 Ultra Generate 001
- + Follows the requested color palette well.
- + Clean, high-level layout structure.
- − Failed to include the specific 6 steps requested.
- − Garbled text and nonsensical iconography.
- − No logical flow or mission narrative.
Verdict: Gemini 3 Pro Image Preview is the clear winner as it followed every instruction, including the specific sequence of events, icon descriptions, and text labels with near-perfect accuracy. Imagen 4.0 Ultra generated a generic infographic layout with gibberish text and failed to follow the requested 6-step mission structure.
Explore each model
Google's Imagen 4.0 Ultra model offering the highest fidelity and resolution for professional-grade image generation