Nano Banana Pro vs Grok Imagine Image Pro
Head-to-head across 16 challenges
Nano Banana Pro
66.7%
win rate
Ties
11.1%
Grok Imagine Image Pro
22.2%
win rate
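The three percentages above are shares of wins and ties, and the arithmetic can be sketched as below. This is a minimal illustration, not the site's actual scoring code: the function name `win_rates` is hypothetical, and since the page does not publish the underlying tallies, the counts in the usage line are assumptions chosen only because they yield percentages of the same form.

```python
def win_rates(wins_a: int, ties: int, wins_b: int) -> tuple[float, float, float]:
    """Return (A win %, tie %, B win %) rounded to one decimal place."""
    total = wins_a + ties + wins_b
    if total == 0:
        raise ValueError("at least one decided comparison is required")
    # Each share is the count divided by all comparisons, as a percentage.
    return tuple(round(100 * n / total, 1) for n in (wins_a, ties, wins_b))


# Hypothetical tally (an assumption, not data from the page).
rates = win_rates(6, 1, 2)
print(rates)
```

Rounding each share independently means the three values may not always sum to exactly 100.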
Challenge Results
Geometric Composition
Text-to-Image
“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”
AI Judge Analysis
Nano Banana Pro
- + Excellent adherence to the glass cube prompt, depicting it as a thin-walled, hollow vessel.
- + Very realistic lighting and shadows, particularly the sunlight streaming from the left across the table.
- + The plant is visible both behind and through the glass with appropriate refraction.
- − The glass cube lacks a top pane, making it more of an open tank than a closed cube.
Grok Imagine Image Pro
- + Captures the 'closed cube' concept better with a visible top surface for the book to rest on.
- + The blue sphere has a nice matte texture that contrasts well with the glass.
- + Composition is clean and aesthetically pleasing.
- − The glass cube has physically impossible thickness and strange internal reflections (a ghostly second sphere).
- − The plant's positioning is slightly off: it appears to the side of the cube rather than behind it and visible through the glass.
Verdict: Gemini 3 Pro Image Preview produces a significantly more realistic image with convincing lighting and texture, though the glass container is open at the top. Grok Imagine Image Pro follows the spatial logic of the cube better, but suffers from glass rendering artifacts and a strange 'ghost' sphere reflection that defies physics. Gemini is the preferred choice for its photographic quality and better handling of the interaction between the plant and the glass.
Candid Street Photography
Text-to-Image
“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”
AI Judge Analysis
Nano Banana Pro
- + Excellent realism with authentic textures on skin, clothing, and the rusted bicycle.
- + Perfectly captures the 'imperfect framing' and 'candid' feel of a street photo.
- + Superior environmental storytelling with blurred Japanese signage and realistic rain effects.
- − Slightly less motion blur on the cars compared to Grok Imagine Image Pro.
Grok Imagine Image Pro
- + Stronger adherence to the 'motion blur from passing cars' instruction.
- + Clearer use of shallow depth of field with visible bokeh highlights.
- − The character and bicycle look slightly too clean/synthetic compared to the gritty realism of Nano Banana Pro.
- − Structural issues with the bicycle frame near the rear wheel/wrench area.
Verdict: Gemini 3 Pro Image Preview wins by a significant margin due to its exceptional photorealism and adherence to the 'candid' and 'no stylization' aspects of the prompt. While Grok Imagine Image Pro captures the motion blur and depth of field well, it feels more like a staged AI generation, whereas the Gemini image looks like a genuine film photograph with tactile textures and authentic environmental details.
Fantasy Warrior
Text-to-Image
“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”
AI Judge Analysis
Nano Banana Pro
- + Exceptional texturing on the leather straps and metal engravings.
- + Highly realistic skin texture including pores and subtle dirt distribution.
- + Strong use of cinematic lighting with natural-looking bokeh sparks.
- − The torch in the corner is a bit distracting and lacks fine detail.
- − Some beads appear slightly fused with the hair strands.
Grok Imagine Image Pro
- + Successfully incorporated legible text ('Lux in tenebris') into the armor design.
- + Clear interpretation of the braided hair and beads.
- + Good balance of battle-worn elements like rust and facial scars.
- − The leather straps look somewhat flat compared to the metalwork.
- − Skin texture is slightly too smooth and airbrushed for a 'battle-worn' character.
Verdict: Gemini 3 Pro Image Preview delivers a more lifelike and gritty portrait with superior texture work on the skin and leather, creating a more cohesive 'battle-worn' feel. While Grok Imagine Image Pro includes an impressive detail with the engraved Latin text, its overall rendering is slightly more digital and less grounded than Gemini's output.
Modern Clean Menu
Text-to-Image
“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”
AI Judge Analysis
Nano Banana Pro
- + Comprehensive menu layout including titles, descriptions, and pricing.
- + Higher quality, more realistic food photography with varied textures.
- + Excellent presentation showing the menu in a real-world context on a wooden table.
- − The text becomes illegible and nonsensical in the descriptions.
- − Repetitive item names (the same 'Bruschetta' and 'Margherita Pizza' used multiple times).
Grok Imagine Image Pro
- + Perfect grid alignment and very clean minimalist aesthetic.
- + Accurate organization of photos under the relevant section headers.
- + Very clean, high-contrast white background as requested.
- − Fails to include menu details like item names, descriptions, or prices beneath the photos.
- − The food photography looks slightly more artificial/CG compared to Nano Banana Pro.
- − Lacks the 'vibrant accents' requested, opting for very thin colored lines instead.
Verdict: Gemini 3 Pro Image Preview provides a more realistic and complete menu design with functional sections and pricing, though the fine text is garbled. Grok Imagine Image Pro creates a very clean, aesthetic grid of photos but fails to generate a functional menu with actual item listings and descriptions. Gemini 3 Pro is the better choice for a professional layout that captures the requested 'casual dining' feel.
Bald man challenge
Editing
“Give the person a full, thick head of natural hair with realistic texture, density, and a natural hairline. Preserve facial features and lighting.”
AI Judge Analysis
Nano Banana Pro
- + Excellent hair texture that matches the rugged, messy style of the subject's beard.
- + Flawless preservation of the original facial features, glasses, and background.
- + The hairstyle follows the natural volume and lighting of the scene perfectly.
- − Small artifacts around the topmost flyaway hairs where they meet the sky.
Grok Imagine Image Pro
- + Clean skin-to-hair transition at the hairline.
- + Successfully preserved the background and clothing from the source image.
- − The hair texture is slightly too smooth and groomed compared to the rugged beard.
- − The forehead structure was slightly altered/lengthened to accommodate the new hairline.
- − The hair looks a bit like a separate layer rather than naturally integrated.
Verdict: Gemini 3 Pro Image Preview provides a much more convincing edit by matching the hair's texture and 'messiness' to the existing beard, making the result look authentic. Grok Imagine Image Pro produces a cleaner, more groomed hairstyle, but it feels less natural in the context of the rugged source image and slightly alters the forehead shape.
Isometric Miniature Diorama Scenes
Text-to-Image
“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”
AI Judge Analysis
Nano Banana Pro
- + Excellent adherence to the 'diorama base' request with a square wooden platform and moss details.
- + Superior text rendering and layout for 'JAPAN', 'SUSHI', and the flag icon.
- + Highly varied and realistic 3D cartoon sushi models including nigiri, maki, and gunkan.
- − The camera angle is slightly lower than a true 45-degree isometric perspective.
- − Small artifacts in the cherry blossom branches extending from the plate.
Grok Imagine Image Pro
- + Clean, professional 3D materials with high-quality translucency on the fish textures.
- + Very clean background and overall composition.
- + Perfectly centered and formatted as requested.
- − The text 'JAPAN' is smaller and less impactful than requested.
- − The base is a simple circular board rather than a 'diorama base', lacking the miniature environment feel of its competitor.
- − Repetitive sushi pieces (five identical maki rolls).
Verdict: Gemini 3 Pro Image Preview is the winner as it perfectly captured the 'diorama base' requirement, creating a much more interesting miniature scene with moss and flowers. While Grok Imagine Image Pro has excellent material shaders, its interpretation of the prompt is more basic, and its text rendering is less bold than requested.
Night Sky Transformation
Editing
“Change the scene to night: a deep, dark sky with subtle, glistening stars visible behind the mountain.”
AI Judge Analysis
Nano Banana Pro
- + Excellent source preservation, maintaining identical town and mountain details.
- + Realistic conversion of sky lighting to deep night with a visible Milky Way path.
- + Natural desaturation of the landscape to reflect the low-light environment.
- − None notable; very high-fidelity edit.
Grok Imagine Image Pro
- + Successfully applied the night scene and stars requested.
- + Preserved the composition and main elements of the original image well.
- − Introduced slight compression or blurring in the fine details of the village buildings.
- − The stars appear slightly more generic and uniform compared to Nano Banana Pro.
Verdict: Both models performed exceptionally well at this image editing task, preserving the complex details of the town and mountains while accurately shifting the atmosphere to night. Gemini 3 Pro Image Preview is the winner as it maintained the absolute sharpness of the original image's architecture and provided a more nuanced, realistic night sky, whereas Grok Imagine Image Pro introduced a very slight softening of detail in the valley.
Over-the-top cartoon caricature
Editing
“Create a caricature of me and my job. Make it exaggerated and humorous, incorporating my profession as a tv show anchor and my love for dogs and hockey.”
AI Judge Analysis
Nano Banana Pro
- + Excellent integration of all three themes into a cohesive 'Dog & Hockey News' studio environment.
- + Strong caricature style that maintains a reasonable likeness to the source subject's facial structure and hair color.
- + Clever details like the hockey-jersey-wearing dogs and the goalie mask logo on the shirt.
Grok Imagine Image Pro
- + High-energy, humorous facial expression that fits the 'exaggerated' part of the prompt well.
- + Very clear and readable text for 'PUPS & PUCKS' and 'Puppy of the Day'.
- + Includes a wide variety of dogs and a large championship trophy for visual interest.
- − The facial likeness to the source woman is heavily distorted, making her look much younger/different.
- − Noticeable anatomy errors, specifically the woman's hand holding the hockey stick, which has an extra-long thumb/finger.
Verdict: Both models followed the instructions well, creating humorous and thematic caricatures. Gemini 3 Pro Image Preview is the winner because it maintained a better likeness to the source image's subject while creatively weaving the dogs and hockey elements together in a high-quality illustration. Grok Imagine Image Pro had better text rendering, but the anatomical errors in the hands and the loss of the original subject's facial identity made it less successful as a portrait edit.
Adorable Baby Animals in Sunny Meadow
Text-to-Image
“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”
AI Judge Analysis
Nano Banana Pro
- + Perfectly includes all four requested species: golden retriever, tabby kitten, bunny, and fox.
- + Excellent fur texture and lighting, with very clear 'god rays' and dew effects.
- + The composition feels more dynamic and 'hyper-photorealistic' as requested.
- − Only features a single butterfly despite the plural prompt.
- − The anatomy of the back legs on the puppy and fox is slightly blurred/merged into the grass.
Grok Imagine Image Pro
- + Captures a very playful 'tumbling' interaction between the fox and the group.
- + Includes multiple butterflies and a very expansive, beautiful meadow background.
- + Good rendering of the sunrise and long shadows.
- − Failed the count requirement by including two tabby kittens instead of one.
- − The kitten on the left has a strange, elongated tail and body proportion.
- − The puppy's front paw is oddly fused or malformed near the kitten's head.
Verdict: Gemini 3 Pro Image Preview is the winner because it adhered strictly to the requested animal count and species, whereas Grok Imagine Image Pro added an extra kitten. Gemini also achieved a higher level of 'hyper-photorealistic' detail in the fur and lighting, while Grok's image had several anatomical errors and a slightly more illustrative feel.
Victorian Greenhouse Oasis
Text-to-Image
“Hyper-photorealistic interior of a lush Victorian glass greenhouse filled with exotic tropical plants, vibrant blooming orchids, tall ferns, colorful butterflies in flight, sunlight filtering through ornate glass roof creating realistic caustics and dew on leaves, intricate iron framework visible, misty atmosphere, 8K masterpiece.”
AI Judge Analysis
Nano Banana Pro
- + Excellent representation of the 'misty atmosphere' with light scattering through the haze.
- + Very detailed foliage, particularly the moss and individual orchid petals.
- + Captures the height and scale of a large Victorian greenhouse with the vaulting glass roof.
- − Some butterflies appear flat or poorly integrated into the 3D space.
- − The iron framework is slightly repetitive and lacks the ornate centerpiece detail found in some Victorian structures.
Grok Imagine Image Pro
- + Stunning intricate ironwork in the dome, featuring classic Victorian motifs.
- + Strong composition with a leading path that adds depth to the scene.
- + Excellent use of light and shadow on the cobblestones, creating a more grounding environment.
- − A few butterflies are disproportionately large, making them look like stickers on the lens.
- − Slightly less 'misty' atmosphere compared to Nano Banana Pro, feeling a bit crisper than requested.
Verdict: Both models followed the prompt excellently, but Grok Imagine Image Pro stands out for its superior composition and much more intricate, historically accurate Victorian ironwork architecture. While Gemini 3 Pro Image Preview better captured the 'misty atmosphere' and fine dew-like textures on the leaves, Grok's overall lighting and more compelling layout with the stone path make for a more immersive image.
Heroic Super Hero Portrait
Text-to-Image
“Hyper-photorealistic full-body portrait of a female superhero standing triumphantly on a New York skyscraper rooftop at golden sunset, wearing a classic modest superhero costume with flowing cape, chest emblem, gloves, and boots in red and blue colors, practical design, short hair, strong determined heroic expression looking into the distance, powerful confident stance with hands on hips and cape billowing dramatically in the wind, detailed urban cityscape background, warm natural sunlight with sharp shadows and fabric highlights, ultra-sharp textures on suit, hair, and concrete, 8K masterpiece, empowering family-friendly style.”
AI Judge Analysis
Nano Banana Pro
- + Excellent sense of scale and depth within the New York City background.
- + Very high quality of light and atmospheric effects from the golden sunset.
- + Highly realistic textures on the costume and the concrete rooftop.
- − Failed to place the character's hands on her hips as requested.
- − The character is looking slightly to the side rather than into the distance as a primary focus.
- − The 'H' emblem is a bit generic and slightly distorted.
Grok Imagine Image Pro
- + Accurately captures the 'hands on hips' pose requested in the prompt.
- + The superhero emblem is sharp and well-defined.
- + Clearer full-body composition with a more direct heroic expression.
- − The background architecture looks less like high-rise Manhattan and more like a generic lower-rise neighborhood.
- − The lighting is somewhat flat compared to the dramatic sunset in Nano Banana Pro's image.
- − The hands and gloves have slight anatomical clipping issues with the hips.
Verdict: Gemini 3 Pro Image Preview produces a significantly more cinematic and photorealistic image with superior environmental detail and lighting, though it fails the 'hands on hips' pose. Grok Imagine Image Pro follows the posing instructions more accurately but lacks the epic scale and lighting quality that the prompt's 'skyscraper rooftop' and 'golden sunset' keywords implied. Ultimately, Gemini's visual quality and atmosphere make it the better representation of a 'hyper-photorealistic' superhero scene.
Studio Ghibli Anime Style
Editing
“Transform this photo into a Studio Ghibli–inspired illustration. Use soft pastel colors, hand-painted textures, gentle lighting, dreamy backgrounds, and a warm, nostalgic mood”
AI Judge Analysis
Nano Banana Pro
- + Perfectly captures the Studio Ghibli cel-shaded aesthetic with clean line work.
- + Creatively enhances the background with a charming yellow tram and architectural details in Ghibli style.
- + Preserves the iconic composition and facial expressions of the 'distracted boyfriend' meme accurately.
- − The lighting is a bit brighter/flatter compared to the requested 'gentle, dreamy' feel.
Grok Imagine Image Pro
- + Excellent watercolor/hand-painted texture that aligns with Ghibli concept art.
- + Successfully creates a soft, dreamy atmosphere with warm, nostalgic pastel tones.
- + Maintains strong faithfulness to the original source image's background structure.
- − The character line art is a bit less defined than typical Ghibli animation styles.
- − The hand of the woman in red is slightly fused with her body.
Verdict: Both models performed exceptionally well at translating a famous meme into the Ghibli style. Gemini 3 Pro Image Preview looks like a high-quality animation frame from a Ghibli film, complete with character-consistent line art and a beautifully reimagined background. Grok Imagine Image Pro feels more like a scenic watercolor concept painting with a softer, dreamier mood, but Gemini 3 Pro Image Preview wins slightly for its superior character rendering and creative background additions.
Neutral Expression to Genuine Smile
Editing
{
"action": "image_edit",
"reference": "uploaded neutral portrait",
"change": "Warm genuine Duchenne smile: lips curved up, slight natural teeth, soft eye crinkles, subtle cheek raise",
"details": "Realistic smiling skin (dimples if present, soft cheek shadows), slightly brighter eyes; keep exact eye shape/color/iris",
"preserve_exact": "Face identity/structure, eyes/nose/lips/eyebrows, hair, skin texture/pores/freckles, makeup, clothing, head pose, background, lighting, shadows, framing",
"no_changes": "No face shape change, no new features, no gaze shift, no hair/clothing/lighting/background edits",
"style": "Ultra-photorealistic 8K portrait, sharp face focus, natural soft lighting, realistic skin glow"
}
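A structured prompt like the one above can be assembled programmatically rather than written by hand. The sketch below is illustrative only: `build_edit_prompt` is a hypothetical helper, the key names are copied from the prompt shown here, and no model API is called because the page does not say how the prompt was submitted.

```python
import json


def build_edit_prompt(change: str, preserve: list[str], forbid: list[str]) -> str:
    """Serialize a structured image-edit instruction to a JSON string."""
    prompt = {
        "action": "image_edit",
        "reference": "uploaded neutral portrait",
        "change": change,
        # Lists are joined into comma-separated strings, mirroring the prompt above.
        "preserve_exact": ", ".join(preserve),
        "no_changes": ", ".join(forbid),
    }
    return json.dumps(prompt, indent=2, ensure_ascii=False)


text = build_edit_prompt(
    change="Warm genuine Duchenne smile",
    preserve=["hair", "head pose", "background", "lighting"],
    forbid=["No face shape change", "no gaze shift"],
)
print(text)
```

Keeping the preserve and forbid items as lists makes it easier to reuse the same constraints across several edits before serializing them.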
AI Judge Analysis
Nano Banana Pro
- + Excellent preservation of skin texture and freckles
- + Subtle, realistic Duchenne eye crinkles
- + Highly accurate preservation of hair and background
- − The smile is slightly asymmetrical and feels a bit forced compared to Grok Imagine Image Pro
Grok Imagine Image Pro
- + Very warm and genuine expression as requested
- + Excellent tooth rendering
- + Perfectly maintains identity and features while applying a significant expression change
- − Slightly smooths out some of the skin texture/freckles compared to the original
Verdict: Both models succeeded exceptionally well at this difficult editing task. Gemini 3 Pro Image Preview preserved the original skin texture more accurately, including the fine freckles on the nose. However, Grok Imagine Image Pro produced a more 'genuine' and visually appealing smile that felt more natural to the facial structure, making it the better interpretation of the Duchenne smile prompt.
Golden Hour Stroll
Editing
“Add dynamic motion to this photo: make hair blow in the wind, add leaves flying, energetic and lively feel.”
AI Judge Analysis
Nano Banana Pro
- + Excellent hair physics that creates a very dynamic flying effect.
- + Higher level of detail in the added leaves with varied colors (red, orange).
- + Adds realistic movement to the water in the background to match the wind.
- − Some of the leaves near the dog's legs appear as blurry brown smudges.
- − A leaf in the top-right corner is floating in a way that looks slightly disconnected.
Grok Imagine Image Pro
- + Good overall preservation of the source image's identity and lighting.
- + Hair movement is present and looks natural, albeit less extreme.
- + Uniform leaf style creates a consistent aesthetic.
- − The leaves look somewhat flat and like a 2D overlay compared to the rest of the image.
- − Lower sense of 'dynamic motion' compared to Nano Banana Pro; the scene feels less windy.
Verdict: Gemini 3 Pro Image Preview is the winner because it interpreted 'dynamic motion' much more effectively, creating a dramatic wind effect through the hair and even adding choppy ripples to the water. Grok Imagine Image Pro added the requested elements, but the leaves feel more like static stickers placed over the image rather than objects moving through the 3D space.
Vintage Cafe Logo
Text-to-Image
“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”
AI Judge Analysis
Nano Banana Pro
- + Excellent hand-drawn artistic style with beautiful cross-hatching and shading.
- + The stylized steam and banner integration create a cohesive, elegant vector emblem.
- + Perfect typography that maintains the 'vintage' aesthetic requested.
- − The texture on the banner is slightly busy, which might challenge very small-scale minimalist use.
Grok Imagine Image Pro
- + Clean, minimalist design that fits a modern-vintage brand identity.
- + Clear cloche icon and distinct circular border structure.
- + Accurate text placement and correct spelling of all requested elements.
- − The cloche is silver/grey, missing the 'warm brown and cream tones' requested for the whole logo.
- − The steam effect is very simple and lacks the 'retro' flair of the rest of the elements.
Verdict: Gemini 3 Pro Image Preview provides a much more sophisticated and artistic interpretation of a vintage logo, with superior linework and a better grasp of the 'warm brown' color palette. While Grok Imagine Image Pro produced a functional minimalist design, it failed to apply the requested color scheme to the central icon and lacked the classic charm seen in Gemini's version.
Apollo 11: Journey to Tranquility
Text-to-Image
“Create a clean, modern vector infographic poster about the Apollo 11 mission. NASA-inspired palette (navy, white, muted red, light gray). Flat-vector style, crisp lines, consistent iconography, subtle gradients only. Steps (stop at landing): 1. Launch (Saturn V icon) 2. Earth Orbit (Earth + orbit ring icon) 3. Translunar (trajectory arc icon) 4. Lunar Orbit (Moon + orbit ring icon) 5. Descent (lunar module descending icon) 6. Landing (lunar module on the surface icon) Small supporting elements (minimal text): • Crew strip: three silhouette icons with only last names: Armstrong, Aldrin, Collins. • Landing site marker: Moon pin labeled "Tranquility" only. Layout constraints: generous margins, large readable labels, clean background with subtle stars. Vector-only, print-poster look, high resolution.”
AI Judge Analysis
Nano Banana Pro
- + Excellent visual flow that mimics a trajectory path through the poster.
- + Superior integration of text and graphics into a cohesive infographic layout.
- + Includes thoughtful extra details like the crew names and the Tranquility Base marker.
- − Some minor text artifacts in the very small 'translunar trajectory' label.
- − Uses more gradients than the 'flat-vector' request specified.
Grok Imagine Image Pro
- + Strictly adheres to the flat-vector style with crisp, clean circles.
- + Text rendering is very legible and accurate for all steps.
- + Accurate color palette following the NASA-inspired constraints.
- − The vertical layout is very basic and leaves a lot of empty dead space.
- − The trajectory arc in step 3 is tiny and disconnected from the overall flow.
- − Lacks the dynamic 'poster' feel requested, looking more like a simple slide.
Verdict: Gemini 3 Pro Image Preview created a much more engaging and professional-looking infographic with a logical visual flow and creative composition. While Grok Imagine Image Pro followed the 'flat' style more strictly, its layout is overly simplistic and fails to capture the 'poster' aesthetic requested in the prompt. Gemini is the clear winner for its superior information design and visual appeal.
Nano Banana Pro
Gemini 3 Pro with image generation capabilities, referred to as Gemini 3 Pro Image Preview in the judge verdicts above. It combines advanced reasoning with the ability to generate and edit images.
Grok Imagine Image Pro
xAI's premium image generation model, offering higher-fidelity output and stronger performance on single-image editing benchmarks than the standard Grok Imagine model.