Nano Banana 2 vs Grok Imagine Image Pro
Head-to-head across 13 challenges
Nano Banana 2
81.8%
win rate
Ties
0.0%
Grok Imagine Image Pro
18.2%
win rate
Challenge Results
Candid Street Photography
Text-to-Image“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”
AI Judge Analysis
Nano Banana 2
- + Excellent photorealistic texture and lighting that feels like a real film photograph.
- + Very strong adherence to the 'reflections on wet pavement' and 'candid street photo' feel with vibrant neon signs.
- + High technical accuracy in showing the bike chain and tools on the ground.
- − The man's right hand holding the wrench is anatomically jumbled and merged with the tool.
Grok Imagine Image Pro
- + Good inclusion of light rain streaks and motion blur on the background car lights.
- + Clear composition with a nice shallow depth of field.
- − The background and pavement look a bit more CGI/rendered compared to the grit of Model A.
- − The bike's kickstand and parts of the frame appear somewhat unnatural or floating.
- − The skin texture on the man's face is slightly too smooth and lacks 'natural skin texture' detail.
Verdict: Nano Banana 2 produces a significantly more realistic and cinematic image that stays true to the gritty, detailed aesthetic of Japanese street photography. While it has a slight anatomical error in the hand, its environmental textures and lighting are far superior to Grok Imagine Image Pro, which looks somewhat cleaner and more digital.
Geometric Composition
Text-to-Image“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”
AI Judge Analysis
Nano Banana 2
- + Excellent text rendering on the book spine.
- + Highly realistic glass transparency and perspective.
- + Matches all prompt elements including the specific lighting direction.
- − The glass cube is more of a hollow frame/aquarium shape than a solid cube.
Grok Imagine Image Pro
- + Captures the 'glass cube' aesthetic well with thick glass walls.
- + Clean, minimalistic composition.
- + Good depth and blur on the background plant.
- − Includes a strange duplicate reflection/half-sphere on the right side of the cube.
- − The sphere has a matte texture rather than the more common glass/marble look.
- − Perspective of the book on the cube is slightly warped at the front edge.
Verdict: Nano Banana 2 produces a significantly more realistic image with impressive text rendering and natural-looking lighting. Grok Imagine Image Pro suffers from a strange visual artifact where a second partial blue sphere appears inside the glass, detracting from its overall quality.
Modern Clean Menu
Text-to-Image“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”
AI Judge Analysis
Nano Banana 2
- + Complete menu functional design with text, prices, and descriptions.
- + Excellent adherence to the requested sections (Appetizers, Pizza, Mains).
- + Highly professional layout that looks like a real-world restaurant asset.
- − Small text descriptions contain some minor spelling artifacts.
- − The grid of food photos is slightly less uniform in lighting than Model B.
Grok Imagine Image Pro
- + Extremely clean and consistent food photography in a perfect 3x3 grid.
- + High visual clarity and vibrant colors.
- + Strictly minimalist aesthetic.
- − Fails to include actual menu content like item names, descriptions, or prices.
- − The layout is more of a mood board or category header than a functional restaurant menu.
Verdict: Nano Banana 2 produces a fully realized, professional restaurant menu including typography, branding, and pricing, which makes it much more useful for the prompt's intent. Grok Imagine Image Pro creates a beautiful grid of food photos, but fails to provide the textual components of a menu design, leaving it looking like an incomplete template.
Fantasy Warrior
Text-to-Image“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”
AI Judge Analysis
Nano Banana 2
- + Excellent adherence to the 'battle-worn' descriptor with heavy dirt and sweat texture on the face.
- + Highly detailed engraving and rune work on the plate armor.
- + The inclusion of the sword hilt and hand adds to the paladin warrior narrative.
- − The hand gripping the sword has some anatomical awkwardness in the finger proportions and lighting.
- − The background torchlight is a bit blown out compared to the rest of the scene.
Grok Imagine Image Pro
- + Superb text rendering on the gorget ('Lux in tenebris') which fits the paladin theme perfectly.
- + Very clean and symmetrical braid/bead work that clearly follows the prompt.
- + Exceptional skin texture and lifelike eyes with realistic catchlights.
- − The character looks more 'gritty fashion' than truly 'battle-worn' compared to the heavier weathering in the other image.
- − The composition is slightly more static and centered.
Verdict: Both models followed the prompt exceptionally well, but Grok Imagine Image Pro wins due to the superior clarity of its textures and the impressive inclusion of legible thematic text on the armor. While Nano Banana 2 captures a more intense 'battle-worn' atmosphere with grittier skin textures, Grok Imagine Image Pro offers better overall visual coherence and more precise detail in the hair beads and engraving.
Bald man challenge
Editing“Give the person a full, thick head of natural hair with realistic texture, density, and a natural hairline. Preserve facial features and lighting.”
AI Judge Analysis
Nano Banana 2
- + Seamless integration of hair with the existing sideburns and beard
- + Excellent preservation of the original head shape and facial features
- + Very realistic, rough texture that matches the character's aesthetic
- − The hairline is slightly high, though physically plausible
Grok Imagine Image Pro
- + Natural-looking hair texture and logical growth direction
- + Good preservation of the original environment and clothing
- − Noticeable distortion of the skull shape, making the forehead appear slightly indented
- − The hair placement feels a bit like a 'topper' rather than a natural extension of the scalp
Verdict: Both models did an excellent job of matching the lighting and texture of the original image. Nano Banana 2 is the winner because it maintained the correct anatomical structure of the subject's head, whereas Grok Imagine Image Pro slightly flattened the top of the forehead, creating a less natural transition.
Isometric Miniature Diorama Scenes
Text-to-Image“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”
AI Judge Analysis
Nano Banana 2
- + Excellent text rendering with 'JAPAN' and 'SUSHI' clearly legible and well-positioned.
- + Highly detailed and realistic sushi variety, including accurate textures for roe, eel, and sashimi.
- + Beautiful diorama base featuring moss, rocks, and a secondary wooden platter that adds depth.
- − Lean more towards realism than the requested '3D cartoon' aesthetic.
- − The flag icon is placed to the left of 'SUSHI' rather than below it as implied by standard layout hierarchy.
Grok Imagine Image Pro
- + Perfectly captures the '3D cartoon' style with soft, rounded, and playful textures.
- + Minimalist and ultra-clean composition that adheres strictly to the 'minimal garnish' request.
- + Center-aligned text layout including the flag icon is very balanced.
- − The sushi pieces are repetitive and lack the intricate variety seen in the other model.
- − The wooden base texture is somewhat simple compared to the detailed diorama requested.
Verdict: Nano Banana 2 produces a stunningly detailed diorama with high-quality PBR materials and perfect text, though it leans more towards realism. Grok Imagine Image Pro better captures the '3D cartoon' aesthetic with soft, clean shapes, but lacks the intricate detail and variety provided by the former. Nano Banana 2 is the preferred choice for its superior visual complexity and professional finish while still meeting all text and layout requirements.
Night Sky Transformation
Editing“Change the scene to night: a deep, dark sky with subtle, glistening stars visible behind the mountain.”
AI Judge Analysis
Nano Banana 2
- + Excellent preservation of the source image's geometry and composition.
- + Very high-quality sky with realistic Milky Way details and atmospheric depth.
- + Perfect lighting adjustment, transitioning the warm sunset glow to cool moonlight and town lights.
- − Slightly brighter than 'deep dark sky' might imply, but still very night-appropriate.
Grok Imagine Image Pro
- + Achieves a very dark, high-contrast night sky as requested.
- + Maintains the core structure of the original mountain and town.
- + Good placement of stars across the entire sky.
- − The stars appear as somewhat uniform, sharp white dots compared to the more natural clusters in A.
- − Small artifacts/distortions in the snow textures on the mountain peak compared to the original.
Verdict: Both models successfully turned the scene into a night shot while maintaining the original town and mountain layout. Nano Banana 2 is the winner because its star field is more visually complex and realistic, and it handled the global relighting of the mountain face with better tonal range and texture preservation. Grok Imagine Image Pro produced a very good result, but the stars look a bit more artificial and uniform.
Adorable Baby Animals in Sunny Meadow
Text-to-Image“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”
AI Judge Analysis
Nano Banana 2
- + Perfect adherence to the animal count, featuring exactly one of each requested species.
- + Dynamic and realistic movement with a sense of playful chasing across the meadow.
- + Excellent lighting effects with subtle god rays and morning dew consistent with the prompt.
- − The butterfly on the dog's tail is a bit static and looks slightly pasted on.
Grok Imagine Image Pro
- + Very expressive and cute facial expressions on the puppy and fox.
- + Vibrant colors with high contrast in the wildflower patches.
- + Includes more butterflies to enhance the 'chasing' narrative.
- − Failed the prompt count by including two tabby kittens instead of one.
- − The fox's anatomy is slightly awkward in the 'tumbling' pose.
- − The golden retriever puppy appears much larger relative to the others than a newborn would be.
Verdict: Nano Banana 2 followed the prompt's specific animal list perfectly and captured a more natural sense of movement and '8K' clarity. Grok Imagine Image Pro produced a very charming image but failed on the prompt instructions by adding an extra kitten and had less realistic scaling between the animals.
Over-the-top cartoon caricature
Editing“Create a caricature of me and my job. Make it exaggerated and humorous, incorporating my profession as a tv show anchor and my love for dogs and hockey.”
AI Judge Analysis
Nano Banana 2
- + Excellent caricature style with hand-drawn colored pencil textures.
- + Clever wordplay in the text including 'W-K9 NEWS' and 'Anchor's Puck Drop'.
- + Captures the subject's likeness effectively within the stylized caricature.
- − The hand holding the physical card is an unnecessary meta-element.
- − A few minor floating microphone artifacts in the background.
Grok Imagine Image Pro
- + Strong incorporation of all elements including a hockey trophy and multiple dogs.
- + The facial caricature is very exaggerated and humorous as requested.
- + Clean, professional digital illustration style with good text rendering.
- − The likeness is slightly more generic than Model A.
- − Some repetitive elements like the grid of identical golden retriever puppies.
Verdict: Both models followed the instructions perfectly, creating humorous caricatures that blend the subject's career and hobbies. Nano Banana 2 stands out for its creative wordplay and more traditional artistic texture, while Grok Imagine Image Pro provides a more polished digital look with a wider variety of background details.
Neutral Expression to Genuine Smile
Editing{
"action": "image_edit",
"reference": "uploaded neutral portrait",
"change": "Warm genuine Duchenne smile: lips curved up, slight natural teeth, soft eye crinkles, subtle cheek raise",
"details": "Realistic smiling skin (dimples if present, soft cheek shadows), slightly brighter eyes; keep exact eye shape/color/iris",
"preserve_exact": "Face identity/structure, eyes/nose/lips/eyebrows, hair, skin texture/pores/freckles, makeup, clothing, head pose, background, lighting, shadows, framing",
"no_changes": "No face shape change, no new features, no gaze shift, no hair/clothing/lighting/background edits",
"style": "Ultra-photorealistic 8K portrait, sharp face focus, natural soft lighting, realistic skin glow"
}
AI Judge Analysis
Nano Banana 2
- + Excellent preservation of skin texture and freckles
- + Highly realistic eye crinkles and facial folds for the smile
- + Maintains the exact original lighting and subtle shadows
- − None notable
Grok Imagine Image Pro
- + Successfully creates a warm smile
- + Preserves the overall identity and background correctly
- − Slightly over-smoothens the skin on the cheeks compared to source
- − The teeth look a bit more generic/uniform than Model A
Verdict: Both models followed the instructions very well, correctly implementing the Duchenne smile while preserving the subject's identity. Nano Banana 2 is the superior choice because it perfectly preserved the original skin texture and freckles while adding more natural and realistic 'crows feet' and eye crinkles, whereas Grok Imagine Image Pro slightly softened the skin detail on the cheeks.
Studio Ghibli Anime Style
Editing“Transform this photo into a Studio Ghibli–inspired illustration. Use soft pastel colors, hand-painted textures, gentle lighting, dreamy backgrounds, and a warm, nostalgic mood”
AI Judge Analysis
Nano Banana 2
- + Perfectly captures the Studio Ghibli art style with clean line work and watercolor-esque textures.
- + Maintains the composition and character poses of the original meme extremely well.
- + Enhances the background with charming European-style details like wisteria and flower boxes typical of the requested aesthetic.
- − The man's facial expression is slightly more 'worried' than 'distracted' compared to the original.
Grok Imagine Image Pro
- + Excellent preservation of the original characters' facial features and expressions.
- + Provides a soft, painterly texture that aligns with the prompt.
- − The background is very blurry and lacks the detailed 'dreamy' world-building characteristic of Ghibli films.
- − The color palette feels slightly washed out compared to the vibrant but soft palette expected.
Verdict: Nano Banana 2 is the clear winner as it fully commits to the Studio Ghibli style, transforming the background into a lush, hand-painted environment while maintaining the meme's structure. Grok Imagine Image Pro applies a nice filter-like effect to the characters but fails to provide the rich background detail that defines the requested art style.
Vintage Cafe Logo
Text-to-Image“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”
AI Judge Analysis
Nano Banana 2
- + Perfect text rendering of the name and 'EST. 1720'
- + Elegant woodcut-style illustration of the cloche and steam
- + Strongest adherence to the 'banner' and 'vintage' aesthetic tags
- − The 'À' in Caffè is slightly disconnected/stylized aggressively
- − Border details are a bit busy for a truly 'minimalist' logo
Grok Imagine Image Pro
- + Clean vector lines following the minimalist requirement
- + Accurate text and date rendering
- + Clear, simple composition
- − Lacks the requested 'banner' for the establishment date
- − Steam element looks like a single generic squiggle
- − Very plain compared to the requested vintage style
Verdict: Nano Banana 2 followed the prompt's specific details much better, particularly the inclusion of the 'EST. 1720' banner and the vintage texture. While Grok Imagine Image Pro is more 'minimalist', it failed to include the requested banner and the illustration lacks the historical character implied by the prompt, whereas Nano Banana 2 produced a high-quality, cohesive emblem.
Apollo 11: Journey to Tranquility
Text-to-Image“Create a clean, modern vector infographic poster about the Apollo 11 mission. NASA-inspired palette (navy, white, muted red, light gray). Flat-vector style, crisp lines, consistent iconography, subtle gradients only. Steps (stop at landing): 1. Launch (Saturn Vicon) 2. Earth Orbit (Earth + orbit ring icon) 3. Translunar (trajectory arc icon) 4. Lunar Orbit (Moon + orbit ring icon) 5. Descent (lunar module descending icon) 6. Landing (lunar module on the surface icon) Small supporting elements (minimal text): • Crew strip: three silhouette icons with only last names: Armstrong, Aldrin, Collins. • Landing site marker: Moon pin labeled "Tranquility" only. Layout constraints: generous margins, large readable labels, clean background with subtle stars. Vector-only, print-poster look, high resolution.”
AI Judge Analysis
Nano Banana 2
- + Excellent graphic design and composition, balancing icons and text effectively.
- + Perfect adherence to the requested NASA-inspired color palette and flat vector style.
- + Accurate text rendering for the mission names and crew members.
- − Includes a fourth unidentified crew member icon, which is historically incorrect for Apollo 11.
Grok Imagine Image Pro
- + Clean, vertical timeline structure that is logical for a mission infographic.
- + Correct number of crew members listed with full names.
- + Minimalist aesthetic that follows the flat vector prompt well.
- − The 'Descent' icon includes a fiery engine plume which contradicts the 'flat vector' and 'clean icon' aesthetic compared to the others.
- − Text is slightly small and harder to read at the bottom.
Verdict: Nano Banana 2 produces a more aesthetically pleasing and professional-looking infographic with superior icon design and layout. While Grok Imagine Image Pro is more historically accurate regarding the crew count, Nano Banana 2's visual clarity and faithful adherence to the modern vector style make it the more successful image for a poster design.
Nano Banana 2
Gemini 3.1 Flash with image generation capabilities. High-efficiency image generation model with support for text rendering, reference images, search grounding, and thinking mode. The efficient counterpart to Gemini 3 Pro Image.
Grok Imagine Image Pro
xAI's premium image generation model offering higher fidelity output and stronger performance on single-image editing benchmarks compared to the standard Grok Imagine model