Gemini 3 Pro with image generation capabilities. Combines advanced reasoning with the ability to generate and edit images.
Settled by community votes across 12 shared challenges, with an AI judge weighing in on each.
Nano Banana Pro
#2 of 44 in Text-to-Image
Not enough comparable category data
The chart appears once both models have ratings across at least three shared arena categories.
Z-Image Turbo
#15 of 44 in Text-to-Image
Where the votes landed
Nano Banana Pro
61.5%
win rate
Ties
7.7%
Z-Image Turbo
30.8%
win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Geometric Composition
Text-to-Image“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”
AI Judge Analysis
Nano Banana Pro
- + Excellent adherence to the 'partially visible through glass' requirement for the plant.
- + Highly realistic texture on the wooden table and the vintage red book.
- + Superior lighting and shadows that accurately reflect the soft window light from the left.
- − The glass cube lacks a physical top pane, making the book appear to float slightly or rest only on the vertical edges.
Z-Image Turbo
- + Clear interpretation of all requested objects with vibrant colors.
- + The blue sphere has a nice clean reflection on the bottom of the cube.
- − The plant is entirely behind the book/cube and not visible through the glass as requested.
- − The perspective of the glass cube is slightly warped, particularly the vertical edges.
- − The lighting is flatter and lacks the directional nuance requested by 'light from the left'.
Verdict: Gemini 3 Pro Image Preview provides a much more sophisticated and realistic image, particularly in how it handles the complex transparency of the plant being visible through the glass. Z-Image Turbo captures the basic elements but fails the specific positioning of the plant and lacks the photographic depth and lighting quality of the first model.
Candid Street Photography
Text-to-Image“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”
AI Judge Analysis
Nano Banana Pro
- + Excellent photorealism with skin textures and clothing that look like a 35mm film or high-end digital photograph.
- + Accurately represents 'candid street photo' and 'imperfect framing' with the presence of the taxi and pedestrians.
- + Captures the light rain and pavement reflections with high fidelity.
- − The bike's kickstand and rear structure are slightly physically incoherent.
- − Missing the requested 'motion blur' from the passing cars.
Z-Image Turbo
- + Strong adherence to the red bicycle and elderly man subjects.
- + Shows a decent attempt at a shallow depth of field.
- − The image quality is significantly lower, with a digital, flat aesthetic compared to the requested cinematic look.
- − The man appears to be standing over the bike rather than 'repairing' it as requested.
- − The car in the background is frozen in time, lacking the requested motion blur.
Verdict: Gemini 3 Pro Image Preview is the clear winner as it produces a photographically convincing image that feels like a real street scene in Japan, complete with natural textures and a cinematic atmosphere. Z-Image Turbo produces a much flatter, lower-quality image where the subject's pose doesn't quite match the 'repairing' action mentioned in the prompt.
Fantasy Warrior
Text-to-Image“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”
AI Judge Analysis
Nano Banana Pro
- + Exceptional texture on the engraved plate armor and leather straps.
- + Very high resolution with lifelike skin details and convincing battle scars.
- + Excellent adherence to the 'bokeh sparks' and 'warm torchlight' lighting atmosphere.
- − The torch flame on the right edge is a bit blurry and lacks definition.
Z-Image Turbo
- + Excellent character design with a realistic facial expression and authentic-looking dirt/scars.
- + The braid details and beads are very clear and intricate.
- + Good composition with a clear view of the torch and the resulting light reflections.
- − The bokeh effect is less pronounced than requested.
- − Some armor details, especially on the gorget/neck area, look slightly cluttered or muddy compared to Model A.
Verdict: Gemini 3 Pro Image Preview provides a superior level of fine-grained detail, particularly in the micro-textures of the armor engravings and the worn leather straps. While both models captured the prompt's essence effectively, Gemini 3 Pro's lighting and shallow depth of field feel more cinematic, whereas Z-Image Turbo has slightly less clarity in its textures.
Modern Clean Menu
Text-to-Image“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”
AI Judge Analysis
Nano Banana Pro
- + Excellent adherence to the grid layout for food photos.
- + Categorization into Appetizers, Pizza, and Mains is clearly executed and logically aligned with the photos above.
- + Typography is professional with distinct hierarchy between dish names, descriptions, and prices.
- − The text contains hallucinated 'lorem ipsum' style words rather than perfectly coherent English.
- − Some repeat menu items (e.g., Bruschetta listed four times) reduce the variety.
Z-Image Turbo
- + High-contrast, bold typography creates a strong modern aesthetic.
- + Vibrant orange accents provide the requested pop of color effectively.
- − The layout is messy, with text overlapping photo areas and inconsistent column widths.
- − Fails to follow the categorization prompt correctly, using a generic 'SE III ON' and merging headers into 'PIZZA MANS'.
- − Text rendering is poor and contains many nonsensical characters.
Verdict: Gemini 3 Pro Image Preview produces a much more realistic and usable menu layout that correctly maps food categories to organized lists. While Z-Image Turbo has bold colors, it fails on compositional logic, with poor text rendering and a disorganized grid that makes it feel less like a professional design tool output.
Magic Burger Explosion: Fiery Photorealism Challenge
Text-to-Image“Ad for 'Magic Burger'. Dynamic, exploded burger with all components (bun, patty, cheese, lettuce, tomato, sauce) suspended in mid-air. Emphasize photorealistic detail and a sense of motion. Dark, fiery background with glowing embers. Integrate text: 'MAGIC BURGER' as a prominent title, 'LIMITED TIME ONLY' as a secondary message, and '€6.99' in a starburst, all rendered with a fiery, glowing effect.”
AI Judge Analysis
Nano Banana Pro
- + Excellent adherence to the 'exploded' concept with suspended ingredients.
- + Superior text rendering with a consistent glowing fire effect.
- + Crisp texture on the bun and vegetables for a photorealistic look.
- − The sauce droplets look slightly artificial compared to the rest of the ingredients.
Z-Image Turbo
- + Strong fiery atmosphere with realistic lighting in the starburst.
- + Rich colors and high contrast in the background.
- − Failed the 'exploded burger' prompt; the burger is mostly assembled and static.
- − Repetitive text rendering, resulting in 'MAGIC BURGER BURGER'.
- − Lower dynamic energy than requested.
Verdict: Nano Banana Pro followed the prompt instructions much more accurately, creating a dynamic exploded burger layout with all text elements rendered perfectly. Z-Image Turbo failed to separate the burger components as requested and included a redundant word in the main title.
Chalkboard Menu
Text-to-Image“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”
AI Judge Analysis
Nano Banana Pro
- + Excellent photographic quality and atmosphere.
- + Perfectly rendered, realistic chalk texture with faint smudges and dust.
- + Handwriting is elegant, legible, and maintains a consistent style throughout.
- − The title is in all-caps rather than 'elegant cursive' as requested.
- − The handwriting looks a bit too perfect and uniform to be completely natural.
Z-Image Turbo
- + Very accurate text rendering including the multi-line layout.
- + Clearer chalk texture and variations in letter size.
- − Contains a spelling error ('Mustroom' instead of Mushroom).
- − Lacks the environmental context of the café requested in the prompt.
- − The 'cursive' request for the title was not followed.
Verdict: Nano Banana Pro produces a much more visually appealing and complete scene by including the cozy café atmosphere, and it handles the text with high realism and no spelling errors. Z-Image Turbo captures the texture of chalk well but fails on the environment and misspells a primary menu item. Nano Banana Pro is the clear winner for its superior composition and adherence to the overall aesthetic.
The Capybara Taxi Driver
Text-to-Image“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”
AI Judge Analysis
Nano Banana Pro
- + Excellent grit and photorealistic texture for a rainy NYC night atmosphere.
- + Comprehensive taxi details including the meter, dashboard, and raindrops on glass.
- + Accurate positioning of the capybara's paws on the steering wheel.
- − The capybara's scale relative to the car interior is slightly too large.
- − The passenger is squeezed into the corner of the frame.
Z-Image Turbo
- + Clean, high-quality rendering of the characters' faces and textures.
- + Good adherence to the clothing description for both the capybara and the businesswoman.
- − The 'steering wheel' is floating or disconnected from a steering column, with the capybara's hand not actually gripping it.
- − The background lacks the characteristic 'Manhattan night' depth and lights requested in the prompt.
- − General lack of realistic taxi interior props like a meter or partition.
Verdict: Nano Banana Pro wins for its superior atmospheric storytelling and attention to environmental detail, making the scene feel like a real NYC taxi. While Z-Image Turbo has clean character renders, it fails on basic spatial logic with a floating steering wheel and a generic background that doesn't capture the Manhattan setting.
The Halloween Invitation
Text-to-Image“Vintage gothic Halloween party invitation. Dark parchment poster, spooky border with webs and thorns, central glowing jack-o-lantern, bats, twisted trees, moody night sky. Add elegant gothic title text saying "Halloween Party Invitation", a small scroll banner saying "You are invited to a night of frights", and event details at the bottom: Date: 30.10.2026 Time: 7pm Location: The Arches, NYC Spooky but polished, cinematic lighting, square format.”
AI Judge Analysis
Nano Banana Pro
- + Excellent typography rendering with perfect spelling.
- + Stronger cinematic lighting and atmosphere with a cohesive color palette.
- + Superior composition where all elements (webs, thorns, bats) feel integrated into the scene.
- − The 'scroll banner' is more of a floating ribbon than a traditional architectural scroll.
Z-Image Turbo
- + Good use of the parchment texture requested in the prompt.
- + Clear central jack-o-lantern with a classic carved face.
- − Spelling error in the location text ('The Archves' instead of 'The Arches').
- − The border elements like the thorns and webs feel like flat clip-art overlaid on the scene.
- − The small scroll banner text is placed at the very top, not as a banner for the specific phrase requested.
Verdict: Nano Banana Pro produced a vastly superior result with professional-grade typography and atmospheric lighting. While Z-Image Turbo followed the parchment request literally, it suffered from spelling errors and a fragmented composition that lacked the cinematic quality of the first image.
Bald man challenge
Image Editing“Give the person a full, thick head of natural hair with realistic texture, density, and a natural hairline. Preserve facial features and lighting.”
AI Judge Analysis
Nano Banana Pro
- + Successfully added a full, thick head of hair with natural texture.
- + Preserved the facial features, glasses, and background perfectly.
- + The lighting on the hair matches the original scene lighting.
- − The hairline is slightly high, though it looks intentional and realistic.
Z-Image Turbo
- + Maintains the overall character and environment.
- − Failed to provide a 'full, thick head of hair', instead adding a buzz cut.
- − Removed the subject's glasses.
- − The background and clothing details have shifted significantly compared to the source.
Verdict: Gemini 3 Pro Image Preview perfectly executed the edit by adding realistic hair while keeping every other detail of the source image, including the glasses and background, identical. Z-Image Turbo failed the prompt by only adding a short buzz cut and also made several unwanted changes, such as removing the glasses and altering the background.
Isometric Miniature Diorama Scenes
Text-to-Image“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”
AI Judge Analysis
Nano Banana Pro
- + Excellent text rendering with clean, professional typography.
- + Accurate Japanese flag icon matches the context.
- + High-quality 3D assets with realistic wood and ceramic textures.
- − The steam effect is slightly inconsistent with the 'miniature' 3D cartoon style.
Z-Image Turbo
- + Strong 'cartoon' aesthetic with soft, clay-like textures.
- + Clean isometric composition.
- − Incorrect flag icon (shows the flag of China instead of Japan).
- − The text rendering is slightly less refined and less centered than the competitor.
- − The sushi roll features a green element (wasabi/avocado) in an unusual location inside the rice block.
Verdict: Gemini 3 Pro Image Preview is the clear winner as it followed all instructions, including the correct flag for Japan and high-quality text layout. Z-Image Turbo failed a critical cultural context check by displaying the Chinese flag for a Japanese-themed prompt and had less sophisticated detailing on the diorama elements.
Adorable Baby Animals in Sunny Meadow
Text-to-Image“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”
AI Judge Analysis
Nano Banana Pro
- + Perfect adherence to the requested animal count and types
- + Excellent fur texture and sharp eye details
- + Captures the 'god rays' and 'chasing' motion perfectly
- − Lighting is slightly over-processed, giving it a digital art feel more than a photograph
Z-Image Turbo
- + Soft, naturalistic lighting that feels more like a real photograph
- + High-quality fur rendering on the puppy and fox
- − Anatomy issues with the bunny, which has three ears
- − Cluttered composition with animals overlapping awkwardly
Verdict: Gemini 3 Pro Image Preview follows the prompt much more accurately, including all four distinct animals with clear, dynamic expressions and better anatomical correctness. Z-Image Turbo creates a more realistic photographic texture, but fails on composition and details, notably giving the rabbit a third ear and merging the kitten and fox too closely together.
Vintage Cafe Logo
Text-to-Image“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”
AI Judge Analysis
Nano Banana Pro
- + Excellent adherence to the 'vintage' and 'cloche dome with steam' prompt details.
- + Very high-quality logo illustration with professional cross-hatching and shading.
- + Accurate and clean rendering of all requested text including the accent on 'Caffè'.
- − Slightly more complex than a standard 'minimalist' logo, though fitting for the vintage theme.
Z-Image Turbo
- + Leaner, more modern minimalist interpretation of the vector style.
- + Clean text rendering with no spelling errors.
- + Good use of professional negative space in the cloche design.
- − The steam effect is very small and lacks the 'vintage' flair requested.
- − The 'banner' for the date is quite flat compared to the requested classic style.
- − The typography for 'Caffè' is slightly less elegant than Model A.
Verdict: Gemini 3 Pro Image Preview perfectly captures the requested vintage aesthetic, providing a beautifully detailed cloche and steam illustration that feels like a real historical brand. While Z-Image Turbo is more minimalist, it lacks the character and artistic depth requested by the 'vintage' and 'classic' keywords in the prompt. Gemini 3 Pro is the clear winner for its superior illustration quality and better adherence to the specific visual elements like the ornate steam and banner.
Explore each model
Tongyi-MAI's 6-billion parameter distilled text-to-image model optimized for speed, achieving high-quality generation in 8 steps or fewer with support for bilingual text rendering