Nano Banana vs Z-Image Turbo

Head-to-head across 12 challenges

Nano Banana

40.0%

win rate

Ties

13.3%

Z-Image Turbo

46.7%

win rate

40.0% 13.3% ties 46.7%

Challenge Results

Geometric Composition

Text-to-Image

“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”

Nano Banana
Z-Image Turbo
0% wins 50% ties 50% wins

AI Judge Analysis

Nano Banana

  • + Excellent lighting and atmosphere with dust motes and soft shadows.
  • + The plant is clearly visible through the glass cube as requested.
  • + High visual quality with realistic textures on the book and wooden table.
  • The blue sphere is floating unnaturally in the center of the cube.
  • The scale of the plant in the background makes it dominant over the subject.

Z-Image Turbo

  • + Realistic glass construction with visible seams and base reflections.
  • + Logical physics with the sphere resting on the bottom of the cube.
  • + Clear adherence to all spatial requirements of the prompt.
  • The green plant is extremely blurred and barely recognizable in the background.
  • The lighting is somewhat flat compared to the other model.

Verdict: Both models followed the prompt instructions well, but Nano Banana created a much more aesthetically pleasing and artistic image with superior lighting and texture. While Nano Banana's sphere is floating, Z-Image Turbo's plant is so out of focus that it fails to satisfy the 'partially visible through the glass' requirement as effectively as Nano Banana.

Candid Street Photography

Text-to-Image

“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”

Nano Banana
Z-Image Turbo
50% wins 0% ties 50% wins

AI Judge Analysis

Nano Banana

  • + Excellent atmosphere with cinematic lighting and reflections.
  • + Accurately depicts the action of repairing with tools and a kneeling pose.
  • + Strong adherence to the 'imperfect framing' and 'shallow depth of field' requirements.
  • The bicycle frame geometry is slightly warped/incoherent.
  • The background cars lack the specific 'motion blur' requested, appearing mostly stationary even if out of focus.

Z-Image Turbo

  • + Good skin texture and natural lighting.
  • + Captures a candid feel with a very realistic, unstylized aesthetic.
  • Does not show the man 'repairing' the bike; he is simply holding the handlebars.
  • Background car is sharp rather than having 'motion blur'.
  • Composition is very tight and lacks the requested cinematic depth.

Verdict: Nano Banana followed the prompt much more effectively, capturing the 'repairing' action and the atmospheric conditions like reflections and rain. While Z-Image Turbo has a very realistic skin texture, it failed at the core action of the prompt and ignored the 'motion blur' and 'cinematic' keywords.

Fantasy Warrior

Text-to-Image

“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”

Nano Banana
Z-Image Turbo
33% wins 0% ties 67% wins

AI Judge Analysis

Nano Banana

  • + Excellent depiction of ornate, engraved plate armor with high-contrast detailing.
  • + Strong character portrait with very lifelike skin texture and sharp facial features.
  • + Clear bokeh effect with distinct background sparks.
  • Missed the request for beads in the hair braids.
  • The lighting feels slightly more studio-generated than natural torchlight.

Z-Image Turbo

  • + Accurately included the small beads within the braided hair.
  • + Excellent implementation of warm torchlight reflecting off the metal surfaces.
  • + The facial dirt and 'battle-worn' aesthetic feel more organic and less like surgical scars.
  • Armor texture and engravings are slightly softer and less distinct than Model A.
  • Overall image resolution or sharpness appears slightly lower.

Verdict: Nano Banana excels in rendering intricate armor details and crisp facial features, but it fails to include the requested beads in the braids. Z-Image Turbo adheres better to the specific prompt details like the hair beads and provides a more atmospheric lighting setup that truly feels like torchlight, making it the more faithful interpretation of the prompt.

Modern Clean Menu

Text-to-Image

“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”

Nano Banana
Z-Image Turbo
0% wins 0% ties 100% wins

AI Judge Analysis

Nano Banana

  • + Clean, professional layout with a consistent grid of food photos.
  • + Accurate food photography representation for various categories like pizza, salmon, and pasta.
  • + Good use of colorful accents through colored photo borders and section headings.
  • Several typos in the text, including 'Appeitiers' and 'Brusuechta'.
  • Logic issues in the menu text, such as a 'NY Strip' being described as 'vanilla bean ice cream'.

Z-Image Turbo

  • + Excellent visual quality on the food images with high clarity and appetizing appeal.
  • + Strong modern aesthetic with bold typography and a clear visual hierarchy.
  • + Better adherence to the 'professional layout' aspect of the prompt.
  • Nonsensical text throughout, including the prominent 'PIZZA MANS' header.
  • Missing 'Mains' section, instead using filler text like 'SE IIION'.

Verdict: Nano Banana followed the layout requirements well and produced more legible (though misspelled) text, whereas Z-Image Turbo produced significantly higher quality food photography and a more stylish graphic design. While Nano Banana's grid is more orderly, Z-Image Turbo captures the 'modern minimalist/casual dining' aesthetic more effectively despite the nonsensical text.

Magic Burger Explosion: Fiery Photorealism Challenge

Text-to-Image

“Ad for 'Magic Burger'. Dynamic, exploded burger with all components (bun, patty, cheese, lettuce, tomato, sauce) suspended in mid-air. Emphasize photorealistic detail and a sense of motion. Dark, fiery background with glowing embers. Integrate text: 'MAGIC BURGER' as a prominent title, 'LIMITED TIME ONLY' as a secondary message, and '€6.99' in a starburst, all rendered with a fiery, glowing effect.”

Nano Banana
Z-Image Turbo

AI Judge Analysis

Nano Banana

  • + Excellent adherence to the 'exploded' concept with all ingredients clearly suspended.
  • + Perfect text rendering for all requested strings, including the specific currency symbol.
  • + Strong composition with a sense of motion and a cohesive fiery theme.
  • The '€6.99' text is slightly small compared to the other elements.
  • The bottom bun looks a bit flat in perspective.

Z-Image Turbo

  • + High-quality food photography aesthetics with vibrant colors.
  • + Strong glowing effect on the starburst and primary title text.
  • Failed to create an 'exploded' view, providing a fully assembled burger instead.
  • Repeated 'BURGER' in the title ('MAGIC BURGER BURGER').
  • The starburst element is slightly cluttered with overlapping fire effects.

Verdict: Nano Banana followed the layout instructions much more closely, accurately depicting the exploded burger view and precise text strings requested. Z-Image Turbo produced a high-quality image but failed the core 'exploded' prompt requirement and included a typo in the title.

Chalkboard Menu

Text-to-Image

“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”

Nano Banana
Z-Image Turbo

AI Judge Analysis

Nano Banana

  • + Excellent adherence to all text requirements including the full third menu item.
  • + Realistic chalk texture with smears and dust details on the board.
  • + Beautiful cursive header as requested in the prompt.
  • The text style looks slightly more like a digital font than organic handwriting.
  • Background is highly blurred, reducing the 'cozy cafe' environment details.

Z-Image Turbo

  • + The handwriting style feels very organic and manually lettered.
  • + Strong contrast makes the text very easy to read.
  • + Accurate chalk-style rendering.
  • Includes a spelling error ('Mustroom' instead of Mushroom).
  • Layout is cramped, causing the last two items to wrap poorly.
  • Does not show the cafe environment, focusing only on the board.

Verdict: Nano Banana followed the prompt instructions much better, correctly including all specific menu items and the exact date without spelling errors. While Z-Image Turbo captures a slightly more authentic 'handwritten' feel, it fails on text accuracy and layout by misspelling 'Mushroom' and struggling to fit the text on the board.

The Capybara Taxi Driver

Text-to-Image

“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”

Nano Banana
Z-Image Turbo

AI Judge Analysis

Nano Banana

  • + Excellent photorealism with cinematic lighting and reflections on the glass.
  • + The businesswoman's bored expression perfectly matches the prompt's tone.
  • + Composition effectively captures the gritty New York taxi atmosphere.
  • The capybara's 'front paws' look a bit too much like human hands in gloves.
  • The businesswoman is sitting in the front passenger seat instead of the back seat.

Z-Image Turbo

  • + Successfully placed the businesswoman in the back seat as requested.
  • + The capybara's expression is very frontal and professional.
  • + Clean texture on the capybara's fur and the taxi cap.
  • The paws are floating and not actually gripping the steering wheel.
  • The background lighting is quite sparse and doesn't fully capture the 'night in Manhattan' feel compared to the other image.
  • The businesswoman's hands and phone interaction look slightly distorted.

Verdict: Nano Banana captures a far more realistic and atmospheric scene with superior textures and lighting, though it failed the spatial instruction of placing the passenger in the back seat. Z-Image Turbo followed the placement instructions better but suffered from significant technical issues like floating paws and a less convincing environment. Nano Banana is the winner for its high visual quality and the perfectly captured 'bored' expression of the passenger.

The Halloween Invitation

Text-to-Image

“Vintage gothic Halloween party invitation. Dark parchment poster, spooky border with webs and thorns, central glowing jack-o-lantern, bats, twisted trees, moody night sky. Add elegant gothic title text saying "Halloween Party Invitation", a small scroll banner saying "You are invited to a night of frights", and event details at the bottom: Date: 30.10.2026 Time: 7pm Location: The Arches, NYC Spooky but polished, cinematic lighting, square format.”

Nano Banana
Z-Image Turbo

AI Judge Analysis

Nano Banana

  • + Excellent text rendering with no spelling errors.
  • + Sophisticated cinematic lighting and atmospheric depth.
  • + Perfect adherence to all prompt elements including the specific date and location.
  • The scrolls at the bottom are slightly disconnected from each other.

Z-Image Turbo

  • + Strong vintage parchment aesthetic.
  • + Clear, legible gothic font for the title.
  • + Good use of the thorn and web border theme.
  • Typographical error in the location ('The Archves' instead of 'The Arches').
  • The small banner text is placed at the top and lacks a scroll graphic around the words.
  • Composition is a bit cluttered with multiple torn paper effects overlapping.

Verdict: Nano Banana is the clear winner as it followed every instruction perfectly, including rendering the text with 100% accuracy. While Z-Image Turbo captured a nice vintage feel, it failed on the location spelling and placed the secondary banner text without the requested scroll banner graphic.

Bald man challenge

Image Editing
Edit instruction

“Give the person a full, thick head of natural hair with realistic texture, density, and a natural hairline. Preserve facial features and lighting.”

Before After
Nano Banana
Before After
Z-Image Turbo
75% wins 25% ties 0% wins

AI Judge Analysis

Nano Banana

  • + Successfully added a full, thick head of hair as requested.
  • + Preserved the original facial features and glasses almost perfectly.
  • + Hair texture and lighting match the original scene well.
  • The hair merges slightly awkwardly with the frames of the glasses.
  • Small artifacts around the ears where the hair meets the skin.

Z-Image Turbo

  • + Preserved the general aesthetic of the background.
  • Failed the primary edit instruction; the person still appears largely bald/shaved.
  • Altered the facial features significantly, removing the glasses and changing the face shape.
  • Failed to preserve the source image identity.

Verdict: Nano Banana followed the instructions perfectly, providing a realistic and thick head of hair while maintaining the person's identity and original glasses. Z-Image Turbo failed to add the requested hair and completely changed the person's face, removing their glasses and losing the source preservation.

Isometric Miniature Diorama Scenes

Text-to-Image

“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”

Nano Banana
Z-Image Turbo
0% wins 0% ties 100% wins

AI Judge Analysis

Nano Banana

  • + Perfect adherence to the Japanese flag icon request.
  • + Excellent layout of diverse sushi types on the diorama base.
  • + Extremely clean and bold text rendering that matches the prompt.
  • The textures are more plastic-like than 'realistic PBR' materials.

Z-Image Turbo

  • + Good realistic subsurface scattering and material textures on the fish.
  • + Followed the 45-degree isometric camera angle well.
  • Incorrectly used the flag of China instead of Japan.
  • The text 'SUSHI' is slightly off-center compared to 'JAPAN'.
  • The diorama base has odd clipping/layering at the bottom left.

Verdict: Nano Banana is the clear winner as it correctly rendered the Japanese flag and provided a professional, balanced composition of various sushi pieces. Z-Image Turbo failed a critical part of the prompt by including the flag of China for a prompt specifically requesting Japanese cultural items, and the text layout was less polished.

Adorable Baby Animals in Sunny Meadow

Text-to-Image

“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”

Nano Banana
Z-Image Turbo

AI Judge Analysis

Nano Banana

  • + Excellent adherence to the lighting request with visible god rays and dew sparkles.
  • + Superior rendering of fur texture and high-resolution details on all four animals.
  • + Well-composed scene with a clear 'masterpiece' aesthetic and magical atmosphere.
  • The animals look slightly more 'posed' as portraits rather than actively tumbling.
  • The kitten's anatomy seems slightly flatter compared to the fox and puppy.

Z-Image Turbo

  • + Captures the 'tumbling' and 'playful chasing' aspect of the prompt more dynamically.
  • + The kitten has a very expressive, joyful facial expression.
  • Resolution is noticeably lower with blurred textures and less detail in the fur.
  • Missing the specific 'god rays' requested in the lighting prompt.
  • Artifacts are visible on the butterflies and the puppy's paw.

Verdict: Nano Banana is the clear winner due to its high visual fidelity, meeting the '8K masterpiece' requirement with intricate fur detail and beautiful atmospheric lighting (god rays and dew). While Z-Image Turbo captured the movement of the scene better, it failed on technical quality, appearing blurry and lacking the specific lighting elements requested.

Vintage Cafe Logo

Text-to-Image

“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”

Nano Banana
Z-Image Turbo
50% wins 0% ties 50% wins

AI Judge Analysis

Nano Banana

  • + Perfect text rendering for both the name and the date banner.
  • + Higher adherence to the 'vector emblem' style with the circular border.
  • + Elegant integration of the steam inside the cloche graphic.
  • The 'steam' lines are a bit thick compared to the overall minimalist aesthetic.

Z-Image Turbo

  • + Clean, minimalist layout that is easy to read.
  • + Accurate adherence to color scheme and theme.
  • Incorrectly spelled 'Caffè' as 'Caffé' (wrong accent direction).
  • The 'Est. 1720' text is slightly off-center within the banner.
  • The steam icon floating above the cloche is a bit disconnected.

Verdict: Nano Banana is the clear winner as it perfectly executed the text rendering, including the specific Italian grave accent in 'Caffè' and the banner text. While Z-Image Turbo produced a decent minimalist design, it failed on the spelling detail and the overall composition felt less cohesive than Nano Banana's emblem style.

Nano Banana

Gemini 2.5 Flash Image is optimized for image understanding and generation, offering a balance of price and performance with fast and efficient image generation and editing capabilities.

Z-Image Turbo

Tongyi-MAI's 6-billion parameter distilled text-to-image model optimized for speed, achieving high-quality generation in 8 steps or fewer with support for bilingual text rendering