Nano Banana Pro vs Qwen Image 2512
Head-to-head across 6 challenges
Nano Banana Pro
66.7%
win rate
Ties
0.0%
Qwen Image 2512
33.3%
win rate
Challenge Results
Geometric Composition
Text-to-Image“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”
AI Judge Analysis
Nano Banana Pro
- + Perfect adherence to all spatial instructions
- + Highly realistic lighting and photographic texture
- + Naturalistic plant and book details
- − The sphere is quite small relative to the cube
Qwen Image 2512
- + Vibrant colors and clean composition
- + The blue sphere is very prominent
- − The cube is not shaped like a cube (rectangular/low aspect ratio)
- − Reflections inside the glass are physically inconsistent and cluttered
- − The book's placement looks slightly floating or poorly integrated with the glass edge
Verdict: Gemini 3 Pro Image Preview captures the scene with much higher realism and better adherence to the 'cube' geometry. While Qwen Image 2512 produces a bright and clear image, the glass container is a rectangle rather than a cube, and the internal reflections are distracting and illogical. Gemini's lighting from the window feels more authentic and creates a better sense of depth.
Candid Street Photography
Text-to-Image“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”
AI Judge Analysis
Nano Banana Pro
- + Excellent adherence to the 'candid' and 'imperfect framing' prompts, creating a very believable street photography aesthetic.
- + Highly realistic skin texture, clothing details, and environment that avoids an 'AI look'.
- + Effective use of reflections on the wet pavement and realistic rain effects.
- − The bicycle components are socially fragmented, with spokes and frames intersecting poorly near the rear wheel.
- − Minor motion blur on cars is less pronounced than requested.
Qwen Image 2512
- + Good implementation of shallow depth of field with a bokeh effect on background headlights.
- + Clear focus on the subject's face with decent natural skin texture.
- + The bicycle's red color is vibrant and matches the prompt requirements.
- − The subject is looking directly at the camera, failing the 'candid' and 'repairing' aspect of the prompt.
- − The bicycle construction is physically impossible, with major structural errors in the frame and pedal placement.
- − Lacks the 'imperfect framing' and 'light rain' atmosphere requested, appearing more like a staged portrait.
Verdict: Gemini 3 Pro much more successfully captures the requested atmosphere of a candid street photo, utilizing an off-center composition and a believable rainy environment. While Qwen Image 2512 produces a sharp portrait, it ignores the 'repairing' and 'candid' instructions, and the anatomical/mechanical errors in its bicycle are far more distracting than Gemini's. Gemini 3 Pro feels grounded in reality, whereas Qwen feels like a typical AI-generated studio portrait.
Modern Clean Menu
Text-to-Image“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”
AI Judge Analysis
Nano Banana Pro
- + Excellent typography with clear, legible text and distinct sections.
- + High-quality, realistic food photography photography that matches the menu descriptions.
- + Followed the prompt exactly with separate Appetizers, Pizza, and Mains sections.
- − Text contains 'llorem ipsum' style nonsense words despite being legible.
- − Repetitive item names (e.g., 'Bruschetta' listed four times).
Qwen Image 2512
- + Clean grid layout for photos.
- + Consistent aesthetic across the food imagery.
- − Text is garbled and largely illegible with significant artifacts.
- − Layout is less professional with narrow text columns that are difficult to read.
- − Missed the specific 'Appetizers/Pizza/Mains' sectioning, merging them into 'Appetimers' and 'Pizza/Means'.
Verdict: Gemini 3 Pro Image Preview is the clear winner as it produced a highly professional, functional menu layout with legible typography and perfectly rendered food photos. While Qwen Image 2512 followed the grid request, its text rendering and overall composition are significantly lower in quality.
Isometric Miniature Diorama Scenes
Text-to-Image“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”
AI Judge Analysis
Nano Banana Pro
- + Excellent adherence to the '45° top-down isometric' instruction.
- + Superior PBR materials, showing realistic wood grain, ceramic glaze, and rice textures.
- + Very clean typography and flag icon that looks professional and well-integrated.
- − The composition is slightly high in the frame, making the bottom feel a bit empty.
Qwen Image 2512
- + Captures the 'miniature 3D cartoon' style well with soft, rounded shapes.
- + Balanced composition that fills the square format effectively.
- + Vibrant colors that make the food look appealing.
- − The text 'JAPAN' has some minor kerning/alignment issues and the 'N' is slightly malformed.
- − The perspective is more of a standard 3/4 view rather than a strict isometric 45° angle.
- − The flag icon is a simple rectangle and lacks the polished feel of Model A.
Verdict: Gemini 3 Pro Image Preview is the winner as it perfectly executed the technical requirements of the prompt, specifically the isometric perspective and the PBR material textures. While Qwen Image 2512 created a charming cartoon scene, Gemini 3 Pro provided much higher clarity in its typography and more realistic material rendering for the wood and food.
Adorable Baby Animals in Sunny Meadow
Text-to-Image“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”
AI Judge Analysis
Nano Banana Pro
- + Captures the 'playfully chasing' and 'tumbling' aspect of the prompt with dynamic poses.
- + Excellent fur texture and lighting on the animals, particularly the golden retriever and fox.
- + Well-composed scene with a clear sense of movement and interaction with the butterfly.
- − The kitten's facial features look slightly more caricatured and less photorealistic than the other animals.
- − Includes only one butterfly despite the plural 'butterflies' in the prompt.
Qwen Image 2512
- + Includes multiple butterflies as requested in the prompt.
- + Beautiful lighting and dew sparkle effects that create a very dreamy atmosphere.
- + High level of fine detail in the fur and whiskers of all four animals.
- − The animals are posing for a portrait rather than 'playfully chasing and tumbling'.
- − Anatomical blending issue where the fox's body seems to morph into the puppy's paws.
Verdict: Gemini 3 Pro much better captures the requested action of the prompt, showing the animals actively running and playing in a dynamic composition. While Qwen Image 2512 has a more serene and detailed aesthetic with beautiful lighting, it fails the 'tumbling/chasing' part of the prompt by presenting a static group portrait.
Vintage Cafe Logo
Text-to-Image“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”
AI Judge Analysis
Nano Banana Pro
- + Perfect text rendering including the 'è' accent.
- + Clean vector-style illustration that fits the minimalist request.
- + Strong adherence to the requested layout with the banner positioned over the illustration.
- − The steam effect is a bit simple compared to model B.
Qwen Image 2512
- + Excellent artistic shading and texture on the cloche.
- + More dynamic and detailed steam effects.
- + Classic, elegant script typography.
- − Missed the 'è' accent in 'Caffè'.
- − Slightly less 'minimalist' than requested due to heavy shading and complex steam.
Verdict: Gemini 3 Pro Image Preview captures the prompt's requirements more accurately, specifically concerning the correct spelling of 'Caffè' and the requested minimalist vector style. While Qwen Image 2512 offers more impressive illustrative detail and a more sophisticated aesthetic, it failed on a key textual detail and is less minimalist in its execution.
Nano Banana Pro
Gemini 3 Pro with image generation capabilities. Combines advanced reasoning with the ability to generate and edit images.
Qwen Image 2512
Improved version of Alibaba's Qwen image model with better text rendering, finer natural textures, and more realistic human generation.