Head to head
Esc

Models · slot A

to navigate to pick

Nano Banana Pro Google Qwen Image 2512 Alibaba

Settled by community votes across 8 shared challenges, with an AI judge weighing in on each.

Nano Banana Pro

28.2 arena score

#2 of 44 in Text-to-Image

Best Image Editing right now Top 2 in Text-to-Image
Skill signature

Not enough comparable category data

The chart appears once both models have ratings across at least three shared arena categories.

Qwen Image 2512

22.4 arena score

#26 of 44 in Text-to-Image

Vote tally

Where the votes landed

Nano Banana Pro

62.5%

win rate

Ties

0.0%

Qwen Image 2512

37.5%

win rate

62.5% 0.0% ties 37.5%
Shared challenges 8

Challenge by challenge

The strongest take from each model on every shared challenge, with the AI judge's read.

Geometric Composition

Text-to-Image

“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”

Nano Banana Pro
Qwen Image 2512
67% wins 0% ties 33% wins

AI Judge Analysis

Nano Banana Pro

  • + Perfect adherence to all spatial instructions
  • + Highly realistic lighting and photographic texture
  • + Naturalistic plant and book details
  • The sphere is quite small relative to the cube

Qwen Image 2512

  • + Vibrant colors and clean composition
  • + The blue sphere is very prominent
  • The cube is not shaped like a cube (rectangular/low aspect ratio)
  • Reflections inside the glass are physically inconsistent and cluttered
  • The book's placement looks slightly floating or poorly integrated with the glass edge

Verdict: Gemini 3 Pro Image Preview captures the scene with much higher realism and better adherence to the 'cube' geometry. While Qwen Image 2512 produces a bright and clear image, the glass container is a rectangle rather than a cube, and the internal reflections are distracting and illogical. Gemini's lighting from the window feels more authentic and creates a better sense of depth.

Candid Street Photography

Text-to-Image

“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”

Nano Banana Pro
Qwen Image 2512
100% wins 0% ties 0% wins

AI Judge Analysis

Nano Banana Pro

  • + Excellent adherence to the 'candid' and 'imperfect framing' prompts, creating a very believable street photography aesthetic.
  • + Highly realistic skin texture, clothing details, and environment that avoids an 'AI look'.
  • + Effective use of reflections on the wet pavement and realistic rain effects.
  • The bicycle components are socially fragmented, with spokes and frames intersecting poorly near the rear wheel.
  • Minor motion blur on cars is less pronounced than requested.

Qwen Image 2512

  • + Good implementation of shallow depth of field with a bokeh effect on background headlights.
  • + Clear focus on the subject's face with decent natural skin texture.
  • + The bicycle's red color is vibrant and matches the prompt requirements.
  • The subject is looking directly at the camera, failing the 'candid' and 'repairing' aspect of the prompt.
  • The bicycle construction is physically impossible, with major structural errors in the frame and pedal placement.
  • Lacks the 'imperfect framing' and 'light rain' atmosphere requested, appearing more like a staged portrait.

Verdict: Gemini 3 Pro much more successfully captures the requested atmosphere of a candid street photo, utilizing an off-center composition and a believable rainy environment. While Qwen Image 2512 produces a sharp portrait, it ignores the 'repairing' and 'candid' instructions, and the anatomical/mechanical errors in its bicycle are far more distracting than Gemini's. Gemini 3 Pro feels grounded in reality, whereas Qwen feels like a typical AI-generated studio portrait.

Modern Clean Menu

Text-to-Image

“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”

Nano Banana Pro
Qwen Image 2512
100% wins 0% ties 0% wins

AI Judge Analysis

Nano Banana Pro

  • + Excellent typography with clear, legible text and distinct sections.
  • + High-quality, realistic food photography photography that matches the menu descriptions.
  • + Followed the prompt exactly with separate Appetizers, Pizza, and Mains sections.
  • Text contains 'llorem ipsum' style nonsense words despite being legible.
  • Repetitive item names (e.g., 'Bruschetta' listed four times).

Qwen Image 2512

  • + Clean grid layout for photos.
  • + Consistent aesthetic across the food imagery.
  • Text is garbled and largely illegible with significant artifacts.
  • Layout is less professional with narrow text columns that are difficult to read.
  • Missed the specific 'Appetizers/Pizza/Mains' sectioning, merging them into 'Appetimers' and 'Pizza/Means'.

Verdict: Gemini 3 Pro Image Preview is the clear winner as it produced a highly professional, functional menu layout with legible typography and perfectly rendered food photos. While Qwen Image 2512 followed the grid request, its text rendering and overall composition are significantly lower in quality.

Chalkboard Menu

Text-to-Image

“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”

Nano Banana Pro
Qwen Image 2512

AI Judge Analysis

Nano Banana Pro

  • + Excellent chalk texture on the board surface with realistic smudges and scratches.
  • + Accurate spelling on all items including complex words like 'Risotto'.
  • + High-quality environment rendering with realistic depth of field and lighting.
  • The handwriting style is a bit inconsistent, with the title feeling more like a font than the other items.

Qwen Image 2512

  • + Beautiful, consistent cursive calligraphy that looks genuinely hand-lettered.
  • + Excellent layout that fills the board frame well.
  • + Strong adherence to the request for natural variation in letter size and slant.
  • Spelling error in 'Risotto' (spelled 'Risitto').
  • The chalk texture on the letters looks a bit too clean and digital compared to the board background.

Verdict: Nano Banana Pro produced a more realistic environment and nailed the difficult spelling of the menu items, though its handwriting was slightly less elegant. Qwen Image 2512 featured much more beautiful and artistic handwriting that better fit the 'elegant cursive' prompt, but suffered from a spelling error in a key menu item. Nano Banana Pro is the winner for overall accuracy and professional execution.

The Capybara Taxi Driver

Text-to-Image

“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”

Nano Banana Pro
Qwen Image 2512

AI Judge Analysis

Nano Banana Pro

  • + Excellent photorealism in the textures of the capybara's fur and the taxi interior.
  • + Highly realistic lighting and depth of field, especially the raindrops on the windshield.
  • + Accurate passenger expression and realistic interaction with her phone.
  • The capybara's paws look slightly more like bird talons or small claws than capybara feet.

Qwen Image 2512

  • + The taxi driver cap is more detailed and visually distinct as a uniform.
  • + Good adherence to the pose prompt with both paws clearly gripping the steering wheel.
  • + The passenger is well-framed and clearly matches the requested bored expression.
  • The lighting is a bit flat compared to Model A, making it feel slightly less photorealistic.
  • The capybara's face looks a bit more symmetrical/digital and less natural than in Model A.

Verdict: Both models followed the prompt very well, but Nano Banana Pro stands out for its superior photorealism, particularly in the lighting and the atmospheric details like the rain on the windshield. While Qwen Image 2512 provided a more formal-looking cap, Nano Banana Pro's overall composition and texture quality make it feel like a real photograph of a surreal moment.

Isometric Miniature Diorama Scenes

Text-to-Image

“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”

Nano Banana Pro
Qwen Image 2512

AI Judge Analysis

Nano Banana Pro

  • + Excellent adherence to the '45° top-down isometric' instruction.
  • + Superior PBR materials, showing realistic wood grain, ceramic glaze, and rice textures.
  • + Very clean typography and flag icon that looks professional and well-integrated.
  • The composition is slightly high in the frame, making the bottom feel a bit empty.

Qwen Image 2512

  • + Captures the 'miniature 3D cartoon' style well with soft, rounded shapes.
  • + Balanced composition that fills the square format effectively.
  • + Vibrant colors that make the food look appealing.
  • The text 'JAPAN' has some minor kerning/alignment issues and the 'N' is slightly malformed.
  • The perspective is more of a standard 3/4 view rather than a strict isometric 45° angle.
  • The flag icon is a simple rectangle and lacks the polished feel of Model A.

Verdict: Gemini 3 Pro Image Preview is the winner as it perfectly executed the technical requirements of the prompt, specifically the isometric perspective and the PBR material textures. While Qwen Image 2512 created a charming cartoon scene, Gemini 3 Pro provided much higher clarity in its typography and more realistic material rendering for the wood and food.

Adorable Baby Animals in Sunny Meadow

Text-to-Image

“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”

Nano Banana Pro
Qwen Image 2512

AI Judge Analysis

Nano Banana Pro

  • + Captures the 'playfully chasing' and 'tumbling' aspect of the prompt with dynamic poses.
  • + Excellent fur texture and lighting on the animals, particularly the golden retriever and fox.
  • + Well-composed scene with a clear sense of movement and interaction with the butterfly.
  • The kitten's facial features look slightly more caricatured and less photorealistic than the other animals.
  • Includes only one butterfly despite the plural 'butterflies' in the prompt.

Qwen Image 2512

  • + Includes multiple butterflies as requested in the prompt.
  • + Beautiful lighting and dew sparkle effects that create a very dreamy atmosphere.
  • + High level of fine detail in the fur and whiskers of all four animals.
  • The animals are posing for a portrait rather than 'playfully chasing and tumbling'.
  • Anatomical blending issue where the fox's body seems to morph into the puppy's paws.

Verdict: Gemini 3 Pro much better captures the requested action of the prompt, showing the animals actively running and playing in a dynamic composition. While Qwen Image 2512 has a more serene and detailed aesthetic with beautiful lighting, it fails the 'tumbling/chasing' part of the prompt by presenting a static group portrait.

Vintage Cafe Logo

Text-to-Image

“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”

Nano Banana Pro
Qwen Image 2512
0% wins 0% ties 100% wins

AI Judge Analysis

Nano Banana Pro

  • + Perfect text rendering including the 'è' accent.
  • + Clean vector-style illustration that fits the minimalist request.
  • + Strong adherence to the requested layout with the banner positioned over the illustration.
  • The steam effect is a bit simple compared to model B.

Qwen Image 2512

  • + Excellent artistic shading and texture on the cloche.
  • + More dynamic and detailed steam effects.
  • + Classic, elegant script typography.
  • Missed the 'è' accent in 'Caffè'.
  • Slightly less 'minimalist' than requested due to heavy shading and complex steam.

Verdict: Gemini 3 Pro Image Preview captures the prompt's requirements more accurately, specifically concerning the correct spelling of 'Caffè' and the requested minimalist vector style. While Qwen Image 2512 offers more impressive illustrative detail and a more sophisticated aesthetic, it failed on a key textual detail and is less minimalist in its execution.

Next steps

Explore each model