Gemini 2.5 Flash Image is optimized for image understanding and generation, offering a balance of price and performance with fast and efficient image generation and editing capabilities.
Settled by community votes across 8 shared challenges, with an AI judge weighing in on each.
Nano Banana
#20 of 44 in Text-to-Image
Qwen Image 2512
#26 of 44 in Text-to-Image
Where the votes landed
Nano Banana: 55.6% win rate
Ties: 11.1%
Qwen Image 2512: 33.3% win rate
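As a rough illustration of how shares like these are derived, the sketch below turns raw vote counts into rounded percentages. The page does not publish the underlying counts, so the 5 / 1 / 3 tally used here is purely hypothetical (it is simply one split that rounds to the figures shown), and the function name `vote_shares` is an assumption made for this example.

```python
def vote_shares(wins_a: int, ties: int, wins_b: int) -> dict[str, float]:
    """Return each outcome's share of all votes as a percentage, rounded to 0.1."""
    total = wins_a + ties + wins_b
    if total == 0:
        raise ValueError("no votes recorded")
    return {
        "a": round(100 * wins_a / total, 1),
        "ties": round(100 * ties / total, 1),
        "b": round(100 * wins_b / total, 1),
    }


# Hypothetical tally, not taken from the page: 5 wins, 1 tie, 3 losses.
shares = vote_shares(5, 1, 3)
print(shares)
```

Because each share is rounded independently, the three percentages may not sum to exactly 100.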
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Geometric Composition
Text-to-Image: “A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”
AI Judge Analysis
Nano Banana
- + Excellent photographic quality with realistic soft lighting and dust particles.
- + Perfect adherence to spatial instructions with the sphere floating inside the glass.
- + Natural depth of field and beautiful bokeh on the background plant.
- − The sphere floats unnaturally inside the cube: visually pleasing, but not physically realistic for a 'sphere inside a cube'.
Qwen Image 2512
- + Good material rendering on the wood table and the red book cover.
- + Accurate placement of all requested elements including the plant visible through the glass.
- − The cube reads confusingly, looking more like a glass-bottomed box or a mirror in places.
- − Noticeable digital artifacts or 'ghost' spheres appearing as weird reflections on the right side.
Verdict: Nano Banana produces a much more polished and aesthetically pleasing image with superior lighting and clarity. While Qwen Image 2512 follows the prompt accurately, it suffers from messy reflections and less convincing glass transparency compared to the clean, professional look of Nano Banana.
Candid Street Photography
Text-to-Image: “A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”
AI Judge Analysis
Nano Banana
- + Excellent atmospheric lighting and believable rain/wet texture.
- + The character is actively engaged in 'repairing' as requested, with tools visible on a newspaper.
- + Stronger cinematic composition with a clear 50mm feel and wet pavement reflections.
- − The man's right hand and the bicycle's drivetrain have anatomical and mechanical glitches.
- − Slightly too 'clean' for an 'imperfect framing' prompt.
Qwen Image 2512
- + Very realistic skin texture and facial features.
- + Captures the 'imperfect framing' and 'candid' look effectively.
- + Good integration of the character with the background traffic.
- − The man is posing/looking at the camera rather than repairing the bike.
- − Significant anatomical issues with the hands (multiple extra fingers on the left hand).
- − The bicycle's rear frame and chain guard are warped and physically impossible.
Verdict: Nano Banana creates a much more compelling scene that adheres to the 'repairing' part of the prompt, featuring a grounded environment with tools and a realistic atmosphere. While Qwen Image 2512 captures a more candid facial expression, it fails the 'repairing' action and suffers from severe anatomical errors in the hands and mechanical distortions in the bike.
Modern Clean Menu
Text-to-Image: “Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”
AI Judge Analysis
Nano Banana
- + Excellent text legibility and mostly correct spelling including menu items.
- + High-quality, distinct, and appetizing food photography in a neat grid.
- + Clean, professional layout that perfectly matches the 'modern minimalist' request.
- − Minor spelling errors (e.g., 'Appeitiers', 'Brusuechta').
- − Some nonsensical item descriptions, such as 'NY Strip' being described as 'vanilla bean ice cream'.
Qwen Image 2512
- + Strong use of vibrant accents and color coding for categories.
- + Good composition with a large featured pizza at the bottom.
- + Consistent art style for the food photography.
- − Text is largely illegible or gibberish (e.g., 'RESSAGRENT', 'PIZUMAREL').
- − Many artifacts in the text rendering and logical inconsistencies in prices.
- − Cluttered layout compared to the requested minimalist aesthetic.
Verdict: Nano Banana is the clear winner as it produces a functional menu design with highly legible text and high-quality food photography that adheres perfectly to the professional, minimalist prompt. While Qwen Image 2512 has a nice color palette, it fails significantly on text rendering, producing gibberish that makes the menu unusable.
Chalkboard Menu
Text-to-Image: “Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”
AI Judge Analysis
Nano Banana
- + Excellent spelling accuracy on all menu items
- + Clean composition with realistic café background bokeh
- + Consistent handwriting style across the entire board
- − Handwriting looks slightly more like a digital font than natural chalk
- − Minimal chalk dust or smudging texture compared to Qwen Image 2512
Qwen Image 2512
- + Superb chalk texture with realistic smudging and pressure variations
- + Highly organic handwriting with natural character slants
- + Better use of the board space with larger, more legible text
- − Spelling error in 'Risitto' (should be Risotto)
- − The word 'Risitto' is somewhat disconnected from the first line of the item
Verdict: Nano Banana excels in typographical accuracy, correctly spelling every word in the complex prompt. However, Qwen Image 2512 produces a much more convincing chalk aesthetic with superior texture and more natural handwriting, despite a minor spelling error in 'Risotto'.
The Capybara Taxi Driver
Text-to-Image: “Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”
AI Judge Analysis
Nano Banana
- + Excellent photorealistic texture on the capybara's fur and leather jacket
- + Strong cinematic lighting and composition from a side-view angle
- + The businesswoman's bored expression perfectly matches the prompt
- − The passenger appears to be in the front seat or mid-cabin instead of the back seat
- − The steering wheel placement is awkward and disconnected from the dashboard
Qwen Image 2512
- + Follows the seating arrangement more accurately with the passenger in the back seat
- + Centrally aligned composition provides a clear view of both subjects
- + The capybara's paws are clearly placed on the steering wheel as requested
- − The passenger's expression looks sad or pouting rather than 'bored' or 'normal'
- − The hands of the passenger are poorly rendered with anatomical artifacts
- − The capybara's face is slightly less realistic compared to Nano Banana
Verdict: Nano Banana produces a much more realistic and aesthetically pleasing image with superior textures and lighting, though it fails to place the passenger in the back seat. Qwen Image 2512 follows the spatial instructions of the prompt more closely but suffers from significant distortions in the passenger's face and hands. Nano Banana is the preferred choice for its high visual quality and better adherence to the character's requested emotional tone.
Isometric Miniature Diorama Scenes
Text-to-Image: “Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”
AI Judge Analysis
Nano Banana
- + Perfectly rendered bold black text with correct positioning.
- + Clean, soft-shaded 3D aesthetic that matches the 'cartoon scene' prompt well.
- + Simple and professional layout on a high-clarity square format.
- − The flag icon is slightly off-center relative to the text block.
- − The wooden base is a bit thick, though it fits the isometric style.
Qwen Image 2512
- + Highly detailed textures on the sushi and garnish, showing realistic PBR material qualities.
- + Excellent miniature diorama feel with added greenery and environment.
- + Great adherence to the isometric perspective and lighting.
- − The typography is stylized with outlines rather than the 'large bold text' requested.
- − The flag icon is placed to the side rather than top-center as implied by the text arrangement.
Verdict: Nano Banana followed the layout and typography instructions more precisely, delivering very clean and legible text in the requested top-center position. However, Qwen Image 2512 produced a more visually sophisticated 3D diorama with superior texture work and a more interesting composition, making it the better choice for a 'miniature 3D cartoon scene'.
Adorable Baby Animals in Sunny Meadow
Text-to-Image: “Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”
AI Judge Analysis
Nano Banana
- + Perfect inclusion of all four requested species with clear separation.
- + Excellent representation of 'chasing and tumbling' with the fox on its back and the kitten's playful pose.
- + Beautiful atmospheric lighting with clearly defined god rays and dew sparkles.
- − The fur texture on the kitten looks slightly more 'illustrated' than hyper-photorealistic.
- − One butterfly is merging into the dog's ear.
Qwen Image 2512
- + Stronger fur detail and realistic textures on the puppy and fox.
- + Bright, vibrant color palette that matches the 'wholesome' vibe well.
- + Good composition with the animals grouped tightly and looking at the camera.
- − The fox and cat are merging into the puppy's body in an unnatural way.
- − The animals are sitting still rather than 'chasing and tumbling' as requested.
- − The butterfly on the left has anatomical issues where it meets the cat.
Verdict: Nano Banana followed the prompt more accurately by depicting a playful, tumbling scene with all four animals clearly visible and distinct. Qwen Image 2512 has slightly better individual fur textures but fails on the 'tumbling' action and has significant clipping issues where the animals' bodies merge together.
Vintage Cafe Logo
Text-to-Image: “Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”
AI Judge Analysis
Nano Banana
- + Clean, minimalist vector emblem style as requested.
- + Excellent typography with accurate spelling and accent marks.
- + Balanced circular composition that works well as a logo.
- − The steam is integrated inside the dome rather than rising from it.
- − The 'Est. 1720' text is slightly clipped by the banner border.
Qwen Image 2512
- + Sophisticated illustrative style with great use of vintage textures and shading.
- + Excellent rendering of 'Est. 1720' on a dynamic banner.
- + Strong visual appeal with high-quality cross-hatching details.
- − Less 'minimalist' than requested, leaning more into a complex illustration.
- − The main typography 'Caffé Florian' uses an acute accent (é) instead of the correct grave accent (è) requested in the prompt.
Verdict: Nano Banana captures the 'minimalist vector emblem' aesthetic perfectly, providing a clean and usable logo design with correct Italian orthography. Qwen Image 2512 offers a much more detailed and visually stunning illustration, but it misses the 'minimalist' requirement and contains a small spelling error in the name.
Explore each model
Improved version of Alibaba's Qwen image model with better text rendering, finer natural textures, and more realistic human generation.