Nano Banana 2: Gemini 3.1 Flash with image generation capabilities. A high-efficiency image generation model with support for text rendering, reference images, search grounding, and thinking mode. The efficient counterpart to Gemini 3 Pro Image.
Settled by community votes across 8 shared challenges, with an AI judge weighing in on each.
Nano Banana 2
#1 of 44 in Text-to-Image
Qwen Image 2512
#26 of 44 in Text-to-Image
Where the votes landed
Nano Banana 2: 100.0% win rate
Ties: 0.0%
Qwen Image 2512: 0.0% win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Geometric Composition
Text-to-Image: “A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”
AI Judge Analysis
Nano Banana 2
- + Excellent adherence to all prompt elements, including the position of the sphere and the plant.
- + Very high photographic realism with convincing textures on the wood, book cover, and glass.
- + Correct rendering of lighting from the left and clear text on the book spine.
- − The plant is behind the cube but doesn't show much distortion/refraction through the glass panels, looking slightly like it's just a background layer.
Qwen Image 2512
- + Good composition with a clear perspective on the cube.
- + Effective lighting and color palette matching the prompt description.
- + Accurately placed red book and blue sphere.
- − The internal logic of the glass is confusing; the cube appears to have a mirrored base and back panel that weren't requested.
- − The plant is almost entirely obscured by the reflections/mirrors rather than being 'partially visible through the glass'.
Verdict: Nano Banana 2 is the superior image because it accurately captures every detail of the prompt with high realism, including the specific requirement of the plant being visible through the glass. Qwen Image 2512 introduced unnecessary mirrored surfaces inside the cube, which distracted from the clarity of the scene and obscured the plant in the background.
Candid Street Photography
Text-to-Image: “A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”
AI Judge Analysis
Nano Banana 2
- + Excellent adherence to the 'repairing' action with tools and visible chain work
- + Highly authentic street photography aesthetic with realistic Japanese signage
- + Great use of reflections and wet pavement textures
- − The man's hands and the wrench have some structural merging issues
- − The bicycle's kickstand is positioned in a way that looks physically implausible
Qwen Image 2512
- + Very natural skin texture and facial features
- + Good implementation of shallow depth of field and motion blur in the background
- + Captures the 'light rain' atmosphere well with visible droplets on the bike
- − The man is posing rather than 'repairing' the bicycle as requested
- − Severe anatomy issues with the hands merged into the seat and frame
- − Missing the 'candid' feel, looking more like a staged portrait
Verdict: Nano Banana 2 followed the prompt much more effectively by showing the man actually repairing the bicycle within a complex, realistic street environment. Qwen Image 2512 struggled with the specific action requested and suffered from significant anatomical defects in the hands, resulting in a static portrait rather than a candid scene.
Modern Clean Menu
Text-to-Image: “Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”
AI Judge Analysis
Nano Banana 2
- + Excellent text legibility and mostly coherent English words.
- + Logical layout with photos corresponding to specific menu items.
- + Professional use of bold sans-serif typography and consistent color accents.
- − Small spelling errors in some descriptions (e.g., 'witi', 'somon').
- − Pricing symbols slightly overlap descriptions in the Mains section.
Qwen Image 2512
- + Vibrant food photography with good color saturation.
- + Creative use of geometric color blocks for section headers.
- − Text is entirely unintelligible gibberish.
- − The layout is cluttered and the 'grid' of photos feels disconnected from the text lists.
- − Failed to include a 'Mains' section, using '/MEANS' instead.
Verdict: Nano Banana 2 is the superior choice because it produces a functional, realistic menu with legible text and a logical design where the images actually match the menu items. Qwen Image 2512 fails significantly on text rendering, producing nonsensical characters that make the design unusable for its intended purpose.
Chalkboard Menu
Text-to-Image: “Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”
AI Judge Analysis
Nano Banana 2
- + Perfect spelling on all requested menu items and numbers.
- + The chalk texture on the board looks very authentic with realistic smears and dusting.
- + Excellent composition and spacing that fills the board naturally.
- − The handwriting looks slightly more digital/font-like compared to Qwen Image 2512.
- − Failed to make the title 'elegant cursive' as requested, opting for a print-slant style.
Qwen Image 2512
- + Excellent 'elegant cursive' script for the title as requested in the prompt.
- + Very realistic chalk stroke pressure, especially on the larger letters.
- + Better adherence to the 'handwritten-style' with more natural character variations.
- − Minor spelling error ('Risitto' instead of 'Risotto').
- − Missing the currency symbol for the 'Brown Butter Chocolate Chip Cookies'.
Verdict: Nano Banana 2 rendered all of the requested text with no spelling errors, though the handwriting feels slightly standardized. Qwen Image 2512 captured the 'elegant cursive' and artistic chalk texture much more effectively, but it struggled with a minor spelling error and a missing currency symbol. Nano Banana 2 is the winner for total accuracy, while Qwen Image 2512 is better for purely aesthetic handwritten realism.
The Capybara Taxi Driver
Text-to-Image: “Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”
AI Judge Analysis
Nano Banana 2
- + Excellent photographic quality with a high level of detail in the fur and environment.
- + Highly realistic taxi interior with accurate lighting from external city signs.
- + Great cinematic composition that captures a sense of a real Manhattan street.
- − Failed to include the human businesswoman in the back seat.
- − The capybara only has one paw clearly on the steering wheel while the other is obscured.
Qwen Image 2512
- + Followed all prompt instructions including the human passenger and the phone.
- + Good front-facing symmetry and clear placement of both paws on the wheel.
- + Accurately captured the 'bored' expression of the passenger.
- − The passenger's hands and face have some slight anatomical artifacts typical of AI.
- − The overall image is slightly softer/less sharp than Nano Banana 2's.
Verdict: Nano Banana 2 produces a significantly more realistic and detailed image with superior lighting and texture, but it completely failed to include the passenger requested in the prompt. Qwen Image 2512 followed the prompt much more accurately and captured the comedic contrast requested, even though its technical visual fidelity is slightly lower than Nano Banana 2.
Isometric Miniature Diorama Scenes
Text-to-Image: “Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”
AI Judge Analysis
Nano Banana 2
- + Excellent text rendering with a professional graphic design feel.
- + High-quality realistic PBR textures on the fish and wood grain.
- + Highly detailed and diverse selection of sushi types.
- − The camera angle is a standard perspective rather than a true 45-degree isometric view.
- − Missing the 'miniature 3D cartoon' style, leaning more toward photorealism.
Qwen Image 2512
- + Captured the miniature 3D cartoon style and isometric perspective perfectly.
- + Excellent soft, refined clay-like textures as requested.
- + Clean, bold text and flag icon that match the aesthetic of the scene.
- − Individual rice grains look slightly like tiny beads rather than realistic rice.
- − The 'JAPAN' text is slightly less polished than the typography in Nano Banana 2's image.
Verdict: While Nano Banana 2 produced a visually stunning and realistic image with superior typography, Qwen Image 2512 followed the aesthetic prompts much more closely. Qwen Image 2512 successfully delivered the requested 45-degree isometric perspective and the 'miniature 3D cartoon' style on a square diorama base, whereas Nano Banana 2 ignored the isometric and cartoon instructions in favor of a photorealistic render.
Adorable Baby Animals in Sunny Meadow
Text-to-Image: “Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”
AI Judge Analysis
Nano Banana 2
- + Excellent depiction of motion with the animals playfully 'tumbling' as requested.
- + Strong adherence to all four specified animals with distinct, realistic body shapes.
- + Great atmospheric lighting with visible god rays and a sense of depth in the meadow.
- − The fox kit's face is slightly less 'expressive' than the other animals.
- − The blue butterfly looks a bit flat compared to the surrounding environment.
Qwen Image 2512
- + Extremely detailed fur textures and very expressive 'big eyes'.
- + Beautiful, warm golden lighting that creates a wholesome atmosphere.
- + Crisp rendering of the butterflies and wildflowers in the foreground.
- − Static composition misses the 'chasing' and 'tumbling' action requested in the prompt.
- − The fox's anatomy is a bit unusual, with very long ears that make it look more like a fennec fox or a hybrid.
- − The animals are unnaturally huddled together for a portrait rather than playing.
Verdict: Nano Banana 2 is the winner because it captures the active, joyful energy of the prompt, showing the animals actually playing and chasing butterflies in a realistic meadow. While Qwen Image 2512 has slightly sharper textures and bigger eyes, it produces a static 'family portrait' style image that ignores the 'tumbling' and 'chasing' actions requested.
Vintage Cafe Logo
Text-to-Image: “Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”
AI Judge Analysis
Nano Banana 2
- + Perfect adherence to circular vector emblem style
- + Accurate rendering of the accent in 'Caffè'
- + Superior minimalist composition suitable for a real logo
- − The steam lines are a bit thin compared to the bold linework of the cloche
Qwen Image 2512
- + Excellent shading and metallic texture on the cloche dome
- + Dynamic and detailed steam effects
- + Strong typography with high readability
- − Missed the grave accent on 'Caffè' (used an acute accent)
- − The composition is more of a complex illustration than a minimalist vector emblem
Verdict: Nano Banana 2 followed the 'minimalist vector emblem' request much better, producing a clean, professional-grade logo that feels authentic to the prompt's style. Qwen Image 2512 produced a beautiful, high-quality illustration, but it lacks the minimalism requested and failed on the specific Italian orthography for the word 'Caffè'.
Explore each model
Qwen Image 2512: An improved version of Alibaba's Qwen image model with better text rendering, finer natural textures, and more realistic human generation.