Gemini 3.1 Flash with image generation capabilities. High-efficiency image generation model with support for text rendering, reference images, search grounding, and thinking mode. The efficient counterpart to Gemini 3 Pro Image.
Settled by community votes across 3 shared challenges, with an AI judge weighing in on each.
Nano Banana 2
#1 of 44 in Text-to-Image
Not enough comparable category data
The chart appears once both models have ratings across at least three shared arena categories.
Qwen Image 2.0 Pro
#27 of 44 in Text-to-Image
Where the votes landed
Nano Banana 2
100.0%
win rate
Ties
0.0%
Qwen Image 2.0 Pro
0.0%
win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Chalkboard Menu
Text-to-Image“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”
AI Judge Analysis
Nano Banana 2
- + Excellent text rendering with perfect spelling and punctuation.
- + High-quality realistic chalkboard texture with authentic smudges.
- + Clean composition with a well-framed wooden border.
- − The text look slightly too uniform, more like a handwriting font than natural hand-lettering.
- − The title is not in the requested 'elegant cursive' style.
Qwen Image 2.0 Pro
- + Features a more authentic, varied handwriting style that genuinely looks hand-drawn.
- + Successfully followed the request for cursive flourishes in the title.
- + Good chalk texture and dust effects on the board.
- − Slightly awkward perspective/angle on the board compared to Model A.
- − Missing the wooden frame requested implicitly by 'handwritten-style chalkboard menu' in a café setting.
Verdict: Both models performed exceptionally well on the text generation. Nano Banana 2 provides a cleaner, more legible result with perfect spelling, while Qwen Image 2.0 Pro captures a more authentic hand-lettered feel with the requested cursive slant and varied stroke weights, making it feel more like a real café board.
The Capybara Taxi Driver
Text-to-Image“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”
AI Judge Analysis
Nano Banana 2
- + Excellent photorealistic lighting and textures
- + Highly realistic taxi interior details including the meter and GPS
- + Great cinematic composition with clear landmarks like Radio City Music Hall
- − Fails to include the human passenger in the back seat
- − The capybara's paws look slightly more like human hands than actual capybara feet
Qwen Image 2.0 Pro
- + Successfully includes the human passenger as requested
- + Capture the requested 'bored' expression on the woman very well
- + Both paws are clearly on the steering wheel as requested
- − Lower image resolution and overall clarity compared to Model A
- − Text on the dashboard sticker has a spelling error ('Licesed')
- − The camera angle makes the interior space feel a bit cramped and distorted
Verdict: Nano Banana 2 produces a much more visually stunning and realistic image, but it completely misses the human passenger required by the prompt. Qwen Image 2.0 Pro follows all instructions, including the complex interaction between the capybara and the businesswoman, but suffers from slightly lower visual quality and a spelling error on the dashboard.
The Halloween Invitation
Text-to-Image“Vintage gothic Halloween party invitation. Dark parchment poster, spooky border with webs and thorns, central glowing jack-o-lantern, bats, twisted trees, moody night sky. Add elegant gothic title text saying "Halloween Party Invitation", a small scroll banner saying "You are invited to a night of frights", and event details at the bottom: Date: 30.10.2026 Time: 7pm Location: The Arches, NYC Spooky but polished, cinematic lighting, square format.”
AI Judge Analysis
Nano Banana 2
- + Excellent typography with a glowing effect that matches the aesthetic.
- + High-quality, detailed illustration including a spooky archway and artistic skulls.
- + Perfect layout with a cohesive vintage gothic color palette.
- − The parchment texture is slightly less pronounced than in Model B.
Qwen Image 2.0 Pro
- + Accurate depiction of the requested thorn and web border.
- + Clean and legible text rendering for all event details.
- + Interesting green glow variation for the jack-o-lantern.
- − The composition feels a bit sparse with the bats floating awkwardly.
- − The central jack-o-lantern has internal candle artifacts (candles floating in eyes).
Verdict: Nano Banana 2 is the clear winner as it provides a much more polished and cinematically composed illustration that feels like a professional invitation. While Qwen Image 2.0 Pro followed the border instructions well, its central elements are less cohesive and contain minor visual artifacts in the pumpkin details.
Explore each model
Alibaba's Qwen Image 2.0 Pro model offering higher quality image generation with enhanced detail and accuracy