Gemini 3.1 Flash with image generation capabilities. High-efficiency image generation model with support for text rendering, reference images, search grounding, and thinking mode. The efficient counterpart to Gemini 3 Pro Image.
Settled by community votes across 2 shared challenges, with an AI judge weighing in on each.
Nano Banana 2
#1 of 44 in Text-to-Image
Not enough comparable category data
The chart appears once both models have ratings across at least three shared arena categories.
Qwen Image 2.0
#32 of 44 in Text-to-Image
Where the votes landed
Nano Banana 2
100.0%
win rate
Ties
0.0%
Qwen Image 2.0
0.0%
win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
The Reversed Rodeo
Text-to-Image“Horse riding astronaut in space — horse on top, not vice versa. Surreal, highly detailed, cinematic.”
AI Judge Analysis
Nano Banana 2
- + Perfectly follows the specific constraint of the horse being on top of the riding astronaut.
- + High level of detail in the horse's musculature and the astronaut's suit.
- + Vibrant, cinematic space background with beautiful nebulae and lighting effects.
- − The horse's hoof appears to be merging with the astronaut's helmet visor.
- − The rein placement is physically confusing and disconnected.
Qwen Image 2.0
- + Clean, high-resolution rendering with a polished aesthetic.
- + Strong anatomical consistency for the horse and astronaut.
- − Fails the primary prompt constraint; the astronaut is riding the horse, not vice versa.
- − The addition of water droplets/bubbles feels random and unexplained.
Verdict: Nano Banana 2 followed the specific and difficult instruction of placing the horse on top of the astronaut, creating a truly surreal image as requested. Qwen Image 2.0 produced a high-quality visual but completely ignored the spatial instruction 'horse on top, not vice versa,' resulting in a standard interpretation of the prompt.
The Halloween Invitation
Text-to-Image“Vintage gothic Halloween party invitation. Dark parchment poster, spooky border with webs and thorns, central glowing jack-o-lantern, bats, twisted trees, moody night sky. Add elegant gothic title text saying "Halloween Party Invitation", a small scroll banner saying "You are invited to a night of frights", and event details at the bottom: Date: 30.10.2026 Time: 7pm Location: The Arches, NYC Spooky but polished, cinematic lighting, square format.”
AI Judge Analysis
Nano Banana 2
- + Excellent typography with a glowing effect that matches the cinematic lighting.
- + High level of detail in the border illustrations including spiders and ornate flourishes.
- + Perfect adherence to all text requirements and a cohesive vintage illustration style.
- − The parchment edges look slightly clipped or cut off by the square frame.
Qwen Image 2.0
- + Strong 'thorns' representation in the border as specifically requested in the prompt.
- + Clear, legible gothic font for all required text fields.
- + Good atmospheric lighting on the central jack-o-lantern.
- − The composition feels a bit disjointed with the background image floating on a flat parchment texture.
- − The text '30.10.2026' has slight spacing issues.
- − Fewer interesting details compared to Nano Banana 2.
Verdict: Nano Banana 2 is the superior output because it creates a fully integrated, polished illustration where the text and artwork feel like a single cohesive vintage poster. Qwen Image 2.0 follows all instructions well but feels more like a standard digital collage with less artistic depth. Nano Banana 2's attention to detail in the border and the glowing typography provides a much more cinematic and high-quality feel.
Explore each model
Alibaba's Qwen Image 2.0 model with enhanced text rendering, supporting both Chinese and English prompts with up to 6 images per request