Gemini 3.1 Flash with image generation capabilities. High-efficiency image generation model with support for text rendering, reference images, search grounding, and thinking mode. The efficient counterpart to Gemini 3 Pro Image.
Settled by community votes across 2 shared challenges, with an AI judge weighing in on each.
Nano Banana 2
#1 of 44 in Text-to-Image
Not enough comparable category data
The chart appears once both models have ratings across at least three shared arena categories.
Stable Diffusion 3.5 Large Turbo
#44 of 44 in Text-to-Image
Where the votes landed
Nano Banana 2
100.0%
win rate
Ties
0.0%
Stable Diffusion 3.5 Large Turbo
0.0%
win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Magic Burger Explosion: Fiery Photorealism Challenge
Text-to-Image“Ad for 'Magic Burger'. Dynamic, exploded burger with all components (bun, patty, cheese, lettuce, tomato, sauce) suspended in mid-air. Emphasize photorealistic detail and a sense of motion. Dark, fiery background with glowing embers. Integrate text: 'MAGIC BURGER' as a prominent title, 'LIMITED TIME ONLY' as a secondary message, and '€6.99' in a starburst, all rendered with a fiery, glowing effect.”
AI Judge Analysis
Nano Banana 2
- + Perfect text rendering according to the prompt specifications.
- + Excellent exploded view with clear separation of all burger ingredients.
- + Dynamic composition with realistic food textures and convincing motion effects.
- − The lighting on the lettuce is slightly oversaturated compared to the background.
Stable Diffusion 3.5 Large Turbo
- + Strong atmospheric lighting and vibrant fire effects.
- + Creative use of smoke and glowing embers in the background.
- − Complete failure to include any of the requested text elements.
- − The burger is mostly intact rather than being in a dynamic exploded view.
- − The image quality has a more illustrative, less photorealistic appearance.
Verdict: Nano Banana 2 followed the prompt's complex instructions perfectly, delivering an exploded burger with all requested text accurately rendered. Stable Diffusion 3.5 Large Turbo failed to include 'MAGIC BURGER' or the pricing and did not provide the exploded view requested.
Chalkboard Menu
Text-to-Image“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”
AI Judge Analysis
Nano Banana 2
- + Perfect text rendering that accurately follows the complex prompt instructions.
- + Highly realistic chalkboard texture with authentic chalk smudges and dust.
- + Consistent and convincing handwritten style that looks organic rather than digital.
- − The background cafe scene is a bit generic and out of focus.
- − The text is centered very precisely, which slightly reduces the 'hand-drawn' randomness.
Stable Diffusion 3.5 Large Turbo
- + Clean and modern composition with pleasant lighting and decor.
- + Creative layout using columns and graphic Dividers.
- − Failed significantly on spelling, including 'ocpus', 'lerbs', and 'choclde'.
- − Text appears like a digital vector font rather than realistic chalk texture.
- − Incorrect date rendering ('apri 31 20-') and missing most of the requested text.
Verdict: Nano Banana 2 is the clear winner as it perfectly captured all the text requested in the prompt with high fidelity and realistic chalk textures. Stable Diffusion 3.5 Large Turbo struggled with spelling, failed to complete the requested items, and produced text that looked like a digital font rather than handwriting.
Explore each model
Distilled version of SD 3.5 Large that generates high-quality images in just 4 steps, offering faster inference and reduced costs