The lightweight, low-cost variant of Nano Banana 2 (Gemini 3.1 Flash Image). Ultra-low-latency image generation and editing at a fixed 1K resolution, designed for high-volume interactive use cases.
Settled by community votes across 13 shared challenges, with an AI judge weighing in on each.
Nano Banana 2 Lite
#27 of 48 in Text-to-Image
Qwen Image 2512
#29 of 48 in Text-to-Image
Where the votes landed
Nano Banana 2 Lite
100.0%
win rate
Ties
0.0%
Qwen Image 2512
0.0%
win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Geometric Composition
Text-to-Image“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”
AI Judge Analysis
Nano Banana 2 Lite
- + Excellent rendering of refraction and reflections within a solid glass block.
- + Impressive handling of text on the book spine that relates to the prompt subject.
- + Highly realistic lighting and depth of field that feels like a professional photograph.
- − The blue sphere appears to be levitating in the center of a solid block rather than sitting inside a hollow cube.
- − The plant behind the cube is slightly less integrated into the composition due to the blur.
Qwen Image 2512
- + Perfectly captures the 'hollow cube' structure with the sphere resting on the bottom surface.
- + Shows the green plant clearly through the glass walls as requested.
- + The physical interaction between the objects (book on rim, sphere on surface) is very logical.
- − The glass edges have a slightly synthetic teal tint compared to the more natural look of Model A.
- − The lighting on the book's top surface is a bit flat compared to the soft window light requested.
Verdict: Both models followed the prompt instructions near-perfectly, including the specific arrangement of objects and lighting direction. Nano Banana 2 Lite produced a more aesthetically pleasing, 'artistic' shot with impressive text generation on the book, but interpreted the cube as a solid block of glass; Qwen Image 2512 followed the spatial physics of the prompt more literally by placing the sphere inside a hollow glass box.
Candid Street Photography
Text-to-Image“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”
AI Judge Analysis
Nano Banana 2 Lite
- + Excellent action-oriented composition showing the man actually repairing the bike with tools.
- + Superior environmental storytelling with realistic Japanese street signs and background pedestrians.
- + Highly realistic textures on the wet pavement and the man's work clothes.
- − The man's right hand and the wrench have some slight structural merging/distortion.
- − Minor anatomical issues with how the man is crouching relative to the ground.
Qwen Image 2512
- + Features very realistic skin texture and facial details on the subject.
- + Effective use of shallow depth of field and bokeh from car headlights.
- + Clearer rendering of the bicycle frame and components.
- − The man is posing/looking at the camera, failing the 'candid' and 'repairing' parts of the prompt.
- − The 'motion blur' on the cars is minimal, appearing more like static out-of-focus objects.
- − Anatomical issues with the hands, which look swollen and have merged fingers.
Verdict: Nano Banana 2 Lite followed the complex prompt instructions much more effectively, capturing a true candid moment of repair with tools and environmental context. Qwen Image 2512 produced a high-quality portrait, but the subject is simply looking at the camera rather than performing the requested action, making it feel less like a street photo and more like a studio shot.
Fantasy Warrior
Text-to-Image“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”
AI Judge Analysis
Nano Banana 2 Lite
- + Exceptional realism in the facial textures and skin weary with battle.
- + Highly detailed engraving on the plate armor with natural-looking wear and scratches.
- + Stronger adherence to the request for leather straps and cloth underlayer visibility.
- − The image includes distracting white sidebars or cropping artifacts.
- − The bokeh sparks in the background are somewhat muted compared to the prompt's potential.
Qwen Image 2512
- + Excellent lighting with vibrant torchlight reflections on the face and armor.
- + Great execution of the hair braids with colorful beads.
- + High visual appeal with dynamic sparks and a balanced composition.
- − The scars look somewhat superficial or painted on rather than integrated into the skin.
- − The armor engraving is a bit more repetitive and less 'worn' than model A.
Verdict: Both models followed the prompt exceptionally well, capturing the braided hair, battle-worn appearance, and ornate armor. Nano Banana 2 Lite offers a more grit-focused and realistic texture on the skin and metal, though it is marred by large white margins. Qwen Image 2512 provides a more cinematically pleasing image with superior lighting and better use of the bokeh sparks, making it the more visually striking overall.
Modern Clean Menu
Text-to-Image“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”
AI Judge Analysis
Nano Banana 2 Lite
- + Excellent adherence to the layout prompt, featuring a clean grid and correct category headers.
- + Strong readability with professional bold sans-serif fonts and clear pricing.
- + High visual quality in food photography that aligns with the described menu items.
- − Minor text artifacts in descriptions, though headings are mostly intelligible.
- − The background wood grain border was not explicitly requested but adds to the 'casual dining' feel.
Qwen Image 2512
- + Features a bold, vibrant aesthetic with colorful icons and accents.
- + Follows the grid layout requested for the food photography section.
- − Text consists of nonsensical characters and severe legibility issues.
- − The category names fail to match the prompt (e.g., 'Appetiizizers' and 'Piesmakets').
- − Prices are logically inconsistent, using three-digit numbers for casual dining.
Verdict: Nano Banana 2 Lite is the clear winner as it produces a functional and professional-looking menu with legible headings and realistic food representation. While Qwen Image 2512 follows the basic grid layout, its text is completely illegible and it fails to accurately label the requested sections like 'Mains' or 'Pizza'. Nano Banana 2 Lite successfully captures the 'modern minimalist' and 'casual dining' aesthetic specifically requested.
Magic Burger Explosion: Fiery Photorealism Challenge
Text-to-Image“Ad for 'Magic Burger'. Dynamic, exploded burger with all components (bun, patty, cheese, lettuce, tomato, sauce) suspended in mid-air. Emphasize photorealistic detail and a sense of motion. Dark, fiery background with glowing embers. Integrate text: 'MAGIC BURGER' as a prominent title, 'LIMITED TIME ONLY' as a secondary message, and '€6.99' in a starburst, all rendered with a fiery, glowing effect.”
AI Judge Analysis
Nano Banana 2 Lite
- + Excellent photorealistic texture on the burger patty and bacon
- + Includes every requested text element with perfect spelling
- + Highly dynamic exploded view with a clear sense of vertical motion
- − The starburst for the price is a bit messy and overlaps with the background embers
Qwen Image 2512
- + Strong typography for the main title with a vivid fiery effect
- + Clean starburst element for the price tags
- + Vibrant colors and high clarity
- − Missing the word 'TIME' in 'LIMITED TIME ONLY'
- − The burger is less 'exploded' than Model A, with most components still clumped together
Verdict: Nano Banana 2 Lite is the superior output because it successfully included all requested text elements, whereas Qwen Image 2512 missed a word in the secondary message. Furthermore, Nano Banana 2 Lite provided a better interpretation of the 'exploded' burger prompt, showing individual layers suspended in air with high detail.
Chalkboard Menu
Text-to-Image“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”
AI Judge Analysis
Nano Banana 2 Lite
- + Excellent text legibility and accuracy
- + Perfectly logical placement of the menu items
- + Background café scene is rich in detail and enhances the atmosphere
- − Text looks slightly too clean and uniform like a digital font despite the chalk texture
- − Title is a mix of cursive and caps rather than purely elegant cursive
Qwen Image 2512
- + Natural-looking cursive handwriting style
- + Realistic chalk smudges and variations on the board surface
- + Accurately renders the cursive request for the title
- − Spelling error in 'Risotto' (spelled 'Risitto')
- − Pricing dashes are floating or disconnected from the text
- − Composition is a bit tighter on the board with less environment shown
Verdict: Nano Banana 2 Lite produced a perfectly spelled and highly legible menu that feels professionally organized, though the letters are a bit too uniform. Qwen Image 2512 captured the 'handwritten' and 'cursive' aesthetic much more realistically with natural chalk imperfections, but failed on the spelling of 'Risotto'. Nano Banana 2 Lite is the preferred winner for its overall clarity and correctness.
The Reversed Rodeo
Text-to-Image“Horse riding astronaut in space — horse on top, not vice versa. Surreal, highly detailed, cinematic.”
AI Judge Analysis
Nano Banana 2 Lite
- + Successfully follows the surreal instruction of the horse on top.
- + Beautifully detailed space background with a cinematic color palette.
- + Excellent use of lighting through the horse's mane and on the astronaut's suit.
- − The astronaut is on a strange mechanical seat rather than being directly 'ridden' by the horse.
- − The horse's legs are slightly truncated or poorly positioned near the astronaut.
Qwen Image 2512
- + High textural detail on the horse and spacesuit.
- + Realistic interpretation of a horse in orbit.
- − Completely failed the negative constraint; the astronaut is riding the horse, not vice versa.
- − The composition is a standard trope and lacks the requested surrealism.
Verdict: Nano Banana 2 Lite is the clear winner because it correctly interpreted the difficult logical inversion requested in the prompt ('horse on top, not vice versa'). Qwen Image 2512 ignored the specific spatial instruction and generated a standard astronaut-riding-a-horse image, whereas Nano Banana 2 Lite embraced the surrealism with better cinematic lighting and composition.
The Capybara Taxi Driver
Text-to-Image“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”
AI Judge Analysis
Nano Banana 2 Lite
- + Excellent photorealism with gritty, cinematic lighting
- + Includes realistic taxi details like a fare meter and NYC logo on the steering wheel
- + Captures the specific 'bored' expression of the passenger perfectly
- − The driver cap is black/navy rather than the requested yellow
- − The capybara's paws are somewhat blended into the steering wheel texture
- − The interior looks a bit more like a generic dark sedan than a bright yellow cab interior
Qwen Image 2512
- + Accurately followed the prompt for a yellow taxi driver cap
- + Perfectly symmetrical composition with the capybara facing forward
- + Strong fur texture and clear rendering of the paws on the steering wheel
- − The passenger's expression looks more 'unhappy' or 'pouty' than bored
- − Perspective is slightly flat compared to the depth in the other image
- − The front paws have a slightly claw-like, unnatural appearance
Verdict: Nano Banana 2 Lite wins on overall photographic quality and atmosphere, creating a believable cinematic scene with excellent attention to the passenger's expression and car interior details. While Qwen Image 2512 followed the color prompt for the hat more accurately, its passenger expression was less 'normal' and the overall composition felt more like an AI collage than a cohesive photograph.
The Halloween Invitation
Text-to-Image“Vintage gothic Halloween party invitation. Dark parchment poster, spooky border with webs and thorns, central glowing jack-o-lantern, bats, twisted trees, moody night sky. Add elegant gothic title text saying "Halloween Party Invitation", a small scroll banner saying "You are invited to a night of frights", and event details at the bottom: Date: 30.10.2026 Time: 7pm Location: The Arches, NYC Spooky but polished, cinematic lighting, square format.”
AI Judge Analysis
Nano Banana 2 Lite
- + Excellent typography with perfect spelling in all requested text fields.
- + Richly detailed composition featuring secondary elements like the haunted house and graveyard.
- + Superb adherence to the aesthetic request for a vintage parchment texture and ornate border.
- − The composition is slightly crowded due to the high amount of detail.
Qwen Image 2512
- + Strong cinematic lighting with a clear focal point on the central jack-o-lantern.
- + Clean and legible layout for the event details at the bottom of the image.
- − Spelling error in the main title ('Hallowern' instead of 'Halloween').
- − The border feels less integrated and slightly more repetitive than Model A.
Verdict: Nano Banana 2 Lite produced a superior result by correctly spelling all text and providing a much more sophisticated vintage gothic illustration. While Qwen Image 2512 has strong lighting, the misspelling of 'Halloween' and the simpler background elements make it less effective as a final invitation design.
Isometric Miniature Diorama Scenes
Text-to-Image“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”
AI Judge Analysis
Nano Banana 2 Lite
- + Excellent typography rendering with clean, professional fonts.
- + Superior realistic PBR materials, especially on the fish textures and ceramic plate.
- + Perfect adherence to the 'rounded diorama base' and '45° isometric' perspectives.
- − The flag icon is slightly separated from the text alignment requested.
Qwen Image 2512
- + Strong 'cartoon' aesthetic with bold outlines and vibrant colors.
- + Creative diorama base with grass and foliage details.
- + Correct placement of the flag icon next to the 'SUSHI' text.
- − The typography has slight artifacts and inconsistent spacing.
- − Lighting is a bit flat compared to the requested realistic PBR look.
Verdict: Nano Banana 2 Lite produced a higher quality, more professional image with realistic textures that perfectly match the 'PBR materials' and 'refined textures' prompt. Qwen Image 2512 captured the 'cartoon' aspect well, but fell short on the clean typography and sophisticated lighting seen in the competing model.
Adorable Baby Animals in Sunny Meadow
Text-to-Image“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”
AI Judge Analysis
Nano Banana 2 Lite
- + Excellent dynamic composition with animals actively playing and tumbling.
- + Perfectly captures the 'god rays' and 'dew sparkles' requested in the prompt.
- + Highly detailed fur textures and expressive facial animations for all four animals.
- − The fox's front paw connection to the puppy is slightly anatomically ambiguous.
- − The scene is a bit crowded with many small butterfly distractions.
Qwen Image 2512
- + Very clean, portrait-style lighting with a strong central focus.
- + High clarity on the facial features of the animals.
- + Good background bokeh that makes the subjects pop.
- − Static 'posed' composition misses the 'playfully chasing' and 'tumbling' action requested.
- − The fox's eyes appear somewhat human-like and slightly asymmetrical.
- − Lacks the environmental details like dew sparkles and distinct god rays found in the other version.
Verdict: Nano Banana 2 Lite is the clear winner as it successfully captures the movement and atmosphere described in the prompt, featuring all animals in a playful, tumbling interaction. Qwen Image 2512 produces a high-quality image, but it is a static portrait that ignores the 'chasing' and 'tumbling' action keywords. Nano Banana 2 Lite also better executes the specific lighting effects like dew sparkles and sunbeams.
Vintage Cafe Logo
Text-to-Image“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”
AI Judge Analysis
Nano Banana 2 Lite
- + Excellent typography including the correct grave accent on 'Caffè'
- + Authentic minimalist vector emblem style suitable for a real logo
- + Balanced and clean composition with professional spacing
- − The steam is represented by very abstract swirls that may be too simplified
Qwen Image 2512
- + Rich illustrative detail on the cloche and steam
- + Good use of warm brown and cream tones with textured shading
- + Accurate spelling and placement of the banner text
- − Misses the 'minimalist' and 'vector' style requirement, appearing more like an illustration than a logo
- − The typography for 'Caffè' uses an acute accent instead of the correct grave accent
Verdict: Nano Banana 2 Lite followed the 'minimalist vector emblem' instruction perfectly, producing a clean, professional logo that includes the correct linguistic accent for the name. While Qwen Image 2512 created a beautiful vintage illustration with great texture, it failed to meet the minimalist style requested and made a small spelling error in the name.
Apollo 11: Journey to Tranquility
Text-to-Image“Create a clean, modern vector infographic poster about the Apollo 11 mission. NASA-inspired palette (navy, white, muted red, light gray). Flat-vector style, crisp lines, consistent iconography, subtle gradients only. Steps (stop at landing): 1. Launch (Saturn Vicon) 2. Earth Orbit (Earth + orbit ring icon) 3. Translunar (trajectory arc icon) 4. Lunar Orbit (Moon + orbit ring icon) 5. Descent (lunar module descending icon) 6. Landing (lunar module on the surface icon) Small supporting elements (minimal text): • Crew strip: three silhouette icons with only last names: Armstrong, Aldrin, Collins. • Landing site marker: Moon pin labeled "Tranquility" only. Layout constraints: generous margins, large readable labels, clean background with subtle stars. Vector-only, print-poster look, high resolution.”
AI Judge Analysis
Nano Banana 2 Lite
- + Excellent typography with nearly perfect spelling across all labels.
- + Clean, professional vertical timeline layout that is easy to follow.
- + Strict adherence to the requested NASA-inspired color palette and flat vector style.
- − The alignment of the '06 Landing' text is slightly overlapping the line graphic.
Qwen Image 2512
- + High-quality vector illustrations of the Saturn V and Lunar Module.
- + Good use of the dark navy background for a space theme.
- − Multiple spelling errors in the text including 'Translartocit' and 'Desceeint'.
- − Confusing and repetitive step numbering (two step 2s, two step 3s).
- − Included 'Steps stop at landing' instruction text directly into the final design.
Verdict: Nano Banana 2 Lite produced a superior infographic with clear, logical flows and accurate text rendering, perfectly following the requested steps and icons. Qwen Image 2512 had significant text coherence issues and failed to organize the mission steps in a chronological or numerically logical order.
Explore each model
Improved version of Alibaba's Qwen image model with better text rendering, finer natural textures, and more realistic human generation.