Black Forest Labs' premium multimodal flow transformer with greatly improved prompt adherence and typography generation for in-context image generation and editing without compromise on speed
Settled by community votes across 17 shared challenges, with an AI judge weighing in on each.
FLUX.1 Kontext [max]
#45 of 48 in Text-to-Image
Nano Banana 2 Lite
#27 of 48 in Text-to-Image
Where the votes landed
FLUX.1 Kontext [max]
0%
win rate
Ties
0%
Nano Banana 2 Lite
0%
win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Geometric Composition
Text-to-Image“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”
AI Judge Analysis
FLUX.1 Kontext [max]
- + Excellent adherence to lighting instructions with dramatic soft window light and caustic shadows
- + Superior glass rendering with realistic reflections and refractions
- + High photographic resolution and clean textures
- − The sphere appears slightly textured/glittery rather than a smooth sphere
- − The plant behind is less distinct than in the other model
Nano Banana 2 Lite
- + Successfully placed the plant behind the cube visible through the glass
- + Spherical object is smooth and clearly rendered
- + Natural film-like aesthetic and soft lighting
- − The sphere is defying gravity/floating without context in the center of the cube
- − Image has lower resolution and noticeable grain compared to model a
Verdict: FLUX.1 Kontext [max] produced a much more realistic and professionally lit photograph with impressive caustic light effects, though the sphere has a rough texture. Nano Bana 2 Lite captured the composition well but suffered from lower image quality and an awkwardly floating sphere. FLUX.1 Kontext [max] is the winner for its superior visual fidelity and lighting work.
Candid Street Photography
Text-to-Image“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”
AI Judge Analysis
FLUX.1 Kontext [max]
- + Excellent depiction of heavy, streaking rain
- + High skin texture detail and realistic hand lighting
- + Successful implementation of shallow depth of field
- − The bike chain is physically disconnected and floating in mid-air
- − The man appears generic rather than specifically Japanese
- − Composition is a bit too clean and centered for 'imperfect framing'
Nano Banana 2 Lite
- + Stronger 'candid' feel with better environmental storytelling and signs
- + Includes tools and a kickstand, making the repair look more functional
- + Better adherence to 'imperfect framing' and 'Japanese' facial features
- − The right hand is poorly rendered with garbled fingers
- − Rain effects are very subtle and almost invisible compared to the pavement reflections
- − Lower overall resolution/clarity compared to Model A
Verdict: FLUX.1 Kontext [max] has much higher technical image quality and a more dramatic cinematic atmosphere, though it fails significantly on the logic of the bicycle chain. Nano Bana 2 Lite captures the 'candid street' aesthetic better with authentic-looking Japanese signage and a more natural layout, but it suffers from typical AI artifacts in the hands. FLUX.1 Kontext [max] is the preferred image because the rain and lighting better match the provided prompt's cinematic requirements.
Fantasy Warrior
Text-to-Image“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”
AI Judge Analysis
FLUX.1 Kontext [max]
- + Excellent depiction of ornate engraved plate armor with high-fidelity metal textures.
- + Strong adherence to the 'torchlight reflecting off metal' prompt with vibrant lighting.
- + High skin detail including sweat, pores, and fine hair textures.
- − Missed the specific 'hair braided with small beads' detail, opting for plain braids.
- − Skin tone appears overly saturated or orange/bronzed due to the lighting.
Nano Banana 2 Lite
- + Includes the 'braided with small beads' detail accurately.
- + Superior texture on the leather straps and diverse cloth/mail underlayers.
- + Very realistic depiction of 'battle-worn' through authentic scars, dirt, and tired eyes.
- − The image has strange white borders on the left and right sides.
- − The lighting is more muted and less 'warm torchlight' compared to the first image.
Verdict: Nano Bana 2 Lite captures the technical details of the prompt better, specifically the beads in the hair and the complex layering of leather and mail, though it has presentation issues with the white pillars on the sides. FLUX.1 Kontext [max] provides a more visually striking 'paladin' aesthetic with superior metal engraving detail and lighting, but misses the finer beads mentioned in the prompt. Overall, Nano Bana 2 Lite feels more like a lived-in character, whereas FLUX.1 Kontext [max] feels like a high-end cinematic render.
Modern Clean Menu
Text-to-Image“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”
AI Judge Analysis
FLUX.1 Kontext [max]
- + Excellent high-resolution food imagery
- + Professional graphic design layout
- + Clean and minimalist use of white space
- − Text is mostly illegible gibberish
- − Food variety is almost entirely limited to pizza unlike the prompt request
Nano Banana 2 Lite
- + Perfect adherence to requested sections (Appetizers, Pizza, Mains)
- + Remarkable text legibility for item names and descriptions
- + Balanced layout with vibrant accent colors
- − Some minor spelling errors in small font text
- − Perspective of food photos is slightly inconsistent
Verdict: Nano Bana 2 Lite is the clear winner as it perfectly adheres to the structural requirements of the prompt, including distinct sections for appetizers, pizza, and mains. While FLUX.1 Kontext [max] has slightly more artistic food photography, its failure to generate legible text and its repetitive food choices make it less effective as a menu design.
Magic Burger Explosion: Fiery Photorealism Challenge
Text-to-Image“Ad for 'Magic Burger'. Dynamic, exploded burger with all components (bun, patty, cheese, lettuce, tomato, sauce) suspended in mid-air. Emphasize photorealistic detail and a sense of motion. Dark, fiery background with glowing embers. Integrate text: 'MAGIC BURGER' as a prominent title, 'LIMITED TIME ONLY' as a secondary message, and '€6.99' in a starburst, all rendered with a fiery, glowing effect.”
AI Judge Analysis
FLUX.1 Kontext [max]
- + Crisp, clean typography that is highly legible
- + High-resolution rendering of the main burger
- − Failed the 'exploded' prompt as the main burger is mostly assembled
- − The starchurst element is missing or poorly integrated
- − The price tag uses a comma instead of a period
Nano Banana 2 Lite
- + Perfectly captures the 'exploded' burger concept with suspended ingredients
- + Followed all text instructions including the fiery starburst for the price
- + Excellent sense of motion with sauce splashes and embers
- − The text is slightly less sharp than Model A
- − The top bun is slightly distorted in shape
Verdict: Nano Bana 2 Lite is the clear winner as it successfully interpreted the 'exploded' instruction, creating a dynamic composition with suspended layers and splashes. FLUX.1 Kontext [max] produced a high-quality image, but failed the core creative requirement by keeping the burger largely whole and failing to include the requested starburst element.
Chalkboard Menu
Text-to-Image“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”
AI Judge Analysis
FLUX.1 Kontext [max]
- + Excellent text legibility and accuracy
- + Realistic chalk smudges and eraser marks on the board
- + Natural variations in handwriting which feels authentic
- − The title is in print-style block letters rather than the 'elegant cursive' requested
- − Composition is a very tight crop on the board with less café atmosphere
Nano Banana 2 Lite
- + Successfully followed the instruction for cursive handwriting in the title
- + Better overall café composition and depth of field
- + Textured chalk strokes feel very tactile and consistent
- − The small text at the bottom begins to look like a digital font rather than handwriting
- − Minor spacing issues in the price of the first item
Verdict: Both models followed the complex text requirements with impressive accuracy. Nano Bana 2 Lite is the winner because it successfully followed the specific prompt for 'elegant cursive' in the title, which FLUX.1 Kontext [max] ignored, and it provided a much richer visual context for the café setting.
The Reversed Rodeo
Text-to-Image“Horse riding astronaut in space — horse on top, not vice versa. Surreal, highly detailed, cinematic.”
AI Judge Analysis
FLUX.1 Kontext [max]
- + High visual clarity and realistic textures
- + Excellent lighting and cinematic composition
- + Clean rendering of the astronaut and horse
- − Failed the negative constraint to have the horse on top of the astronaut
- − Predictable interpretation of the prompt
Nano Banana 2 Lite
- + Intricate mechanical details on the cybernetic horse
- + Dynamic background with vibrant nebulae and asteroids
- + Good use of the 'highly detailed' and 'surreal' keywords
- − Failed the negative constraint to have the horse on top of the astronaut
- − The NASA-like logos are garbled and messy
Verdict: Both models failed the specific spatial logic request ('horse on top, not vice versa'), instead providing the standard astronaut-on-horse image. FLUX.1 Kontext [max] produced a much cleaner, more photographic image with superior lighting, whereas Nano Bana 2 Lite went for a more maximalist, sci-fi aesthetic that suffers from messy details and artifacts.
The Capybara Taxi Driver
Text-to-Image“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”
AI Judge Analysis
FLUX.1 Kontext [max]
- + Excellent fur texture and lighting on the capybara's face.
- + Accurate yellow taxi driver cap with 'TAXI' text.
- + High-quality bokeh effect for the city background.
- − The passenger is on a phone call rather than looking at the screen as requested.
- − The capybara only has one paw clearly on the wheel.
- − Composition is a bit tight, losing some of the interior detail.
Nano Banana 2 Lite
- + Perfectly captures the passenger looking at her phone with a bored expression.
- + Includes great interior details like the taxi meter and GPS.
- + Both paws are securely on the steering wheel as requested.
- − The hat looks more like a police/chauffeur cap than a yellow taxi hat.
- − The steering wheel has some AI artifacting on the left side.
- − The cab interior looks a bit grimy/dirty compared to a standard professional setting.
Verdict: Nano Bana 2 Lite is the overall winner for superior prompt adherence, particularly regarding the passenger's expression and the specific instruction for both paws to be on the wheel. While FLUX.1 Kontext [max] has slightly better fur rendering and a more accurate hat color, it fails to capture the interior environment as effectively as Nano Bana 2 Lite.
The Halloween Invitation
Text-to-Image“Vintage gothic Halloween party invitation. Dark parchment poster, spooky border with webs and thorns, central glowing jack-o-lantern, bats, twisted trees, moody night sky. Add elegant gothic title text saying "Halloween Party Invitation", a small scroll banner saying "You are invited to a night of frights", and event details at the bottom: Date: 30.10.2026 Time: 7pm Location: The Arches, NYC Spooky but polished, cinematic lighting, square format.”
AI Judge Analysis
FLUX.1 Kontext [max]
- + Excellent typography rendering with clean, readable fonts
- + Accurate adherence to all provided event details
- + Moody, cinematic lighting with a focused central subject
- − The scrolling banner is fragmented and positioned awkwardly
- − Redundant text repetition in the location field
- − The border is a bit simplistic compared to the requested 'thorns' detail
Nano Banana 2 Lite
- + Beautifully detailed gothic illustration with skulls, webs, and haunted house elements
- + Perfectly executed scroll banner with curved text
- + Rich, vintage parchment texture and intricate border
- − Slightly less 'cinematic' and more illustrative/cartoonish in style
- − Small artifacts in the background details (tiny extra bats/limbs)
Verdict: Nano Bana 2 Lite provided a much more complete and artistic interpretation of the 'vintage gothic' prompt, featuring superior integration of the scroll banner and more intricate decorative elements. While FLUX.1 Kontext [max] had cleaner typography, it suffered from text repetition and a less engaging background composition.
Bald man challenge
Image Editing“Give the person a full, thick head of natural hair with realistic texture, density, and a natural hairline. Preserve facial features and lighting.”
AI Judge Analysis
FLUX.1 Kontext [max]
- + Adds a thick, dense head of hair as requested.
- + Matches the hair color well to the existing beard.
- − Significantly alters the person's face, making him look noticeably different from the source.
- − The hair texture looks slightly artificial and stiff compared to the lighting of the scene.
- − The glasses frames and facial structure were altered.
Nano Banana 2 Lite
- + Excellent source preservation, keeping the face and features identical to the original.
- + The hair texture and style appear very natural and fit the wind/environment perfectly.
- + Maintains the exact lighting and skin details of the source image.
- − The hairline transition on the forehead is slightly soft.
Verdict: Nano Bana 2 Lite is the clear winner because it successfully adds a realistic head of hair while preserving the identity and features of the man in the source image. FLUX.1 Kontext [max] fails the core task of identity preservation, essentially generating a new person who looks vaguely similar to the original.
Isometric Miniature Diorama Scenes
Text-to-Image“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”
AI Judge Analysis
FLUX.1 Kontext [max]
- + Excellent typography with a playful, rounded aesthetic that matches the 3D style.
- + Outstanding lighting and subsurface scattering on the sushi textures.
- + Perfect adherence to the 'soft refined textures' and 'miniature 3D cartoon' style.
- − Failed to include the requested flag icon.
- − The raised diorama base is quite simple compared to the object detail.
Nano Banana 2 Lite
- + Successfully included all prompt elements, including the flag icon.
- + Greater variety of sushi types on the plate.
- + Very clean, professional-looking diorama base.
- − The 'JAPAN' text is slightly off-center relative to the sushi below it.
- − The text style is a bit plain compared to the stylized cartoon prompt.
- − The sushi textures are slightly less 'cartoon-miniature' and feel more photorealistic.
Verdict: FLUX.1 Kontext [max] produced a more visually cohesive 3D cartoon aesthetic with superior lighting and playful typography, though it missed the flag icon. Nano Bana 2 Lite followed the prompt instructions more literally by including the flag and more sushi variety, but the text alignment was slightly off. FLUX.1 Kontext [max] is preferred for its higher artistic quality and better rendering of the requested soft textures.
Over-the-top cartoon caricature
Editing“Create a caricature of me and my job. Make it exaggerated and humorous, incorporating my profession as a tv show anchor and my love for dogs and hockey.”
AI Judge Analysis
FLUX.1 Kontext [max]
- + Successfully incorporates all elements (TV news, dog, hockey stick).
- + Preserves the subject's outfit (denim shirt over black top) perfectly.
- + Clean, modern cartoon aesthetic with clear resolution.
- − The caricature style is a generic 'Bitmoji' style rather than an exaggerated caricature of the specific person.
- − The hockey stick is awkwardly cropped and lacks detail.
- − The dog is a generic cartoon and lacks interaction with the scene.
Nano Banana 2 Lite
- + Excellent 'traditional' caricature style with exaggerated features that still resemble the original subject.
- + Very creative merging of themes, such as 'Ruff Shift' pun and the microphone-bone-hockey stick hybrid.
- + Densely packed with details that tell a cohesive story, including the studio lights and TV screen.
- − Text on some papers is slightly garbled ('HATTNET').
- − The composition is quite busy, which may be overwhelming for a simple profile highlight.
Verdict: Nano Bana 2 Lite is the clear winner for its superior creativity and adherence to the spirit of a 'caricature'. It masterfully blends the requested themes into clever jokes, such as a dog playing hockey on the news monitor, while FLUX.1 Kontext [max] simply places the items nearby in a static, generic cartoon style. Nano Bana 2 Lite also does a much better job of capturing and exaggerating the subject's distinct facial features rather than applying a template face.
Adorable Baby Animals in Sunny Meadow
Text-to-Image“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”
AI Judge Analysis
FLUX.1 Kontext [max]
- + Excellent fur texture rendering
- + Correct lighting and god ray application
- + High visual clarity and soft, dreamy aesthetic
- − The animals are sitting still rather than 'playfully chasing and tumbling'
- − The bunny has slightly stylized, human-like eyes compared to the others
Nano Banana 2 Lite
- + Perfectly captures the 'tumbling and chasing' action requested in the prompt
- + Diverse and detailed wildflower variety
- + Captures all four requested animals with dynamic poses
- − The fox's anatomy is slightly awkward where it interacts with the dog
- − The kitten has an extra-long tail that appears to merge into the dog's fur
Verdict: Nano Bana 2 Lite followed the specific action of the prompt much better, showing the animals actively tumbling and playing rather than just sitting together. While FLUX.1 Kontext [max] has slightly cleaner fur textures and a more polished 'masterpiece' glow, it failed to capture the dynamic movement requested.
Studio Ghibli Anime Style
Editing“Transform this photo into a Studio Ghibli–inspired illustration. Use soft pastel colors, hand-painted textures, gentle lighting, dreamy backgrounds, and a warm, nostalgic mood”
AI Judge Analysis
FLUX.1 Kontext [max]
- + Expertly captures the Studio Ghibli cel-shaded aesthetic.
- + Successfully adapts the characters' facial expressions into 2D anime style while maintaining their identity.
- + Preserves the iconic composition and colors of the original meme.
- − The hand-painted texture is a bit flat compared to Ghibli watercolor backgrounds.
Nano Banana 2 Lite
- + Excellent hand-painted watercolor textures in the background.
- + Strong adherence to the requested 'dreamy' and 'nostalgic' mood with warm lighting.
- + Adds charming Ghibli-esque details like cats and flowers in the background.
- − Faces are rendered in a semi-realistic way that clashes with the Ghibli style requirement.
- − The foreground woman's face is overly blurry, losing too much detail from the source.
Verdict: FLUX.1 Kontext [max] is the winner for its superior stylistic transformation, successfully turning the photographic meme into a convincing Studio Ghibli anime cel. Nano Bana 2 Lite creates a more beautiful, atmospheric background, but its failure to stylize the faces into the characteristic 2D anime look results in a 'filtered photo' appearance rather than an illustration.
Golden Hour Stroll
Image Editing“Add dynamic motion to this photo: make hair blow in the wind, add leaves flying, energetic and lively feel.”
AI Judge Analysis
FLUX.1 Kontext [max]
- + Maintains excellent facial consistency with the source image.
- + Subtle and naturalistic hair movement that aligns with a light breeze.
- + Preserves the overall lighting and texture of the original scene.
- − The 'flying leaves' are very sparse and look somewhat static.
- − Significant anatomical error where the subject's left arm and hand have been awkwardly repositioned to touch the dog.
Nano Banana 2 Lite
- + Stronger adherence to the 'dynamic motion' prompt with more wind-swept hair.
- + Effectively includes many flying leaves across the frame to create an energetic feel.
- + Preserves the original pose and arm position of the subject.
- − The hair has some messy artifacts and loses some of the original texture.
- − The leaves are somewhat low-resolution compared to the rest of the image.
Verdict: Nano Bana 2 Lite is the clear winner for this task as it successfully captures the 'energetic and lively' atmosphere requested through more aggressive hair movement and a visible volume of flying leaves. While FLUX.1 Kontext [max] preserves the face better, it fails the edit by introducing a major anatomical glitch with the left arm and being too conservative with the motion effects.
Vintage Cafe Logo
Text-to-Image“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”
AI Judge Analysis
FLUX.1 Kontext [max]
- + Excellent typography with proper accent on 'Caffè'.
- + Authentic hand-drawn vintage texture and shading.
- + Closer adherence to the 'banner' element request for the date.
- − The steam icon is slightly off-center compared to the cloche handle.
Nano Banana 2 Lite
- + Elegant vector-style circular composition.
- + Clean line work and balanced layout.
- − Missed the 'banner' requirement for the date, using an oval shape instead.
- − Incorrect accent over the 'E' in 'Caffè' (grave accent used instead of the standard grave found in the name).
Verdict: FLUX.1 Kontext [max] delivered a more authentic vintage feel with superior texture and better adherence to the specific 'banner' requirement. Nano Bana 2 Lite produced a very clean logo, but failed to include a banner and had a less sophisticated typographic approach.
Apollo 11: Journey to Tranquility
Text-to-Image“Create a clean, modern vector infographic poster about the Apollo 11 mission. NASA-inspired palette (navy, white, muted red, light gray). Flat-vector style, crisp lines, consistent iconography, subtle gradients only. Steps (stop at landing): 1. Launch (Saturn Vicon) 2. Earth Orbit (Earth + orbit ring icon) 3. Translunar (trajectory arc icon) 4. Lunar Orbit (Moon + orbit ring icon) 5. Descent (lunar module descending icon) 6. Landing (lunar module on the surface icon) Small supporting elements (minimal text): • Crew strip: three silhouette icons with only last names: Armstrong, Aldrin, Collins. • Landing site marker: Moon pin labeled "Tranquility" only. Layout constraints: generous margins, large readable labels, clean background with subtle stars. Vector-only, print-poster look, high resolution.”
AI Judge Analysis
FLUX.1 Kontext [max]
- + Strong typography for the main title
- + Clean vector silhouettes of the crew
- + Excellent use of negative space
- − Failed to provide all 6 specific steps of the mission
- − The rocket design is generic and does not resemble a Saturn V
- − Contains some unintelligible placeholder text beneath icons
Nano Banana 2 Lite
- + Perfectly adhered to all 6 requested steps in the correct order
- + Excellent text rendering of complex labels and dates
- + Strictly followed the NASA-inspired color palette and flat vector style
- − Icons within the circles can be slightly cluttered at smaller scales
- − Minor repetition of the landing module icon between steps 5 and 6
Verdict: Nano Bana 2 Lite performed significantly better by following the specific content requirements of the prompt, including all six mission steps and accurate iconography. FLUX.1 Kontext [max] failed to include the full sequence and used a generic rocket design rather than the requested Saturn V style.
Explore each model
The lightweight, low-cost variant of Nano Banana 2 (Gemini 3.1 Flash Image). Ultra-low-latency image generation and editing at a fixed 1K resolution, designed for high-volume interactive use cases.