Black Forest Labs' 12-billion parameter multimodal flow transformer for in-context image generation and editing with character consistency, typography handling, and commercial-ready quality
Settled by community votes across 17 shared challenges, with an AI judge weighing in on each.
FLUX.1 Kontext [pro]
#40 of 48 in Text-to-Image
Nano Banana 2 Lite
#27 of 48 in Text-to-Image
Where the votes landed
FLUX.1 Kontext [pro]
100.0%
win rate
Ties
0.0%
Nano Banana 2 Lite
0.0%
win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Geometric Composition
Text-to-Image: “A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”
AI Judge Analysis
FLUX.1 Kontext [pro]
- + Excellent photographic realism and lighting
- + Perfect adherence to spatial positioning requested in the prompt
- + Clean, high-resolution rendering of the glass and wooden textures
- − The sphere appears to have a fuzzy or carpet-like texture rather than being a simple 'small blue sphere'
Nano Banana 2 Lite
- + Successfully renders legible text on the book spine related to the prompt
- + Accurate depiction of a smooth, solid blue sphere
- + Good implementation of refraction within the glass cube
- − The glass cube has strange internal reflections and distortions that look physically improbable
- − The sphere appears to be levitating in the center rather than resting inside
Verdict: FLUX.1 Kontext [pro] produced a much cleaner and more aesthetically pleasing image with superior lighting and composition. While Nano Banana 2 Lite creatively added text to the red book and captured the sphere's texture better, its rendering of the glass cube was less coherent than the polished result from FLUX.1 Kontext [pro].
Candid Street Photography
Text-to-Image: “A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”
AI Judge Analysis
FLUX.1 Kontext [pro]
- + Excellent skin texture and facial realism
- + Beautiful atmospheric light rain effect and shallow depth of field
- + Clean, high-quality image generation
- − Man appears to be riding or holding the bicycle rather than repairing it
- − Missing motion blur from passing cars as requested in the prompt
Nano Banana 2 Lite
- + Accurately depicts the act of 'repairing' with tools
- + Excellent street environment with Japanese signage and umbrellas
- + Reflections on wet pavement are highly realistic and localized
- − Anatomy issues with hands and fingers (especially holding the wrench)
- − The red bicycle is slightly mangled in its structural geometry
Verdict: Nano Banana 2 Lite followed the prompt much more accurately by showing the man actually repairing the bike with tools in a busy street setting, whereas FLUX.1 Kontext [pro] produced a higher quality portrait that failed to capture the 'repair' action or the requested motion blur. Despite some anatomical flaws in the hands, Nano Banana 2 Lite captures the intended storytelling and environment much better.
Fantasy Warrior
Text-to-Image: “Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”
AI Judge Analysis
FLUX.1 Kontext [pro]
- + Excellent adherence to the 'engraved plate armor' prompt with beautiful floral patterns.
- + Very high-quality facial skin texture and realistic, lifelike eyes.
- + Superior warm torchlight lighting effect on the metal surfaces.
- − Failed to include beads within the hair braids as requested.
- − The battle-worn elements (scars and dirt) are very subtle compared to the prompt requirements.
Nano Banana 2 Lite
- + Strong adherence to technical prompt details like 'braided with small beads' and 'dirt on the skin'.
- + Excellent metal textures showing significant wear, scratches, and battle damage.
- + Great use of bokeh sparks and depth of field to create an atmospheric background.
- − The image has strange vertical white bars on the sides, indicating a generation/aspect ratio error.
- − The skin texture around the eyes appears overly aged or strained compared to the lifelike requirement.
Verdict: FLUX.1 Kontext [pro] produces a more aesthetically pleasing and high-fidelity image with superior lighting and armor engravings, though it missed the specific detail of the beads. Nano Banana 2 Lite followed the prompt instructions more closely regarding the hair accessories and battle-worn features, but its overall quality is hampered by the vertical white bars on the edges of the frame.
Modern Clean Menu
Text-to-Image: “Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”
AI Judge Analysis
FLUX.1 Kontext [pro]
- + Excellent font clarity and bold sans-serif headers.
- + Clean, professional layout on a pure white background.
- + High-quality food photography that fits the frames well.
- − Text consists of gibberish words.
- − The food items under 'Mains' and 'Appetizers' don't logically match the categories (e.g., pizza shown in both).
- − Layout is sparse with a low count of menu items.
Nano Banana 2 Lite
- + Successfully follows the 'grid' requirement with multiple images per section.
- + Impressive text legibility with mostly coherent English dish names and descriptions.
- + Complete design including a logo, address, and website footer.
- − The grid alignment is slightly cluttered compared to a true minimalist aesthetic.
- − The wooden background table is part of the generation, whereas the prompt asked for a white background for the design itself.
Verdict: Nano Banana 2 Lite is the clear winner here as it produced a fully functional menu design with coherent English text and logically grouped food items. While FLUX.1 Kontext [pro] captures the 'minimalist' aesthetic well with high-quality headers, its failure to use real words and its lack of a proper photo grid make it less useful as a design template.
Magic Burger Explosion: Fiery Photorealism Challenge
Text-to-Image: “Ad for 'Magic Burger'. Dynamic, exploded burger with all components (bun, patty, cheese, lettuce, tomato, sauce) suspended in mid-air. Emphasize photorealistic detail and a sense of motion. Dark, fiery background with glowing embers. Integrate text: 'MAGIC BURGER' as a prominent title, 'LIMITED TIME ONLY' as a secondary message, and '€6.99' in a starburst, all rendered with a fiery, glowing effect.”
AI Judge Analysis
FLUX.1 Kontext [pro]
- + Excellent text legibility and clean graphic design
- + High photorealistic detail on the burger patty and cheese textures
- + Vibrant, glowing neon-like effects on the typography
- − Lacks the requested 'exploded' look as the ingredients are mostly stacked
- − Duplicates the price text, with the second one appearing awkwardly in the foreground flames
- − Missing the starburst element for the price
Nano Banana 2 Lite
- + Perfectly captures the 'exploded' view with widely suspended components
- + Includes the starburst element for the price as requested
- + Strong sense of motion and dynamic composition
- − Text rendering is slightly less crisp and refined than FLUX.1 Kontext [pro]'s
- − The 'starburst' is rendered as a literal fiery sun/star shape which feels less like an ad element
Verdict: Nano Banana 2 Lite is the superior choice for this prompt as it accurately depicts the 'exploded' burger request with all components suspended, whereas FLUX.1 Kontext [pro] simply generated a slightly separated stack. Although FLUX.1 Kontext [pro] has cleaner typography, it failed to include the starburst and duplicated the price inappropriately.
Chalkboard Menu
Text-to-Image: “Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”
AI Judge Analysis
FLUX.1 Kontext [pro]
- + Excellent chalk texture throughout all text
- + Highly accurate spelling of the required menu items
- + More realistic variations in letter size and slant, consistent with handwriting
- − The 'Today's Specials' title is not in cursive as requested
- − Minor spelling artifacts in the bottom footer text
- − Tight framing on the board lacks environmental context
Nano Banana 2 Lite
- + Great composition showing the board in a café environment
- + Includes elegant cursive and decorative flourishes for the header
- + Perfect spelling on all requested menu items
- − The text at the bottom appears as a digital font rather than chalk
- − Text is too uniform in size and spacing to look authentically handwritten
- − Chalk texture is slightly less grainy than FLUX.1 Kontext [pro]'s
Verdict: FLUX.1 Kontext [pro] captures the authentic look of chalk on a board much better with realistic textures and handwriting variations, although it missed the cursive requirement for the title. Nano Banana 2 Lite followed the styling prompt for the title better and provided a nice background, but failed the 'no digital fonts' requirement at the bottom of the board. FLUX.1 Kontext [pro] is more successful because the main body of the text feels truly handwritten.
The Reversed Rodeo
Text-to-Image: “Horse riding astronaut in space — horse on top, not vice versa. Surreal, highly detailed, cinematic.”
AI Judge Analysis
FLUX.1 Kontext [pro]
- + Perfectly follows the specific role-reversal instruction of the horse riding the astronaut.
- + High realism in texture, lighting, and cinematic composition.
- + Maintains a coherent surreal aesthetic with logical lighting coming from the planet below.
- − The astronaut's feet are depicted as horse hooves, which adds a third layer of surrealism not explicitly requested.
- − Small anatomical merging where the small rider's legs blend into the horse.
Nano Banana 2 Lite
- + Energetic and vibrant background with impressive galactic details.
- + Includes creative sci-fi elements like the horse's breathing apparatus.
- − Failed the primary prompt instruction; the horse is behind/above the astronaut but not 'riding' him.
- − The astronaut's pose is awkward as he is sitting on a floating platform rather than being ridden.
Verdict: FLUX.1 Kontext [pro] successfully interpreted the difficult 'horse on top' spatial instruction, creating a truly surreal image of a horse riding an astronaut. Nano Banana 2 Lite ignored the specific role-reversal prompt and instead placed a horse and astronaut side-by-side in a more conventional space scene.
The Capybara Taxi Driver
Text-to-Image: “Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”
AI Judge Analysis
FLUX.1 Kontext [pro]
- + Features a yellow taxi hat as specifically requested in the prompt.
- + Excellent fur texture and lighting on the capybara.
- + The capybara's expression is very calm and professional.
- − The passenger is on the phone instead of looking at it, and her expression seems distressed rather than bored.
- − The framing is very tight, cutting off much of the requested interior scene.
Nano Banana 2 Lite
- + Superior composition showing the full interior and both characters clearly.
- + Perfect adherence to the passenger detail, including the phone and bored expression.
- + Includes realistic taxi details like the fare meter and GPS screen.
- − The hat is a dark police-style cap instead of the requested yellow taxi cap.
- − The interior of the car looks somewhat worn and dirty rather than just 'realistic'.
Verdict: Nano Banana 2 Lite is the preferred image because it successfully captures the entire scene and the requested psychological dynamic between the characters, despite missing the specific color of the hat. FLUX.1 Kontext [pro] has better texture rendering but gets the passenger's behavior wrong and uses a claustrophobic composition that hides most of the environment.
The Halloween Invitation
Text-to-Image: “Vintage gothic Halloween party invitation. Dark parchment poster, spooky border with webs and thorns, central glowing jack-o-lantern, bats, twisted trees, moody night sky. Add elegant gothic title text saying "Halloween Party Invitation", a small scroll banner saying "You are invited to a night of frights", and event details at the bottom: Date: 30.10.2026 Time: 7pm Location: The Arches, NYC Spooky but polished, cinematic lighting, square format.”
AI Judge Analysis
FLUX.1 Kontext [pro]
- + Excellent typography rendering for the main title and most of the event details.
- + Atmospheric lighting on the jack-o-lantern and the ground textures.
- + Includes a complex thorny/widow-web border as requested.
- − Hallucinated text: added a line, 'Your: Vorkleat: light & Spans', that was not requested.
- − The 'dark parchment' texture is mostly lost to a solid black background.
Nano Banana 2 Lite
- + Perfect adherence to all text elements without any hallucinations or spelling errors.
- + Superior gothic composition with a much richer 'dark parchment' and vintage border feel.
- + Excellent background detail including a haunted house and graveyard that enhances the theme.
- − None notable; the image follows all aspects of the prompt with high visual fidelity.
Verdict: Nano Banana 2 Lite is the clear winner as it followed all prompt instructions perfectly, including the specific text details without adding hallucinations. While FLUX.1 Kontext [pro] produced a nice image, it struggled with text accuracy and omitted the vintage parchment aesthetic requested in the background.
Bald man challenge
Image Editing: “Give the person a full, thick head of natural hair with realistic texture, density, and a natural hairline. Preserve facial features and lighting.”
AI Judge Analysis
FLUX.1 Kontext [pro]
- + Successfully added a thick head of hair with high density
- + Maintained the overall lighting and color palette of the original scene
- − Significantly altered the facial features and the shape of the glasses
- − The hair texture looks slightly stylized or overly groomed compared to the rugged beard
- − Created a visible seam/mismatch where the new hair meets the original sideburns
Nano Banana 2 Lite
- + Excellent source preservation, keeping the facial features and original glasses identical
- + The hair texture looks natural and matches the rugged aesthetic of the beard
- + Seamlessly blended the new hairline with the existing sideburns and skin
- − The hair density is slightly less 'full' than FLUX.1 Kontext [pro]'s, though still realistic
Verdict: Nano Banana 2 Lite is the clear winner because it successfully applied the edit while preserving the person's identity and original facial details perfectly. FLUX.1 Kontext [pro] failed the preservation aspect by changing the person's bone structure, eyes, and eyewear, making it look like a different individual.
Isometric Miniature Diorama Scenes
Text-to-Image: “Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”
AI Judge Analysis
FLUX.1 Kontext [pro]
- + Excellent 3D cartoon art style with soft, rounded textures.
- + Clean, bold typography that integrates well with the aesthetic.
- + Perfect centering and adherence to the single-plate request.
- − The sushi rice grains are rendered as distinct spheres, looking slightly like bubbles.
- − The flag icon is stylized but less recognizable as a standard flag.
Nano Banana 2 Lite
- + Provides a more diverse variety of sushi items.
- + Includes realistic details like chopsticks and soy sauce dishes.
- + The flag icon is clear and correctly positioned.
- − The base diorama is slightly cropped at the bottom.
- − Some textures on the wooden base and sushi look flatter compared to FLUX.1 Kontext [pro].
Verdict: FLUX.1 Kontext [pro] captures the '3D cartoon' and 'soft refined texture' portion of the prompt perfectly, creating a very cohesive and professional-looking icon. Nano Banana 2 Lite provides a better variety of sushi and accessories, but it falls slightly short on the composition by clipping the bottom of the base. FLUX.1 Kontext [pro] is the winner for its superior polish and stylistic consistency.
Over-the-top cartoon caricature
Editing: “Create a caricature of me and my job. Make it exaggerated and humorous, incorporating my profession as a tv show anchor and my love for dogs and hockey.”
AI Judge Analysis
FLUX.1 Kontext [pro]
- + Excellent preservation of the subject's clothing and basic features
- + Clean illustration style
- − Completely ignored all content instructions regarding dogs, hockey, and the TV anchor profession
- − Simply converted the image to a generic cartoon avatar rather than a caricature
Nano Banana 2 Lite
- + Successfully incorporated all prompt elements: TV anchor, dogs, and hockey
- + Great caricature style with expressive features and humorous details
- + Intelligent puns in the text like 'Ruff Shift' and 'Dog News Network'
- − Lower fidelity in facial resemblance compared to the source image
- − Some minor hand anatomy issues typical of complex AI generations
Verdict: FLUX.1 Kontext [pro] failed the prompt entirely, providing a stylized version of the original photo but ignoring the requested elements of dogs, hockey, and profession. Nano Banana 2 Lite followed the instructions perfectly, creating a chaotic and humorous caricature that integrates every specific detail requested in a creative way, despite losing some of the subject's exact facial likeness.
Adorable Baby Animals in Sunny Meadow
Text-to-Image: “Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”
AI Judge Analysis
FLUX.1 Kontext [pro]
- + Excellent soft lighting with a convincing golden hour glow
- + Well-defined cute expressions that match the 'wholesome' vibe
- − Failed to include a rabbit, instead showing two kittens
- − The animals are sitting still rather than 'playfully chasing' and 'tumbling' as requested
- − Anatomical oddity on the central kitten's forehead/ear area
Nano Banana 2 Lite
- + Perfect adherence to all requested species: puppy, kitten, bunny, and fox
- + Dynamic and complex composition that captures the 'tumbling' and 'chasing' action
- + Highly detailed environment with visible dew sparkles and varied wildflowers
- − The fox's front paw looks slightly distorted where it touches the puppy
Verdict: Nano Banana 2 Lite is the clear winner for its superior prompt adherence, successfully including all four specific animals and capturing the requested action of tumbling together. FLUX.1 Kontext [pro] failed to include the rabbit and produced a static portrait rather than an action scene.
Studio Ghibli Anime Style
Editing: “Transform this photo into a Studio Ghibli–inspired illustration. Use soft pastel colors, hand-painted textures, gentle lighting, dreamy backgrounds, and a warm, nostalgic mood”
AI Judge Analysis
FLUX.1 Kontext [pro]
- + Successfully translates the real subjects into localized, cel-shaded anime characters.
- + Captures the iconic Studio Ghibli watercolor background and line work perfectly.
- + Maintains the distinct facial expressions and dynamic poses from the original meme.
- − The woman in red has a slightly simplified face compared to her source counterpart.
Nano Banana 2 Lite
- + High fidelity to the original source faces and clothing textures.
- + Beautiful dreamy lighting and soft pastel color palette.
- − Fails to transform the characters into an illustration, appearing more like a filtered photo.
- − Does not capture the Ghibli art style, opting for a generic digital painting look.
Verdict: FLUX.1 Kontext [pro] is the clear winner as it successfully reimagined the scene in the specific Studio Ghibli art style requested, while Nano Banana 2 Lite merely applied a soft filter over the original high-resolution faces. FLUX.1 Kontext [pro] balanced character transformation with source preservation, whereas Nano Banana 2 Lite failed the core 'illustration' instruction.
Golden Hour Stroll
Image Editing: “Add dynamic motion to this photo: make hair blow in the wind, add leaves flying, energetic and lively feel.”
AI Judge Analysis
FLUX.1 Kontext [pro]
- + Excellent preservation of the woman's face and identity from the original.
- + Subtle and realistic hair movement that respects physics.
- + Cleaner overall image with fewer artifacts.
- − The leaf additions are very sparse and look like an afterthought.
- − The 'energetic' feel is less pronounced than requested.
Nano Banana 2 Lite
- + Strong adherence to the movement request with dramatic hair and clothing wind effects.
- + Includes many flying leaves across the frame to create a sense of environment.
- + Successfully adds motion to the dog's fur as well.
- − The woman's face has been significantly altered, losing the likeness of the original.
- − Many leaves appear as blurry brown smudges that lack detail.
- − Some artifacts around the hair and the edges of the denim jacket.
Verdict: Nano Banana 2 Lite followed the motion-related instructions much more thoroughly, adding significant wind effects to the hair, jacket, and dog, as well as many flying leaves. However, it failed at source preservation by changing the woman's facial features. FLUX.1 Kontext [pro] preserved the original subject perfectly but provided a very minimal edit that barely captured the requested energetic feel.
Vintage Cafe Logo
Text-to-Image: “Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”
AI Judge Analysis
FLUX.1 Kontext [pro]
- + Excellent typography with correct Italian accent marks.
- + Strong texture that authentically mimics a vintage print.
- + Clean, minimalist composition that adheres perfectly to the vector emblem style.
- − The steam element is a bit thick and less 'ethereal' than traditional steam representations.
Nano Banana 2 Lite
- + Elegant decorative flourishes enhance the vintage restaurant aesthetic.
- + Balanced circular composition creates a cohesive brand mark.
- + Accurate text rendering and correct placement of the banner.
- − The accent on 'CAFFÈ' is a grave accent instead of the correct acute accent usually associated with the logo, though it is still legible.
- − The vector lines are a bit thin and lack the requested 'subtle texture' compared to FLUX.1 Kontext [pro].
Verdict: Both models followed the prompt exceptionally well, producing high-quality professional logos. FLUX.1 Kontext [pro] is preferred for its superior texture and bold, clean typography that feels more like a finished minimalist brand asset, whereas Nano Banana 2 Lite offers a slightly more ornate but less textured design.
Apollo 11: Journey to Tranquility
Text-to-Image: “Create a clean, modern vector infographic poster about the Apollo 11 mission. NASA-inspired palette (navy, white, muted red, light gray). Flat-vector style, crisp lines, consistent iconography, subtle gradients only. Steps (stop at landing): 1. Launch (Saturn V icon) 2. Earth Orbit (Earth + orbit ring icon) 3. Translunar (trajectory arc icon) 4. Lunar Orbit (Moon + orbit ring icon) 5. Descent (lunar module descending icon) 6. Landing (lunar module on the surface icon) Small supporting elements (minimal text): • Crew strip: three silhouette icons with only last names: Armstrong, Aldrin, Collins. • Landing site marker: Moon pin labeled "Tranquility" only. Layout constraints: generous margins, large readable labels, clean background with subtle stars. Vector-only, print-poster look, high resolution.”
AI Judge Analysis
FLUX.1 Kontext [pro]
- + Strong aesthetic appeal with a deep navy background.
- + Consistent vector art style across all elements.
- − Poor information architecture; labels are scattered and confusingly placed.
- − Scientific inaccuracies including Saturn's rings around the Moon and a nonsensical trajectory map.
- − Failed to follow the requested 6-step numbered sequence.
Nano Banana 2 Lite
- + Excellent adherence to the 'infographic' prompt with a logical 1-6 step flow.
- + Clean, professional layout that clearly represents the mission phases.
- + High-quality text rendering and accurate icon-to-label matching.
- − The red color used is slightly brighter than the requested 'muted red'.
- − Layout feels a bit standard/templated compared to more artistic interpretations.
Verdict: Nano Banana 2 Lite followed the specific structural instructions of the prompt perfectly, delivering a logical 6-step infographic with accurate labeling and icons. FLUX.1 Kontext [pro] produced an artistic poster but failed the core requirement of being an educational infographic, instead creating a confusing map with scientific errors such as rings around the Moon.
Explore each model
The lightweight, low-cost variant of Nano Banana 2 (Gemini 3.1 Flash Image). Ultra-low-latency image generation and editing at a fixed 1K resolution, designed for high-volume interactive use cases.