Black Forest Labs' distilled 9-billion-parameter image generation model with sub-second inference and multi-reference support
Settled by community votes across 2 shared challenges, with an AI judge weighing in on each.
FLUX.2 [klein] 9B
#7 of 23 in Image Editing
Not enough comparable category data
The chart appears once both models have ratings across at least three shared arena categories.
Stable Diffusion 3.5 Large Turbo
#44 of 44 in Text-to-Image
Where the votes landed
FLUX.2 [klein] 9B: 100.0% win rate
Ties: 0.0%
Stable Diffusion 3.5 Large Turbo: 0.0% win rate
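The percentages above are consistent with a plain tally over the 2 shared challenges. The sketch below is a minimal illustration that assumes each challenge produces exactly one outcome for the pair. The function name `win_rates` and the labels `"a"`, `"b"`, and `"tie"` are hypothetical, and the arena's actual aggregation of community votes and AI judge input is not described on this page.

```python
from collections import Counter


def win_rates(outcomes):
    """Return the percentage share of 'a', 'b', and 'tie' outcomes.

    Assumption: one outcome per shared challenge. This is an
    illustrative tally, not the arena's documented method.
    """
    if not outcomes:
        raise ValueError("need at least one outcome")
    counts = Counter(outcomes)
    total = len(outcomes)
    # Counter returns 0 for labels that never occur, so every key is present.
    return {key: round(100.0 * counts[key] / total, 1) for key in ("a", "b", "tie")}


# Two shared challenges, both won by model "a" (FLUX.2 [klein] 9B on this page).
print(win_rates(["a", "a"]))  # {'a': 100.0, 'b': 0.0, 'tie': 0.0}
```

With only two challenges in the sample, a 100.0% share reflects a clean sweep of a very small set rather than a stable estimate.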
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Magic Burger Explosion: Fiery Photorealism Challenge
Text-to-Image: “Ad for 'Magic Burger'. Dynamic, exploded burger with all components (bun, patty, cheese, lettuce, tomato, sauce) suspended in mid-air. Emphasize photorealistic detail and a sense of motion. Dark, fiery background with glowing embers. Integrate text: 'MAGIC BURGER' as a prominent title, 'LIMITED TIME ONLY' as a secondary message, and '€6.99' in a starburst, all rendered with a fiery, glowing effect.”
AI Judge Analysis
FLUX.2 [klein] 9B
- + Excellent typography rendering with the requested fiery effect and perfect spelling.
- + Highly realistic food textures on the bun, patties, and vegetables.
- + Successfully followed all prompt instructions including the price starburst and exploded composition.
- − The 'exploded' effect is slightly conservative, as components are still largely stacked together.
Stable Diffusion 3.5 Large Turbo
- + Strong sense of heat and fire with effective use of smoke and glowing embers.
- + Dynamic lighting that creates a high-contrast, dramatic atmosphere.
- − Completely failed to include the requested text ('MAGIC BURGER', etc.).
- − The burger anatomy is messy, with unrealistic dripping black spheres and an unrequested wooden skewer.
- − The burger is not truly 'exploded' into individual suspended components.
Verdict: FLUX.2 [klein] 9B is the clear winner as it followed every instruction in the prompt, including complex text integration and specific price formatting. In contrast, Stable Diffusion 3.5 Large Turbo completely ignored the text requirements and produced a less organized image with several visual artifacts.
Chalkboard Menu
Text-to-Image: “Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”
AI Judge Analysis
FLUX.2 [klein] 9B
- + Excellent text rendering with near-perfect spelling and realistic chalk texture.
- + Authentic hand-drawn aesthetic with smudge marks and natural variations.
- + Strict adherence to the specific menu items and date requested in the prompt.
- − Minor typo in the bottom fine print where 'fresh' is spelled 'fress'.
- − The composition is a tight crop on the board rather than showing the café environment.
Stable Diffusion 3.5 Large Turbo
- + Provides a broader environmental context showing the café interior and lighting.
- + The composition is balanced and clean.
- − Significant spelling errors and gibberish text throughout the menu.
- − The lettering looks like a smooth digital font rather than realistic chalk texture.
- − Fails to follow the specific date and menu items requested in the prompt.
Verdict: FLUX.2 [klein] 9B followed the prompt instructions meticulously, delivering highly accurate text with a convincing chalk-on-blackboard texture. In contrast, Stable Diffusion 3.5 Large Turbo failed on almost every textual requirement, producing numerous typos and using a digital-looking font that lacked the requested handwritten authenticity.
Explore each model
Stable Diffusion 3.5 Large Turbo: distilled version of SD 3.5 Large that generates high-quality images in just 4 steps, offering faster inference and reduced costs