Black Forest Labs' distilled 9 billion parameter image generation model with sub-second inference and multi-reference support
Settled by community votes across 1 shared challenge, with an AI judge weighing in on each.
FLUX.2 [klein] 9B
#7 of 23 in Image Editing
Not enough comparable category data
The chart appears once both models have ratings across at least three shared arena categories.
Wan 2.6
#23 of 44 in Text-to-Image
Where the votes landed
FLUX.2 [klein] 9B
0%
win rate
Ties
0%
Wan 2.6
0%
win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Chalkboard Menu
Text-to-Image“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”
AI Judge Analysis
FLUX.2 [klein] 9B
- + Excellent text rendering with no spelling errors in the main menu items.
- + Clear, high-resolution chalk texture with realistic erasure marks.
- + Very consistent cursive and print handwriting styles.
- − Simple, flat composition compared to the requested café environment.
- − Small spelling error in the bottom fine print ('fress' instead of 'fresh').
Wan 2.6
- + Stronger sense of 'cozy café' atmosphere through depth of field and warm lighting.
- + Highly realistic chalk texture with dusty residue at the bottom frame.
- + Perfect spelling throughout the entire board including fine print.
- − The perspective makes some text at the edges slightly harder to read.
- − Slightly less 'elegant' cursive in the header compared to Model A.
Verdict: Both models performed exceptionally well on this complex text rendering task. FLUX.2 [klein] 9B provides a very clean, front-facing layout with beautiful handwriting, while Wan 2.6 better captures the 'cozy café' environmental request with realistic depth, lighting, and chalk dust. Wan 2.6 is the winner for including the atmosphere requested in the prompt and maintaining perfect spelling across all text.
Explore each model
Alibaba's multimodal generation model from the Wan AI suite, supporting text-to-video, image-to-video, reference-to-video with audio, and text-to-image, in both Chinese and English