Black Forest Labs' precision image generation model with maximum control, reliable text rendering, and complete creative control supporting up to 4MP output
Settled by community votes across 5 shared challenges, with an AI judge weighing in on each.
FLUX.2 [flex]
#13 of 44 in Text-to-Image
Not enough comparable category data
The chart appears once both models have ratings across at least three shared arena categories.
Wan 2.6
#23 of 44 in Text-to-Image
Where the votes landed
FLUX.2 [flex]
42.9%
win rate
Ties
14.3%
Wan 2.6
42.9%
win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Man and Car in California
Editing“Make a photo of the man driving the car down the California coastline”
AI Judge Analysis
FLUX.2 [flex]
- + Excellent preservation of the specific white Rolls-Royce Phantom Drophead Coupé model.
- + Maintains the man's likeness and hairstyle very well.
- + Realistic motion blur on the wheels and road surface.
- − The man's scale relative to the car is slightly small.
- − The lighting on the man doesn't perfectly match the bright outdoor sun.
Wan 2.6
- + Great composition and lighting, conveying the coastal atmosphere effectively.
- + Good integration of the man's plaid coat into his driving outfit.
- + High visual quality and clarity.
- − Significant loss of car identity, changing the front end and logo on the side.
- − Altered the man's facial features and hair texture compared to the source.
- − The car's perspective looks slightly warped towards the rear.
Verdict: FLUX.2 [flex] successfully followed the editing instructions while maintaining high loyalty to both source images, specifically preserving the unique design of the Rolls-Royce and the man's face. Wan 2.6 produced a beautiful image but failed as an editor by significantly altering the car's model and the man's appearance.
Modern Clean Menu
Text-to-Image“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”
AI Judge Analysis
FLUX.2 [flex]
- + Clean, minimalist aesthetic that feels professional and usable.
- + The grid of food photos is uniform and balanced.
- + Font choices are modern and high-contrast, following the prompt well.
- − Content mismatch: the 'Pizza' section lists items like 'Picla Pide' that don't clearly relate to the pizza imagery shown in the grid above.
- − Appetizers section header is floating above the images without accompanying text list.
Wan 2.6
- + Strong use of vibrant accents and colorful geometric elements.
- + High-quality food photography with variety and appetite appeal.
- + Better distribution of text density, including a main header.
- − Text rendering is messy with many overlapping characters and nonsense symbols (e.g., '$β.95').
- − Layout is slightly cluttered compared to the minimalist request.
- − Section headers for 'Pizza' and 'Mains' overlap the grid lines awkwardly.
Verdict: FLUX.2 [flex] produced a much cleaner and more professional minimalist layout that better matches the 'modern casual dining' brief, despite some nonsensical text strings. Wan 2.6 has more vibrant imagery and creative accents, but the text rendering is significantly lower quality and the layout feels a bit cramped for a minimalist design.
Isometric Miniature Diorama Scenes
Text-to-Image“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”
AI Judge Analysis
FLUX.2 [flex]
- + Strict adherence to text layout instructions with 'JAPAN' on the top line and 'SUSHI' below.
- + Clean, professional typography that feels well-integrated into the design.
- + Smooth, buttery 3D textures that perfectly match the 'soft refined' and 'PBR' request.
- − The diorama base is very thin, bordering on just being a shadow or a flat plate.
- − The flag icon is placed below the text rather than 'at top-center' with the text.
Wan 2.6
- + Excellent execution of the 'raised diorama base' giving it a true miniature feel.
- + Higher detail on the rice grains and sushi toppings, adding to the realism of the materials.
- + Good use of a traditional wooden sushi board (geta) which fits the theme well.
- − Failed the text layout instruction by putting the flag on the second line next to 'SUSHI'.
- − The text is slightly cut off at the very top edge of the frame.
Verdict: FLUX.2 [flex] followed the specific text formatting instructions much better, creating a cleaner and more balanced graphic design. However, Wan 2.6 captured the 'raised diorama' and 'miniature' aspect of the prompt more effectively with its chunky base and detailed textures. FLUX.2 [flex] is the preferred choice for its superior text handling and more professional aesthetic.
Vintage Cafe Logo
Text-to-Image“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”
AI Judge Analysis
FLUX.2 [flex]
- + Perfect text rendering and spelling for both the main name and the date.
- + Excellent composition with a classic arched arrangement and a centered banner.
- + Clean vector style with smooth lines and subtle paper texture background.
- − The cloche illustration is very basic compared to Model B.
- − The line weight on the 'Est. 1720' text is slightly thin for a balanced logo look.
Wan 2.6
- + More sophisticated cloche illustration with nice shading and lighting.
- + Stronger vintage aesthetic with the distressed border texture.
- + Classic serif typography is well-chosen and legible.
- − Placement of the 'Est. 1720' banner is awkward, cutting into the side of the logo.
- − The banner text is cramped and less legible than Model A.
- − Minor rendering artifact on the accent mark of 'Caffè'.
Verdict: FLUX.2 [flex] produced a better-balanced logo with perfect text alignment and a very professional layout that adheres strictly to the vector emblem request. While Wan 2.6 has a more detailed illustration and stronger vintage vibe, the awkward placement of the banner makes it less successful as a cohesive logo design.
Apollo 11: Journey to Tranquility
Text-to-Image“Create a clean, modern vector infographic poster about the Apollo 11 mission. NASA-inspired palette (navy, white, muted red, light gray). Flat-vector style, crisp lines, consistent iconography, subtle gradients only. Steps (stop at landing): 1. Launch (Saturn Vicon) 2. Earth Orbit (Earth + orbit ring icon) 3. Translunar (trajectory arc icon) 4. Lunar Orbit (Moon + orbit ring icon) 5. Descent (lunar module descending icon) 6. Landing (lunar module on the surface icon) Small supporting elements (minimal text): • Crew strip: three silhouette icons with only last names: Armstrong, Aldrin, Collins. • Landing site marker: Moon pin labeled "Tranquility" only. Layout constraints: generous margins, large readable labels, clean background with subtle stars. Vector-only, print-poster look, high resolution.”
AI Judge Analysis
FLUX.2 [flex]
- + Excellent adherence to all six requested infographic steps
- + Perfect text rendering and clean vector aesthetic
- + Matches the requested NASA color palette accurately
- − The layout is missing the final 'Landing' icon requested in the prompt
- − Minor icon scale inconsistencies
Wan 2.6
- + Clean minimalist graphic design
- + Correctly identifies the mission crew names
- + Consistent vector style
- − Completely fails the primary task of creating a 6-step infographic
- − Missing the Saturn V, Earth, Moon, and trajectory icons
- − Empty composition with significant wasted space
Verdict: FLUX.2 [flex] successfully interprets the complex infographic request, providing almost all the specific icon stages with clear, readable text in a professional vector style. Wan 2.6 fails the prompt instructions entirely, producing a minimalist poster of crew members instead of the requested mission profile infographic.
Explore each model
Alibaba's multimodal generation model from the Wan AI suite, supporting text-to-video, image-to-video, reference-to-video with audio, and text-to-image, in both Chinese and English