Black Forest Labs' flagship image generation model delivering state-of-the-art quality with exceptional realism, precision, and consistency for both text-to-image and advanced image editing
Settled by community votes across 3 shared challenges, with an AI judge weighing in on each.
FLUX.2 [max]
#11 of 44 in Text-to-Image
Not enough comparable category data
The chart appears once both models have ratings across at least three shared arena categories.
Wan 2.7 Pro
#29 of 44 in Text-to-Image
Where the votes landed
FLUX.2 [max]
66.7%
win rate
Ties
0.0%
Wan 2.7 Pro
33.3%
win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Magic Burger Explosion: Fiery Photorealism Challenge
Text-to-Image“Ad for 'Magic Burger'. Dynamic, exploded burger with all components (bun, patty, cheese, lettuce, tomato, sauce) suspended in mid-air. Emphasize photorealistic detail and a sense of motion. Dark, fiery background with glowing embers. Integrate text: 'MAGIC BURGER' as a prominent title, 'LIMITED TIME ONLY' as a secondary message, and '€6.99' in a starburst, all rendered with a fiery, glowing effect.”
AI Judge Analysis
FLUX.2 [max]
- + Excellent photorealistic texture on the bun and patty
- + Clean and highly legible graphic design layout
- + Smooth, natural integration of the floating sauce
- − The 'exploded' effect is less dynamic than the competitor
- − Misses the fiery/glowing effect on the price text
Wan 2.7 Pro
- + Highly dynamic 'deconstructed' composition with great motion
- + Incorporates more varied ingredients like onions and cucumbers
- + Closely follows the atmospheric prompt with fiery effects on all text elements
- − The cheese has a slightly plastic or artificial appearance
- − The bottom bun looks a bit flat and less detailed than the top
- − The lettuce leaf looks like a single clip-art element rather than part of a cohesive burger
Verdict: FLUX.2 [max] produces a more professional and realistic food advertisement with superior textures, but it is more conservative with the 'exploded' motion. Wan 2.7 Pro better captures the energy and specific atmospheric requirements of the prompt, including the fiery text effects and a more dramatic deconstruction, making it the more visually exciting choice despite slightly lower realism in the food textures.
Pose & Character Mashup
Editing“Use Image 1 as the exact pose reference and Image 2 as the character reference. Recreate the person/character from Image 2 in the exact dynamic pose and body position from Image 1. Keep the exact face, hair, clothing style/details, and expression from Image 2. Match the lighting and environment of Image 1. The final image must show the character from Image 2 performing the precise action/pose from Image 1 with perfect anatomy and natural integration.”
AI Judge Analysis
FLUX.2 [max]
- + Excellent character replication, including the specific face, sunglasses, scarf, and clothing details from Image 2.
- + Accurately places the character in the lighting and environment of Image 1.
- + Successfully translates the complex balance and limb positioning of the pose from Image 1.
- − The feet and toes show some structural AI artifacts.
- − The text on the shirt is partially garbled compared to the source.
Wan 2.7 Pro
- + Perfect preservation of the original Image 1 background and colors.
- − Completely failed the edit instruction to change the character.
- − The output is identical to Image 1 with no elements of Image 2 integrated.
- − Did not perform any character replacement.
Verdict: FLUX.2 [max] successfully performed the complex task of character replacement while maintaining pose and environment, accurately importing the person, sunglasses, scarf, and black clothing from Image 2 into the dynamic pose of Image 1. Wan 2.7 Pro failed entirely, returning the original Image 1 without any modifications. FLUX.2 [max] is the clear winner for following all multi-step instructions.
The Capybara Taxi Driver
Text-to-Image“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”
AI Judge Analysis
FLUX.2 [max]
- + Excellent photorealistic lighting on the capybara's fur and the interior dashboard.
- + Composition clearly places the passenger in the back seat as requested.
- − The capybara has realistic human hands instead of capybara paws.
- − The passenger is looking at her phone but appears slightly out of focus compared to the driver.
Wan 2.7 Pro
- + Successfully renders animal-like paws on the steering wheel.
- + The passenger has a perfect bored expression that fits the prompt's narrative.
- − The passenger is sitting in the front passenger seat instead of the back seat.
- − The transition between the capybara's head and the jacket looks less natural than in the other model.
Verdict: FLUX.2 [max] creates a more cinematic and realistic lighting environment with a correct spatial arrangement of the passenger in the back seat, though it fails on the 'paws' requirement by giving the animal human hands. Wan 2.7 Pro correctly interprets the anatomy of the capybara's paws and the bored expression of the passenger, but fails the foundational compositional instruction of placing the passenger in the back seat. FLUX.2 [max] is the winner for better overall image coherence and adherence to the interior layout of a taxi.
Explore each model
Alibaba's Wan 2.7 Pro image generation and editing model with higher-quality outputs and support for 4K image generation