FLUX.2 [max] Black Forest Labs Wan 2.7 Pro Alibaba

Settled by community votes across 3 shared challenges, with an AI judge weighing in on each.

FLUX.2 [max]

25.9 arena score

#11 of 44 in Text-to-Image

Skill signature

Not enough comparable category data

The chart appears once both models have ratings across at least three shared arena categories.

Wan 2.7 Pro

21.4 arena score

#29 of 44 in Text-to-Image

Vote tally

Where the votes landed

FLUX.2 [max]

66.7%

win rate

Ties

0.0%

Wan 2.7 Pro

33.3%

win rate

66.7% 0.0% ties 33.3%

Shared challenges 3

Challenge by challenge

The strongest take from each model on every shared challenge, with the AI judge's read.

Magic Burger Explosion: Fiery Photorealism Challenge

Text-to-Image

“Ad for 'Magic Burger'. Dynamic, exploded burger with all components (bun, patty, cheese, lettuce, tomato, sauce) suspended in mid-air. Emphasize photorealistic detail and a sense of motion. Dark, fiery background with glowing embers. Integrate text: 'MAGIC BURGER' as a prominent title, 'LIMITED TIME ONLY' as a secondary message, and '€6.99' in a starburst, all rendered with a fiery, glowing effect.”

FLUX.2 [max]

Wan 2.7 Pro

100% wins 0% ties 0% wins

AI Judge Analysis

FLUX.2 [max]

+ Excellent photorealistic texture on the bun and patty
+ Clean and highly legible graphic design layout
+ Smooth, natural integration of the floating sauce

− The 'exploded' effect is less dynamic than the competitor
− Misses the fiery/glowing effect on the price text

Wan 2.7 Pro

+ Highly dynamic 'deconstructed' composition with great motion
+ Incorporates more varied ingredients like onions and cucumbers
+ Closely follows the atmospheric prompt with fiery effects on all text elements

− The cheese has a slightly plastic or artificial appearance
− The bottom bun looks a bit flat and less detailed than the top
− The lettuce leaf looks like a single clip-art element rather than part of a cohesive burger

Verdict: FLUX.2 [max] produces a more professional and realistic food advertisement with superior textures, but it is more conservative with the 'exploded' motion. Wan 2.7 Pro better captures the energy and specific atmospheric requirements of the prompt, including the fiery text effects and a more dramatic deconstruction, making it the more visually exciting choice despite slightly lower realism in the food textures.

Pose & Character Mashup

Editing

Edit instruction

“Use Image 1 as the exact pose reference and Image 2 as the character reference. Recreate the person/character from Image 2 in the exact dynamic pose and body position from Image 1. Keep the exact face, hair, clothing style/details, and expression from Image 2. Match the lighting and environment of Image 1. The final image must show the character from Image 2 performing the precise action/pose from Image 1 with perfect anatomy and natural integration.”

Source

FLUX.2 [max]

Wan 2.7 Pro

AI Judge Analysis

FLUX.2 [max]

+ Excellent character replication, including the specific face, sunglasses, scarf, and clothing details from Image 2.
+ Accurately places the character in the lighting and environment of Image 1.
+ Successfully translates the complex balance and limb positioning of the pose from Image 1.

− The feet and toes show some structural AI artifacts.
− The text on the shirt is partially garbled compared to the source.

Wan 2.7 Pro

+ Perfect preservation of the original Image 1 background and colors.

− Completely failed the edit instruction to change the character.
− The output is identical to Image 1 with no elements of Image 2 integrated.
− Did not perform any character replacement.

Verdict: FLUX.2 [max] successfully performed the complex task of character replacement while maintaining pose and environment, accurately importing the person, sunglasses, scarf, and black clothing from Image 2 into the dynamic pose of Image 1. Wan 2.7 Pro failed entirely, returning the original Image 1 without any modifications. FLUX.2 [max] is the clear winner for following all multi-step instructions.

The Capybara Taxi Driver

Text-to-Image

“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”

FLUX.2 [max]

Wan 2.7 Pro

0% wins 0% ties 100% wins

AI Judge Analysis

FLUX.2 [max]

+ Excellent photorealistic lighting on the capybara's fur and the interior dashboard.
+ Composition clearly places the passenger in the back seat as requested.

− The capybara has realistic human hands instead of capybara paws.
− The passenger is looking at her phone but appears slightly out of focus compared to the driver.

Wan 2.7 Pro

+ Successfully renders animal-like paws on the steering wheel.
+ The passenger has a perfect bored expression that fits the prompt's narrative.

− The passenger is sitting in the front passenger seat instead of the back seat.
− The transition between the capybara's head and the jacket looks less natural than in the other model.

Verdict: FLUX.2 [max] creates a more cinematic and realistic lighting environment with a correct spatial arrangement of the passenger in the back seat, though it fails on the 'paws' requirement by giving the animal human hands. Wan 2.7 Pro correctly interprets the anatomy of the capybara's paws and the bored expression of the passenger, but fails the foundational compositional instruction of placing the passenger in the back seat. FLUX.2 [max] is the winner for better overall image coherence and adherence to the interior layout of a taxi.

Next steps

Explore each model

FLUX.2 [max]

Black Forest Labs

Black Forest Labs' flagship image generation model delivering state-of-the-art quality with exceptional realism, precision, and consistency for both text-to-image and advanced image editing

Vote this model in the arena

Arena profile Lumenfall catalog

Wan 2.7 Pro

Alibaba

Alibaba's Wan 2.7 Pro image generation and editing model with higher-quality outputs and support for 4K image generation

Vote this model in the arena

Arena profile Lumenfall catalog