FLUX.2 [dev] vs Qwen Image 2512
Head-to-head across 3 challenges
FLUX.2 [dev]
80.0%
win rate
Ties
0.0%
Qwen Image 2512
20.0%
win rate
Challenge Results
Geometric Composition
Text-to-Image“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”
AI Judge Analysis
FLUX.2 [dev]
- + Excellent rendering of caustic light and refractions through the glass cube.
- + The blue sphere looks like real glass/crystal, matching the overall aesthetic.
- + Very soft, natural window lighting that creates a photorealistic atmosphere.
- − The blue sphere appears to be floating unnaturally within the cube.
- − The glass cube has rounded, thick edges that make it look a bit like a construction block rather than a hollow container.
Qwen Image 2512
- + The physics are more grounded, with the sphere resting realistically on the bottom of the cube.
- + The glass cube has sharp, defined edges and clear internal reflections.
- + Excellent texture work on the book cover and the wood grain of the table.
- − The blue sphere has a matte, plastic texture that looks a bit less premium than the glass surroundings.
- − The plant in the background is slightly more obscured compared to Model A.
Verdict: Both models followed the complex spatial prompt perfectly, including the specific placement of the plant behind the glass. FLUX.2 [dev] produces a more artistic, luminous image with beautiful caustics, but the sphere is floating; Qwen Image 2512 produces a more grounded, realistic scene with better material definition for the book and table.
Candid Street Photography
Text-to-Image“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”
AI Judge Analysis
FLUX.2 [dev]
- + Excellent execution of the motion blur request for passing cars.
- + Highly realistic skin and hair textures on the subject.
- + Successfully captures the action of 'repairing' the bicycle with complex hand positioning.
- − The structural integrity of the bicycle becomes messy and nonsensical in the lower section.
- − There are some minor digital artifacts in the blurred cars that look a bit painterly.
Qwen Image 2512
- + Very coherent bicycle structure and realistic mechanical details.
- + Follows the prompt for an elderly Japanese man with natural skin textures.
- + Good use of wet pavement reflections and shallow depth of field.
- − Failed to include 'motion blur' from passing cars; the background traffic looks mostly stationary.
- − The subject is posing/looking at the camera, which contradicts the 'candid' part of the prompt.
- − The subject is just crouching by the bike rather than 'repairing' it.
Verdict: FLUX.2 [dev] followed the technical instructions for motion blur and the candid, active nature of the scene much better than Qwen Image 2512. While Qwen Image 2512 rendered a more structurally sound bicycle, FLUX.2 [dev] achieved a more convincing cinematic street photography aesthetic with genuine movement and a candid feel.
Adorable Baby Animals in Sunny Meadow
Text-to-Image“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”
AI Judge Analysis
FLUX.2 [dev]
- + Excellent depiction of god rays and sunrise lighting
- + Very natural fur texture and backlighting
- + Includes multiple bunnies as requested by the plural prompt
- − The animals are sitting still rather than 'chasing and tumbling' as requested
- − One of the butterflies has slightly distorted wings
Qwen Image 2512
- + Strong composition with a central focus
- + Clear, expressive eyes and sharp details on all four animal types
- + Vibrant colors and well-defined butterflies
- − The anatomy where the puppy's paws meet the other animals is a bit muddled
- − Lacks the 'tumbling' action requested in the prompt
- − God rays are a bit more artificial/stylized compared to Model A
Verdict: Both models followed the prompt well, but FLUX.2 [dev] produced a more photorealistic scene with superior lighting and atmospheric effects like dew sparkles and natural god rays. Qwen Image 2512 has a charming composition but suffers from slight anatomical blending where the animals overlap and lacks the atmospheric depth of FLUX.2.
FLUX.2 [dev]
Black Forest Labs' open-weights image generation model with frontier performance, available for non-commercial local deployment
Qwen Image 2512
Improved version of Alibaba's Qwen image model with better text rendering, finer natural textures, and more realistic human generation.