FLUX.2 [flex] vs Z-Image Turbo

Head-to-head across 6 challenges

FLUX.2 [flex]: 42.9% win rate

Ties: 14.3%

Z-Image Turbo: 42.9% win rate
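The page does not state the counts behind these headline percentages. As a rough sketch of the arithmetic, the figures match a tally of seven outcomes split 3 wins, 1 tie, 3 wins. That split is a hypothetical illustration, not data from the page, and the function name `win_rates` is likewise an assumption.

```python
def win_rates(wins_a: int, ties: int, wins_b: int) -> dict[str, float]:
    """Convert raw outcome counts into percentages rounded to one decimal."""
    total = wins_a + ties + wins_b
    if total == 0:
        raise ValueError("no outcomes to tally")
    return {
        "a": round(100 * wins_a / total, 1),
        "ties": round(100 * ties / total, 1),
        "b": round(100 * wins_b / total, 1),
    }


# Hypothetical tally: 3 wins for model A, 1 tie, 3 wins for model B.
rates = win_rates(3, 1, 3)
print(rates)  # {'a': 42.9, 'ties': 14.3, 'b': 42.9}
```

Because each share is rounded independently, the three values can sum to slightly more or less than 100% (here 100.1%).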

Challenge Results

Geometric Composition

Text-to-Image

“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”

FLUX.2 [flex]

Z-Image Turbo

FLUX.2 [flex]: 25% wins, ties: 25%, Z-Image Turbo: 50% wins

AI Judge Analysis

FLUX.2 [flex]

+ Excellent photographic quality and sharp focus
+ Accurately represents glass refraction and reflections
+ Clean, modern composition with well-balanced lighting

− The blue sphere is quite large, whereas the prompt asked for a small one

Z-Image Turbo

+ Followed the 'small blue sphere' instruction more accurately
+ Realistic texture on the red book cover
+ Included all required elements in the scene

− The background plant is very blurry and less distinct
− The glass cube has strange reflective artifacts on the side panels
− The bottom of the cube looks like a mirror rather than clear glass

Verdict: Both models followed the prompt perfectly in terms of spatial relationships and objects. FLUX.2 [flex] produced a much higher quality image with superior clarity and realistic glass physics, though the sphere was larger than requested. Z-Image Turbo followed the scale instruction for the sphere better but suffered from a muddy background and inconsistent rendering of the glass cube's base.

Candid Street Photography

Text-to-Image

“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”

FLUX.2 [flex]

Z-Image Turbo

FLUX.2 [flex]: 40% wins, ties: 20%, Z-Image Turbo: 40% wins

AI Judge Analysis

FLUX.2 [flex]

+ Excellent adherence to aesthetic prompts like 'cinematic', 'shallow depth of field', and 'reflections on wet pavement'.
+ Shows the subject actively 'repairing' the bicycle as requested.
+ Strong atmospheric lighting and convincing motion blur on background traffic.

− The structural geometry of the bicycle frame is slightly nonsensical near the pedals.
− The left foot appears to be merging into the ground/bike structure.

Z-Image Turbo

+ Realistic skin textures and clothing folds.
+ The person looks naturally Japanese and the environment feels authentic.
+ Captures the light rain effect well on the ground.

− Fails the 'repairing' prompt; the man is simply standing with or pushing the bike.
− Lacks the requested 'motion blur' from passing cars; the background car is static.
− Missing the 'shallow depth of field' and 'cinematic' lighting requested.

Verdict: FLUX.2 [flex] is the clear winner because it followed almost every specific instruction in the prompt, including motion blur, shallow depth of field, and the action of repairing the bike. Z-Image Turbo produced a high-quality, realistic image but failed to capture the requested kinetic energy and cinematic atmosphere, and it ignored the primary action of 'repairing'.

Modern Clean Menu

Text-to-Image

“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”

FLUX.2 [flex]

Z-Image Turbo

FLUX.2 [flex]: 0% wins, ties: 0%, Z-Image Turbo: 100% wins

AI Judge Analysis

FLUX.2 [flex]

+ Strictly followed the category requirements for appetizers, pizza, and mains.
+ Excellent layout balance with a clear hierarchy and clean white space.
+ Food photos are high quality and consistent with the category headers above or below them.

− Text is mostly gibberish, though the letterforms are clean.
− The placement of 'Appetizers' at the top with 'Pizza' and 'Mains' at the bottom feels slightly disconnected.

Z-Image Turbo

+ Bold, impactful typography that catches the eye immediately.
+ Generates recognizable currency symbols and numbers for pricing.
+ Vibrant food photography that fills the grid well.

− Spelling errors in large headers such as 'MANS' and 'SETIIION'.
− The layout is a bit cluttered, with food photos interrupting the flow between text sections.
− Inaccurate categorization; food photos don't always align with the nearby text sections.

Verdict: FLUX.2 [flex] produced a much more professional and realistic menu layout that adheres perfectly to the requested sections (Appetizers, Pizza, Mains). While Z-Image Turbo has bolder text, its significant spelling errors and disjointed layout make it less functional as a design template compared to the clean, minimalist execution of FLUX.2 [flex].

Isometric Miniature Diorama Scenes

Text-to-Image

“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”

FLUX.2 [flex]

Z-Image Turbo

FLUX.2 [flex]: 100% wins, ties: 0%, Z-Image Turbo: 0% wins

AI Judge Analysis

FLUX.2 [flex]

+ Text is perfectly rendered and centered as requested.
+ Correctly identifies and displays the Japanese flag icon.
+ Excellent miniature 3D aesthetic with a variety of sushi types (nigiri and maki).

− The diorama base is a bit plain, matching the background color exactly.

Z-Image Turbo

+ The 3D textures on the salmon and rice are very tactile and 'squishy' in appearance.
+ Good use of a multi-toned diorama base for better visual separation.

− Incorrectly uses the Chinese flag instead of the Japanese flag.
− The text 'SUSHI' is not centered under 'JAPAN' as requested.
− The sushi composition is very basic with only one piece of nigiri.

Verdict: FLUX.2 [flex] is the clear winner as it followed all instructions, including the correct flag and text alignment. Z-Image Turbo failed a critical cultural context check by placing a Chinese flag next to the text 'JAPAN SUSHI', and it also struggled with text centering and layout balance.

Adorable Baby Animals in Sunny Meadow

Text-to-Image

“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”

FLUX.2 [flex]

Z-Image Turbo

FLUX.2 [flex]: 100% wins, ties: 0%, Z-Image Turbo: 0% wins

AI Judge Analysis

FLUX.2 [flex]

+ Excellent adherence to the lighting requested, with atmospheric god rays and visible dew drops on the flowers.
+ Superior fur texture and detail across all four distinct animals.
+ Dynamic and balanced composition that captures the 'chasing' aspect of the prompt effectively.

− The kitten is slightly less realistic compared to the other three animals.
− The butterflies look a bit like stickers placed on top of the image.

Z-Image Turbo

+ Warm, pleasant color palette that captures the 'wholesome' vibe well.
+ Accurately includes all four requested animals in a tight, cute grouping.

− Anatomical issues, particularly the golden retriever's paw merging into the rabbit's back.
− Lower overall resolution and softer details compared to the masterpiece quality requested.
− The 'god rays' are much less defined and the dew sparkles are represented as generic white dots.

Verdict: FLUX.2 [flex] is the clear winner as it successfully rendered all four distinct animals with high fidelity and captured the atmospheric elements like god rays and dew drops much more effectively. Z-Image Turbo struggled with animal anatomy and produced a softer, less detailed image with less dynamic composition.

Vintage Cafe Logo

Text-to-Image

“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”

FLUX.2 [flex]

Z-Image Turbo

AI Judge Analysis

FLUX.2 [flex]

+ Perfect text rendering for both the name and the banner
+ Elegant minimalist vector style with clean lines
+ Excellent composition with the curved text framing the cloche

− The steam is a bit faint compared to the rest of the logo

Z-Image Turbo

+ Stronger vector presence with bold colors
+ Good use of the brown and cream color palette

− The two f's in 'Caffè' are merged awkwardly
− The steam icon looks unbalanced and disconnected
− The horizontal banner is less dynamic than the curved ribbon in the FLUX.2 [flex] logo

Verdict: FLUX.2 [flex] produced a superior logo with perfect typography and a more sophisticated composition that feels historically appropriate for the brand. Z-Image Turbo struggled with the letter spacing in the main title and created a less balanced graphic overall.

FLUX.2 [flex]

Black Forest Labs' precision image generation model, offering fine-grained creative control, reliable text rendering, and output up to 4MP


Z-Image Turbo

Tongyi-MAI's 6-billion-parameter distilled text-to-image model, optimized for speed: it achieves high-quality generation in 8 steps or fewer and supports bilingual text rendering
