FLUX.2 [flex] vs Z-Image Turbo

Head-to-head across 9 challenges

FLUX.2 [flex]

50.0%

win rate

Ties

15.0%

Z-Image Turbo

35.0%

win rate

50.0% 15.0% ties 35.0%

Challenge Results

Geometric Composition

Text-to-Image

“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”

FLUX.2 [flex]
Z-Image Turbo
33% wins 33% ties 33% wins

AI Judge Analysis

FLUX.2 [flex]

  • + Excellent photographic quality and sharp focus
  • + Accurately represents glass refraction and reflections
  • + Clean, modern composition with well-balanced lighting
  • The blue sphere is quite large, whereas the prompt asked for a small one

Z-Image Turbo

  • + Followed the 'small blue sphere' instruction more accurately
  • + Realistic texture on the red book cover
  • + Included all required elements in the scene
  • The background plant is very blurry and less distinct
  • The glass cube has strange reflective artifacts on the side panels
  • The bottom of the cube looks like a mirror rather than clear glass

Verdict: Both models followed the prompt perfectly in terms of spatial relationships and objects. FLUX.2 [flex] produced a much higher quality image with superior clarity and realistic glass physics, though the sphere was larger than requested. Z-Image Turbo followed the scale instruction for the sphere better but suffered from a muddy background and inconsistent rendering of the glass cube's base.

Candid Street Photography

Text-to-Image

“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”

FLUX.2 [flex]
Z-Image Turbo
50% wins 25% ties 25% wins

AI Judge Analysis

FLUX.2 [flex]

  • + Excellent adherence to aesthetic prompts like 'cinematic', 'shallow depth of field', and 'reflections on wet pavement'.
  • + Shows the subject actively 'repairing' the bicycle as requested.
  • + Strong atmospheric lighting and convincing motion blur on background traffic.
  • The structural geometry of the bicycle frame is slightly nonsensical near the pedals.
  • The left foot appears to be merging into the ground/bike structure.

Z-Image Turbo

  • + Realistic skin textures and clothing folds.
  • + The person looks naturally Japanese and the environment feels authentic.
  • + Captures the light rain effect well on the ground.
  • Fails the 'repairing' prompt; the man is simply standing with or pushing the bike.
  • Lacks the requested 'motion blur' from passing cars; the background car is static.
  • Missing the 'shallow depth of field' and 'cinematic' lighting requested.

Verdict: FLUX.2 [flex] is the clear winner as it adhered to almost every specific technical prompt, including motion blur, shallow depth of field, and the specific action of repairing the bike. Z-Image Turbo produced a high-quality, realistic image, but failed to capture the kinetic energy and cinematic atmosphere requested by the user, and ignored the primary action of 'repairing'.

Modern Clean Menu

Text-to-Image

“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”

FLUX.2 [flex]
Z-Image Turbo
0% wins 0% ties 100% wins

AI Judge Analysis

FLUX.2 [flex]

  • + Strictly followed the category requirements for appetizers, pizza, and mains.
  • + Excellent layout balance with a clear hierarchy and clean white space.
  • + Food photos are high quality and consistent with the category headers above or below them.
  • Text is mostly gibberish, though the letterforms are clean.
  • The placement of 'Appetizers' at the top with 'Pizza' and 'Mains' at the bottom feels slightly disconnected.

Z-Image Turbo

  • + Bold, impactful typography that catches the eye immediately.
  • + Generates recognizable currency symbols and numbers for pricing.
  • + Vibrant food photography that fills the grid well.
  • Spelling errors in large headers such as 'MANS' and 'SETIIION'.
  • The layout is a bit cluttered, with food photos interrupting the flow between text sections.
  • Inaccurate categorization; food photos don't always align with the nearby text sections.

Verdict: FLUX.2 [flex] produced a much more professional and realistic menu layout that adheres perfectly to the requested sections (Appetizers, Pizza, Mains). While Z-Image Turbo has bolder text, its significant spelling errors and disjointed layout make it less functional as a design template compared to the clean, minimalist execution of FLUX.2 [flex].

Isometric Miniature Diorama Scenes

Text-to-Image

“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”

FLUX.2 [flex]
Z-Image Turbo
100% wins 0% ties 0% wins

AI Judge Analysis

FLUX.2 [flex]

  • + Text is perfectly rendered and centered as requested.
  • + Correctly identifies and displays the Japanese flag icon.
  • + Excellent miniature 3D aesthetic with a variety of sushi types (nigiri and maki).
  • The diorama base is a bit plain, matching the background color exactly.

Z-Image Turbo

  • + The 3D textures on the salmon and rice are very tactile and 'squishy' in appearance.
  • + Good use of a multi-toned diorama base for better visual separation.
  • Incorrectly uses the Chinese flag instead of the Japanese flag.
  • The text 'SUSHI' is not centered under 'JAPAN' as requested.
  • The sushi composition is very basic with only one piece of nigiri.

Verdict: FLUX.2 [flex] is the clear winner as it followed all instructions, including the correct flag and text alignment. Z-Image Turbo failed a critical cultural context check by placing a Chinese flag next to the text 'JAPAN SUSHI', and it also struggled with text centering and layout balance.

Night Sky Transformation

Editing
Edit instruction

“Change the scene to night: a deep, dark sky with subtle, glistening stars visible behind the mountain.”

Before After
FLUX.2 [flex]
Before After
Z-Image Turbo
100% wins 0% ties 0% wins

AI Judge Analysis

FLUX.2 [flex]

  • + Excellent prompt adherence, changing the sky to a deep dark night with stars.
  • + High source preservation, keeping the town layout and mountain structure nearly identical.
  • + Added a convincing moonlit glow to the mountain peak that feels natural for a night scene.

Z-Image Turbo

  • + The foreground town retains good lighting details.
  • Failed the primary edit instruction by keeping a sunset/golden hour sky instead of a night sky.
  • Significantly altered the shape and texture of the main mountain peak.
  • General loss of source image fidelity in the background mountains.

Verdict: FLUX.2 [flex] successfully followed the edit instructions, transforming the sunset into a clear night scene with stars while perfectly preserving the composition and details of the source image. Z-Image Turbo failed to create a night scene, maintaining the orange sunset palette and unnecessarily distorting the shape of the mountain.

Adorable Baby Animals in Sunny Meadow

Text-to-Image

“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”

FLUX.2 [flex]
Z-Image Turbo

AI Judge Analysis

FLUX.2 [flex]

  • + Excellent adherence to the lighting requested, with atmospheric god rays and visible dew drops on the flowers.
  • + Superior fur texture and detail across all four distinct animals.
  • + Dynamic and balanced composition that captures the 'chasing' aspect of the prompt effectively.
  • The kitten is slightly less realistic compared to the other three animals.
  • The butterflies look a bit like stickers placed on top of the image.

Z-Image Turbo

  • + Warm, pleasant color palette that captures the 'wholesome' vibe well.
  • + Accurately includes all four requested animals in a tight, cute grouping.
  • Anatomical issues, particularly the golden retriever's paw merging into the rabbit's back.
  • Lower overall resolution and softer details compared to the masterpiece quality requested.
  • The 'god rays' are much less defined and the dew sparkles are represented as generic white dots.

Verdict: FLUX.2 [flex] is the clear winner as it successfully rendered all four distinct animals with high fidelity and captured the atmospheric elements like god rays and dew drops much more effectively. Z-Image Turbo struggled with animal anatomy and produced a softer, less detailed image with less dynamic composition.

Victorian Greenhouse Oasis

Text-to-Image

“Hyper-photorealistic interior of a lush Victorian glass greenhouse filled with exotic tropical plants, vibrant blooming orchids, tall ferns, colorful butterflies in flight, sunlight filtering through ornate glass roof creating realistic caustics and dew on leaves, intricate iron framework visible, misty atmosphere, 8K masterpiece.”

FLUX.2 [flex]
Z-Image Turbo
0% wins 25% ties 75% wins

AI Judge Analysis

FLUX.2 [flex]

  • + Sophisticated architectural design with highly detailed wrought iron framework.
  • + Superior rendering of dew drops on leaves and atmospheric lighting effects.
  • + Balanced composition with a clear, inviting central path.
  • Butterflies feel a bit like flat stickers placed on top of the image.
  • Symmetry is almost too perfect, feeling slightly artificial.

Z-Image Turbo

  • + More natural, lush plant growth with varied textures.
  • + Excellent mist effects that add depth to the background.
  • + Strong rendering of large-scale architectural glass curves.
  • Butterflies are inconsistently scaled and lack motion blur.
  • Some lighting transitions between the foreground and the misty background look harsh.

Verdict: FLUX.2 [flex] wins by providing a more cohesive and professional architectural render, especially with the intricate ironwork and the realistic dew on the foliage. Z-Image Turbo captures the lushness of a greenhouse well, but the placement of the butterflies feels less integrated than the more detailed environment of the FLUX output.

Heroic Super Hero Portrait

Text-to-Image

“Hyper-photorealistic full-body portrait of a female superhero standing triumphantly on a New York skyscraper rooftop at golden sunset, wearing a classic modest superhero costume with flowing cape, chest emblem, gloves, and boots in red and blue colors, practical design, short hair, strong determined heroic expression looking into the distance, powerful confident stance with hands on hips and cape billowing dramatically in the wind, detailed urban cityscape background, warm natural sunlight with sharp shadows and fabric highlights, ultra-sharp textures on suit, hair, and concrete, 8K masterpiece, empowering family-friendly style.”

FLUX.2 [flex]
Z-Image Turbo

AI Judge Analysis

FLUX.2 [flex]

  • + Excellent high-resolution detail in the urban cityscape background.
  • + High realism in the fabric textures and practical suit design.
  • + Strong composition with a sense of scale and height on the rooftop.
  • The character design looks very similar to an existing IP (Captain Marvel).

Z-Image Turbo

  • + Good adherence to the 'classic' superhero aesthetic with a skirted costume.
  • + Warm, natural lighting that perfectly matches the golden sunset prompt.
  • + Face and expression are clear and well-rendered.
  • The background cityscape is quite blurred, losing the 'detailed urban' requirement.
  • Hands and fingers at the waist have some slight anatomical distortion.
  • The Superman 'S' logo is a bit derivative for a generic prompt.

Verdict: FLUX.2 [flex] delivers a much more polished and professional 'hyper-photorealistic' result with incredible detail in the background city and texture of the suit. While Z-Image Turbo captures the classic costume vibe well, it suffers from a lack of background detail and minor issues with the hand rendering.

Vintage Cafe Logo

Text-to-Image

“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”

FLUX.2 [flex]
Z-Image Turbo

AI Judge Analysis

FLUX.2 [flex]

  • + Perfect text rendering for both the name and the banner
  • + Elegant minimalist vector style with clean lines
  • + Excellent composition with the curved text framing the cloche
  • The steam is a bit faint compared to the rest of the logo

Z-Image Turbo

  • + Stronger vector presence with bold colors
  • + Good use of the brown and cream color palette
  • The 'f f' in 'Caffè' are merged awkwardly
  • The steam icon looks unbalanced and disconnected
  • The horizontal banner is less dynamic than the curved ribbon in Model A

Verdict: FLUX.2 [flex] produced a superior logo with perfect typography and a more sophisticated composition that feels historically appropriate for the brand. Z-Image Turbo struggled with the letter spacing in the main title and created a less balanced graphic overall.

FLUX.2 [flex]

Black Forest Labs' precision image generation model with maximum control, reliable text rendering, and complete creative control supporting up to 4MP output

Z-Image Turbo

Tongyi-MAI's 6-billion parameter distilled text-to-image model optimized for speed, achieving high-quality generation in 8 steps or fewer with support for bilingual text rendering