GPT Image 1.5 vs Z-Image Turbo

Head-to-head across 10 challenges

GPT Image 1.5

81.8%

win rate

Ties

9.1%

Z-Image Turbo

9.1%

win rate

81.8% 9.1% ties 9.1%

Challenge Results

Geometric Composition

Text-to-Image

“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”

GPT Image 1.5
Z-Image Turbo
50% wins 50% ties 0% wins

AI Judge Analysis

GPT Image 1.5

  • + Excellent photographic detail on the book texture and wood grain.
  • + Strong physical logic with the green plant clearly visible through the glass panels.
  • + Accurate lighting and reflections on the glass and blue sphere.
  • The 'blue sphere' is quite large despite the prompt asking for a 'small' one.

Z-Image Turbo

  • + Follows the 'small sphere' instruction better than Model A.
  • + Clean composition with soft window light as requested.
  • Physics error where the plant in the background disappears behind the glass panes.
  • The glass cube handles reflections poorly, losing transparency in the back-right corner.

Verdict: GPT Image 1.5 is the superior image due to its consistent handling of transparency and reflections; the green plant is properly visible through the glass cube, whereas it magically vanishes in Z-Image Turbo. While Z-Image Turbo scaled the sphere more accurately to the prompt, its failure to maintain visual logic through the glass makes it less realistic.

Candid Street Photography

Text-to-Image

“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”

GPT Image 1.5
Z-Image Turbo

AI Judge Analysis

GPT Image 1.5

  • + Excellent adherence to the 'repairing' aspect of the prompt with tools and a crouched pose.
  • + Superior atmospheric lighting and realistic reflections on wet pavement.
  • + Perfect execution of the shallow depth of field and 'cinematic' look requested.
  • The bike anatomy is slightly jumbled near the rear wheel/derailleur area.

Z-Image Turbo

  • + Clear representation of light rain and natural skin texture.
  • + Good composition with a clear subject.
  • The man is standing/walking with the bike rather than repairing it.
  • The background cars lack the requested motion blur.
  • The lighting feels flat and less cinematic compared to the other image.

Verdict: GPT Image 1.5 followed the prompt much more closely, capturing the specific 'repairing' action and the cinematic atmosphere of a rainy street. While Z-Image Turbo produced a clean image, it missed the repair activity and the motion blur requirement, resulting in a more generic snapshot. GPT Image 1.5's use of light and reflections better captured the 'cinematic but realistic' tone requested.

Fantasy Warrior

Text-to-Image

“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”

GPT Image 1.5
Z-Image Turbo

AI Judge Analysis

GPT Image 1.5

  • + Exceptional texture detail on the skin, scars, and engraved metal.
  • + Stronger adherence to the 'close portrait' instruction with impactful framing.
  • + Beautiful lighting and bokeh sparks that integrate naturally with the scene.
  • The hair beads are present but somewhat large and chunky compared to a delicate braid feel.

Z-Image Turbo

  • + Successfully includes the specific bead details in the hair braids.
  • + Captures the underlayer textures like chainmail and quilted fabric well.
  • + Good use of the torch as a physical light source in the frame.
  • The composition is a medium shot rather than the requested 'close portrait'.
  • The skin texture and facial features lack the hyper-realistic clarity seen in Model A.
  • Lighting on the face is a bit flat despite the presence of the torch.

Verdict: GPT Image 1.5 produced a far more compelling and high-quality image that perfectly matches the 'close portrait' and 'lifelike' requirements of the prompt. While Z-Image Turbo followed the instructions for hair beads and leather/cloth layers well, the overall image quality and resolution in GPT Image 1.5 are superior, offering much more intricate detail in the skin and metal textures.

Modern Clean Menu

Text-to-Image

“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”

GPT Image 1.5
Z-Image Turbo
100% wins 0% ties 0% wins

AI Judge Analysis

GPT Image 1.5

  • + Perfectly legible text with correct spelling and descriptions.
  • + Excellent alignment between text sections and corresponding food imagery.
  • + Very high-quality, realistic food photography that looks professional.
  • The layout is a bit standard, leaning more towards a flyer than a modern square grid design.

Z-Image Turbo

  • + Follows the 'grid' request more literally with a structured tile layout.
  • + Bold use of color blocks and sans-serif fonts matches the 'vibrant accents' prompt.
  • Text is largely nonsensical gibberish (e.g., 'PIZZA MANS', 'SE TIIION').
  • Food photos are repetitive and lower in visual quality compared to Model A.
  • Poor hierarchy and spacing in the text columns.

Verdict: GPT Image 1.5 is the clear winner because it produces a functional, professional menu with perfectly rendered text and high-quality food photography. Z-Image Turbo followed the 'grid' instruction well, but failed significantly on text legibility and overall image coherence.

Bald man challenge

Editing
Edit instruction

“Give the person a full, thick head of natural hair with realistic texture, density, and a natural hairline. Preserve facial features and lighting.”

Before After
GPT Image 1.5
Before After
Z-Image Turbo
100% wins 0% ties 0% wins

AI Judge Analysis

GPT Image 1.5

  • + Expertly follows the prompt providing a full, thick head of hair with natural-looking curls.
  • + Maintains excellent source preservation of the facial features, clothes, and overall lighting.
  • + The integrated hair texture matches the existing beard density and style perfectly.
  • Slight adjustment to the top frame of the glasses near the nose bridge compared to the original.

Z-Image Turbo

  • + Maintains the overall composition and color palette of the original photo.
  • Completely failed the primary edit request, leaving the subject bald with only a slight stubble shadow.
  • Lost the subject's glasses entirely, which was not requested.
  • Significantly altered the facial features and the background landscape.

Verdict: GPT Image 1.5 successfully executed the edit by adding a realistic, thick head of hair that blends seamlessly with the original subject's features and lighting. Z-Image Turbo failed the prompt entirely, failing to add hair while also mistakenly removing the subject's glasses and altering the background.

Night Sky Transformation

Editing
Edit instruction

“Change the scene to night: a deep, dark sky with subtle, glistening stars visible behind the mountain.”

Before After
GPT Image 1.5
Before After
Z-Image Turbo
100% wins 0% ties 0% wins

AI Judge Analysis

GPT Image 1.5

  • + Perfectly adheres to the night sky and glistening stars request.
  • + Maintains the structural integrity and layout of the original image.
  • + Excellent lighting adjustment, changing the golden hour glow to a moody, nighttime atmosphere while keeping the town lights.
  • The mountain peak is a bit dark/low contrast against the sky.

Z-Image Turbo

  • + Maintains high resolution and clear textures.
  • Completely failed the instruction to change the scene to night.
  • Significantly altered the shape of the main mountain and the layout of the town.
  • The sky remains in a sunset/golden hour state with no visible stars.

Verdict: GPT Image 1.5 successfully executed the edit by transforming the golden hour scene into a convincing night landscape with a starry sky, while preserving the original composition. Z-Image Turbo failed the prompt entirely, providing a slightly modified version of the original sunset scene and losing the distinctive shape of the Matterhorn mountain.

Adorable Baby Animals in Sunny Meadow

Text-to-Image

“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”

GPT Image 1.5
Z-Image Turbo
100% wins 0% ties 0% wins

AI Judge Analysis

GPT Image 1.5

  • + Excellent adherence to lighting requests with clear god rays and dew sparkles.
  • + Highly expressive and dynamic poses that match the 'tumbling' and 'chasing' aspects of the prompt.
  • + Superior texture rendering in the fur and floral elements.
  • The kitten's anatomy, specifically its paws and belly, looks slightly distorted.
  • The background is very busy, which slightly distracts from the subject.

Z-Image Turbo

  • + Clear and distinct representation of all four requested animals.
  • + Clean composition with a soft, pleasant bokeh effect.
  • + Good anatomical consistency for all animals.
  • Lighting is relatively flat and lacks the requested 'god rays' and 'dew sparkles'.
  • The animals look more like they are posing than 'tumbling' or 'playfully chasing'.
  • The butterflies appear somewhat static and floaty.

Verdict: GPT Image 1.5 captured the atmosphere of the prompt much more effectively, providing the requested god rays, dew sparkles, and a genuine sense of dynamic 'tumbling' movement. While Z-Image Turbo produced a cleaner, more anatomically stable image, it failed to incorporate the specific environmental effects requested and felt more like a posed studio shot than a masterpiece scene.

Victorian Greenhouse Oasis

Text-to-Image

“Hyper-photorealistic interior of a lush Victorian glass greenhouse filled with exotic tropical plants, vibrant blooming orchids, tall ferns, colorful butterflies in flight, sunlight filtering through ornate glass roof creating realistic caustics and dew on leaves, intricate iron framework visible, misty atmosphere, 8K masterpiece.”

GPT Image 1.5
Z-Image Turbo
0% wins 0% ties 100% wins

AI Judge Analysis

GPT Image 1.5

  • + Expertly captures the light filtering through the glass with realistic volumetric god rays.
  • + Highly detailed Victorian ironwork that matches the requested architectural style perfectly.
  • + Exquisite texture on the leaves, featuring realistic dew drops and light refraction.
  • The composition is very busy, bordering on cluttered in the foreground.

Z-Image Turbo

  • + Good inclusion of various ferns and tropical plants as requested.
  • + Clearer sense of depth and a walkable path within the greenhouse environment.
  • The butterflies look like poorly composited stickers rather than part of the lighting environment.
  • The misty atmosphere appears as a localized cloud in the center rather than a pervasive environment effect.
  • The ironwork is much simpler and lacks the 'intricate' detail requested in the prompt.

Verdict: GPT Image 1.5 is the clear winner, delivering a masterpiece-level image with incredible lighting, complex Victorian architecture, and high-fidelity textures on the flora and dew drops. In contrast, Z-Image Turbo has significant issues with the realism of the butterflies and a much flatter lighting profile that fails to capture the requested 'caustics'.

Heroic Super Hero Portrait

Text-to-Image

“Hyper-photorealistic full-body portrait of a female superhero standing triumphantly on a New York skyscraper rooftop at golden sunset, wearing a classic modest superhero costume with flowing cape, chest emblem, gloves, and boots in red and blue colors, practical design, short hair, strong determined heroic expression looking into the distance, powerful confident stance with hands on hips and cape billowing dramatically in the wind, detailed urban cityscape background, warm natural sunlight with sharp shadows and fabric highlights, ultra-sharp textures on suit, hair, and concrete, 8K masterpiece, empowering family-friendly style.”

GPT Image 1.5
Z-Image Turbo

AI Judge Analysis

GPT Image 1.5

  • + Excellent high-resolution background featuring iconic NYC landmarks like the Empire State Building.
  • + Superior hand anatomy and finger placement on the hips.
  • + Highly detailed fabric textures on the suit and cape.
  • The skirt design makes the outfit feel less 'full-body' or 'practical' than requested.
  • Lighting on the character's face is a bit flat compared to the intensity of the sunset.

Z-Image Turbo

  • + The costume design is more practical and modest with the inclusion of full-length leggings.
  • + Excellent hair movement that matches the billowing of the cape.
  • + Atmospheric golden hour lighting that feels well-integrated with the subject.
  • Significant issues with hand anatomy, particularly the right hand having too many fingers/distorted shape.
  • The urban background is generic and lacks the detail and scale of the requested New York skyline.

Verdict: GPT Image 1.5 wins primarily due to technical execution and background detail; its rendition of New York is unmistakable and the hand anatomy is perfect. While Z-Image Turbo followed the 'modest' and 'practical' costume prompt better by including leggings, the anatomical errors on the hands and the lack of iconic city detail make it the weaker image.

Vintage Cafe Logo

Text-to-Image

“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”

GPT Image 1.5
Z-Image Turbo
100% wins 0% ties 0% wins

AI Judge Analysis

GPT Image 1.5

  • + Excellent typography style that feels premium and vintage
  • + High quality shading and grain texture on the cloche dome
  • + Perfect spelling of names and dates
  • Ignored the request for a light background, providing a black one instead
  • Slightly less 'minimalist' than requested due to heavy shading

Z-Image Turbo

  • + Followed the light background and subtle texture instructions perfectly
  • + True minimalist vector aesthetic
  • + Clean, balanced layout
  • The 'steam' element is very small and lacks artistic impact
  • Typography is more modern/basic compared to the requested 'classic' feel

Verdict: Z-Image Turbo adhered more closely to the full prompt by providing the requested light background and minimalist vector style, whereas GPT Image 1.5 failed on the background color despite producing a more visually impressive and detailed vintage illustration. If the user requires a usable logo on a light surface as requested, Z-Image Turbo is the functional winner, though GPT Image 1.5 has superior artistic execution for a vintage brand.

GPT Image 1.5

OpenAI's state-of-the-art image generation model with better instruction following and adherence to prompts

Z-Image Turbo

Tongyi-MAI's 6-billion parameter distilled text-to-image model optimized for speed, achieving high-quality generation in 8 steps or fewer with support for bilingual text rendering