Z-Image Turbo AI Image Editing Model

Tongyi-MAI's 6-billion parameter distilled text-to-image model optimized for speed, achieving high-quality generation in 8 steps or fewer with support for bilingual text rendering

Benchmarks

Alibaba's Z-Image Turbo holds a top 10 position in global text-to-image rankings with a 1253 Elo rating. It remains competitive in high-speed workflows, maintaining a rank of 16 for complex image editing tasks with an Elo of 1021.

Lumenfall Arena
#16
Image Editing · 1021 Elo
Lumenfall Arena
#10
Text-to-Image · 1252 Elo

Image Editing Landscape

2 without speed data omitted.

Text-to-Image Landscape

12 without speed data omitted.

Competition Results

Modern Clean Menu

Text-to-Image
#2/19
Prompt

“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”

Generated
3 attempts – showing best result

Adorable Baby Animals in Sunny Meadow

Text-to-Image
#6/23
Prompt

“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”

Generated
3 attempts – showing best result

Fantasy Warrior

Text-to-Image
#8/19
Prompt

“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”

Generated
3 attempts – showing best result

Geometric Composition

Text-to-Image
#11/22
Prompt

“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”

Generated
3 attempts – showing best result

Heroic Super Hero Portrait

Text-to-Image
#11/19
Prompt

“Hyper-photorealistic full-body portrait of a female superhero standing triumphantly on a New York skyscraper rooftop at golden sunset, wearing a classic modest superhero costume with flowing cape, chest emblem, gloves, and boots in red and blue colors, practical design, short hair, strong determined heroic expression looking into the distance, powerful confident stance with hands on hips and cape billowing dramatically in the wind, detailed urban cityscape background, warm natural sunlight with sharp shadows and fabric highlights, ultra-sharp textures on suit, hair, and concrete, 8K masterpiece, empowering family-friendly style.”

Generated
3 attempts – showing best result

Vintage Cafe Logo

Text-to-Image
#9/19
Prompt

“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”

Generated
3 attempts – showing best result

Victorian Greenhouse Oasis

Text-to-Image
#13/17
Prompt

“Hyper-photorealistic interior of a lush Victorian glass greenhouse filled with exotic tropical plants, vibrant blooming orchids, tall ferns, colorful butterflies in flight, sunlight filtering through ornate glass roof creating realistic caustics and dew on leaves, intricate iron framework visible, misty atmosphere, 8K masterpiece.”

Generated
3 attempts – showing best result

Candid Street Photography

Text-to-Image
#15/22
Prompt

“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”

Generated
3 attempts – showing best result

Isometric Miniature Diorama Scenes

Text-to-Image
#19/19
Prompt

“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”

Generated
3 attempts – showing best result

Night Sky Transformation

Image Editing
#15/15
Edit instruction

“Change the scene to night: a deep, dark sky with subtle, glistening stars visible behind the mountain.”

Result
Z-Image Turbo edited result for Night Sky Transformation
Original image before Z-Image Turbo editing
Before After
3 attempts – showing best result

Bald man challenge

Image Editing
#14/14
Edit instruction

“Give the person a full, thick head of natural hair with realistic texture, density, and a natural hairline. Preserve facial features and lighting.”

Result
Z-Image Turbo edited result for Bald man challenge
Original image before Z-Image Turbo editing
Before After
3 attempts – showing best result
Archived results

Fantasy Warrior

Text-to-Image
#3/14
Prompt

“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”

Generated
3 attempts – showing best result
Help rank Z-Image Turbo Vote in blind head-to-head matchups
Start Voting