GPT Image 1 Mini OpenAI Seedream 4.5 ByteDance

Settled by community votes across 7 shared challenges, with an AI judge weighing in on each.

GPT Image 1 Mini

25.3 arena score

#12 of 44 in Text-to-Image

Skill signature

Not enough comparable category data

The chart appears once both models have ratings across at least three shared arena categories.

Seedream 4.5

26.1 arena score

#10 of 44 in Text-to-Image

Vote tally

Where the votes landed

GPT Image 1 Mini

100.0%

win rate

Ties

0.0%

Seedream 4.5

0.0%

win rate

100.0% 0.0% ties 0.0%

Shared challenges 7

Challenge by challenge

The strongest take from each model on every shared challenge, with the AI judge's read.

Geometric Composition

Text-to-Image

“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”

GPT Image 1 Mini

Seedream 4.5

AI Judge Analysis

GPT Image 1 Mini

+ Excellent photorealistic rendering of textures on the book and sphere.
+ Accurate interpretation of 'soft window light' from the left.
+ Clear and logical composition with a well-defined glass cube.

− The green plant is more 'next to' the cube than 'behind' it as seen through the glass.

Seedream 4.5

+ Perfectly places the plant behind the glass cube, showing realistic refraction.
+ Naturalistic high-contrast lighting that matches a bright window setting.

− The 'sphere' is distorted and looks more like a heart or a liquid drop than a sphere.
− The cube geometry is slightly warped on the right side.
− The blue object appears both inside and outside the glass simultaneously.

Verdict: GPT Image 1 Mini creates a much more polished and physically coherent image with superior texture rendering and a perfect blue sphere. While Seedream 4.5 captures the refraction of the plant through the glass more accurately, it fails significantly on the shape of the sphere and the structural integrity of the cube.

Candid Street Photography

Text-to-Image

“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”

GPT Image 1 Mini

Seedream 4.5

100% wins 0% ties 0% wins

AI Judge Analysis

GPT Image 1 Mini

+ Excellent realism in skin texture and clothing
+ Subtle and natural lighting that fits the 'rainy day' mood
+ The bicycle design and mechanical interaction feel more organic and less like a floating asset

− Missed the request for motion blur from passing cars
− Framing is very centered, missing the 'imperfect framing' request

Seedream 4.5

+ Successfully captured the motion blur from passing cars
+ Includes vibrant reflections on the wet pavement that add to the cinematic feel
+ Captures an active repairing motion with a tool

− The wrench is physically merging with the chain and wheel spokes in a nonsensical way
− The bicycle mechanics are nonsensical with a chain running behind the axle
− The face has a slightly smoothed, AI-standard look compared to the requested 'natural' texture

Verdict: Seedream 4.5 followed more of the technical prompt requirements, specifically including the motion blur and wet pavement reflections, but it suffers from significant anatomical and mechanical failures where the tool and bike parts merge. GPT Image 1 Mini produced a much more realistic and grounded image with superior textures and coherence, but it ignored the specific request for motion blur. GPT Image 1 Mini is preferred for its high photographic quality and lack of disturbing artifacts.

Chalkboard Menu

Text-to-Image

“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”

GPT Image 1 Mini

Seedream 4.5

AI Judge Analysis

GPT Image 1 Mini

+ Excellent text rendering with no spelling errors
+ Clean and well-centered layout

− The text looks more like a digital font with a texture filter than genuine handwriting
− Failed to render the title in the requested 'elegant cursive' style

Seedream 4.5

+ Successfully rendered the title in a cursive style as requested
+ Texture and letter variations feel much more like authentic chalk handwriting
+ Includes a realistic cafe background which adds to the 'cozy café' atmosphere

− Includes a repetitive line for the first menu item ('Risotto - $24' appears twice)
− Chalk dust effects are slightly heavy in certain spots

Verdict: GPT Image 1 Mini has superior spelling and layout consistency, but the text feels like a digital font and it ignores the cursive requirement. Seedream 4.5 captures the requested 'handwritten' aesthetic and cursive title much more authentically, despite the minor repetitive text error on the first item.

Pose & Character Mashup

Editing

Edit instruction

“Use Image 1 as the exact pose reference and Image 2 as the character reference. Recreate the person/character from Image 2 in the exact dynamic pose and body position from Image 1. Keep the exact face, hair, clothing style/details, and expression from Image 2. Match the lighting and environment of Image 1. The final image must show the character from Image 2 performing the precise action/pose from Image 1 with perfect anatomy and natural integration.”

Source

GPT Image 1 Mini

Seedream 4.5

100% wins 0% ties 0% wins

AI Judge Analysis

GPT Image 1 Mini

+ Successfully captures the character's facial features and sunglasses from Image 2.
+ Accurately recreates the yellow background and red ottoman from Image 1.
+ The scarf and black sweatshirt details from Image 2 are well-integrated.

− Fails to match the exact dynamic leg and head tilt pose from Image 1.
− Anatomy issue with the left hand which appears to have six fingers.
− Only one foot is on the ottoman, unlike the crossed-leg pose in the source.

Seedream 4.5

+ Executes the exact head tilt and body angle requested from Image 1 better than Model A.
+ Preserves the specific pattern and texture of the scarf from Image 2 more accurately.
+ Successfully maintains both feet on the ottoman as seen in the reference pose.

− The hands have significant anatomical errors, particularly the left hand which is mangled.
− The background lighting is less vibrant and consistent than Model A.
− Small white artifacts/dots present on the lower legs and hands.

Verdict: Seedream 4.5 followed the complex pose instructions much more accurately, capturing the specific head tilt and leg positioning from Image 1. While GPT Image 1 Mini produced a cleaner image with fewer artifacts, it simplified the pose significantly and failed to match the dynamic movement of the reference. Despite some anatomical flaws in the hands, Seedream 4.5 is the winner for superior prompt adherence regarding the pose and character details.

Outfit Transfer Challenge

Editing

Edit instruction

“Use Image 1 as the base person. Dress them in the exact elaborate outfit from Image 2 (including all layers, accessories, jewelry, and shoes). Carefully adapt the clothing to the body shape and pose in Image 1 while maintaining realistic fabric behavior, correct proportions, and perfect lighting/shadow matching. Keep the person’s exact face, hair, and background completely unchanged.”

Source

GPT Image 1 Mini

Seedream 4.5

AI Judge Analysis

GPT Image 1 Mini

+ Successfully replicates the outfit style and scarf pattern.
+ Adapts the pose of the person to be more upright while keeping the background context.
+ Maintains the skin condition (vitiligo) on visible areas like hands and face.

− Failed to keep the person's face and hair shape identical to Image 1, making them look older and changing the facial structure.
− Did not include the sunglasses or the specific ring requested from the 'exact' outfit in Image 2.
− Changed the camera angle and background composition from the original Image 1.

Seedream 4.5

+ Perfectly preserves the background, wooden structure, and camera angle of Image 1.
+ Accurately matches the accessories including the specific gold watch, rings, and sunglasses from Image 2.
+ Maintains the specific lighting and texture of the original beach scene.

− Failed to keep the person's face and hair from Image 1, instead merging them with the person from Image 2.
− Leaves the shirt open which was not in the style of Image 2, resulting in a slightly messy composition.

Verdict: Both models struggled with the strict requirement of keeping the face and hair completely unchanged while swapping the outfit. GPT Image 1 Mini changed the camera perspective and facial features significantly, though it created a more cohesive fashion shot. Seedream 4.5 was superior in maintaining the background and specific accessories like the watch and sunglasses, but it failed the primary face-preservation task by blending the two subjects' faces together.

The Capybara Taxi Driver

Text-to-Image

“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”

GPT Image 1 Mini

Seedream 4.5

AI Judge Analysis

GPT Image 1 Mini

+ Excellent cinematic lighting and photorealism
+ High-quality texture on the capybara's fur and the jacket
+ Atmospheric background bokeh that captures the New York night feel

− The passenger in the back is not clearly looking at a phone
− The capybara's paws on the wheel look slightly mutated/unnatural

Seedream 4.5

+ Perfect adherence to the phone detail in the passenger's hands
+ Captures the bored, mundane expression of the passenger very well
+ Clearer depiction of both paws on the steering wheel

− Lighting is a bit flat and less cinematic compared to Model A
− The 'TAXI' text on the hat is slightly crooked
− The composition feels a bit like a stock photo rather than a realistic film frame

Verdict: GPT Image 1 Mini produces a much more visually stunning and atmospheric image with professional cinematic lighting and textures. However, Seedream 4.5 captures the specific storytelling details of the prompt better, particularly the passenger's bored interaction with her phone and the symmetrical 'pro' driving stance of the capybara.

Adorable Baby Animals in Sunny Meadow

Text-to-Image

“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”

GPT Image 1 Mini

Seedream 4.5

100% wins 0% ties 0% wins

AI Judge Analysis

GPT Image 1 Mini

+ Excellent character balance with all four animals clearly visible and distinct
+ More realistic fur textures and animal anatomy
+ Effective use of subtle god rays and morning dew consistent with the prompt

− One butterfly has a merged wing appearance
− Lighting is a bit flat compared to the dramatic sunrise requested

Seedream 4.5

+ Beautiful, dramatic lighting with prominent god rays and golden hour glow
+ Captures the 'expressive eyes' and 'tumbling' dynamic very well
+ Vibrant colors and creative use of dew sparkles/bubbles

− The fox's eyes are overly stylized/cartoonish compared to the 'photorealistic' request
− The bunny is partially obscured in the background
− Anatomical issues with the kitten's front paws

Verdict: GPT Image 1 Mini followed the prompt's request for photorealism much better, maintaining realistic proportions and textures for all four animals. While Seedream 4.5 created a more magical and visually striking atmosphere with superior lighting, the fox and kitten suffer from anatomical distortions and a more illustrative, less realistic style.

Next steps

Explore each model

GPT Image 1 Mini

OpenAI

OpenAI's cost-effective image generation model for when image quality isn't the top priority

Vote this model in the arena

Arena profile Lumenfall catalog

Seedream 4.5

ByteDance

ByteDance's latest image generation model unifying text-to-image and image editing in a single architecture, with improved text rendering and 30-40% faster generation than v4.0

Vote this model in the arena

Arena profile Lumenfall catalog