GPT Image 1.5 vs Seedream 4.0
Head-to-head across 12 challenges
GPT Image 1.5
73.9%
win rate
Ties
0.0%
Seedream 4.0
26.1%
win rate
Challenge Results
Modern Clean Menu
Text-to-Image“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”
AI Judge Analysis
GPT Image 1.5
- + Excellent text rendering with clear, legible names, descriptions, and prices.
- + Highly professional layout that functions as an actual usable menu.
- + Perfect adherence to all prompt elements, including specific food categories and color accents.
- − Overall design is a bit conservative or 'stock-photo' in style.
Seedream 4.0
- + Bold, clear heading text.
- + Modern, high-energy photo grid arrangement.
- − Fails to provide actual menu content like item names or prices.
- − The layout is just a collage of headers and photos rather than a functional menu design.
- − Large amount of wasted white space in the center of the design.
Verdict: GPT Image 1.5 is the clear winner as it produced a fully realized, professional menu design with legible text, item descriptions, and a logical layout. Seedream 4.0 created an abstract collage that lacks the functional elements of a menu, such as item lists and prices, making it unusable for the requested purpose.
Geometric Composition
Text-to-Image“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”
AI Judge Analysis
GPT Image 1.5
- + Excellent photographic realism and sharpness
- + Exceptional handling of glass reflections and physical thickness
- + Perfect adherence to all spatial instructions and lighting
- − The sphere is quite large relative to the prompt 'small blue sphere'
Seedream 4.0
- + Successfully placed all requested elements in the scene
- + Accurately depicted a 'small' blue sphere as requested
- + Strong sense of cinematic natural lighting
- − The glass cube is missing its back-left vertical edge, making the geometry incoherent
- − Significant artifacting where the plant is visible through the glass
Verdict: Both models followed the prompt instructions perfectly in terms of object placement and lighting direction. GPT Image 1.5 is the clear winner due to its superior image quality and physical coherence; Seedream 4.0 suffers from broken geometry on the glass cube and messy rendering of the background plant through the glass.
Candid Street Photography
Text-to-Image“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”
AI Judge Analysis
GPT Image 1.5
- + Excellent attention to detail with realistic rain droplets on the man's jacket and the bike frame.
- + Strong atmospheric lighting and reflections that accurately convey a rainy day.
- + Successful execution of the 'imperfect framing' prompt with the cropped car and candid feel.
- − The bike anatomy becomes a bit messy around the rear wheel and spokes.
Seedream 4.0
- + Good use of motion blur on the passing vehicle to create a sense of street movement.
- + Realistic puddles and reflections on the pavement.
- + Clearer view of the man's face and hands working on the bike.
- − The man is wearing a short-sleeved shirt in what appears to be rainy weather, which feels logically inconsistent.
- − The bicycle's physical structure is significantly warped, particularly the pedals/crankset and the front wheel spokes.
Verdict: GPT Image 1.5 is the superior image as it better captures the textures and atmosphere of a rainy day, including visible raindrops on surfaces. Seedream 4.0 captures the motion blur well, but the physical distortions of the bicycle and the illogical clothing choice for the weather make it less convincing.
Man and Car in California
Editing“Make a photo of the man driving the car down the California coastline”
AI Judge Analysis
GPT Image 1.5
- + Excellent preservation of the man's facial features and hair texture.
- + High-quality rendering of the car's interior details.
- + The lighting on the man matches the beach environment well.
- − The perspective of the car feels slightly disjointed from the road.
- − The man is positioned too far back in the seat relative to the steering wheel.
Seedream 4.0
- + Perfectly preserves the specific car model (Rolls-Royce) and its details from the source image.
- + Dynamic and realistic composition with a clear sense of motion.
- + Accurately represents the man's clothing and hairstyle in the new context.
- − The man's facial details are slightly blurred compared to the source image.
- − Minor jitter in the wheel spokes due to the motion blur effect.
Verdict: Both models performed exceptionally well, but Seedream 4.0 is the winner for its superior composition and preservation of the car's physical presence. While GPT Image 1.5 kept the man's face sharper, the overall scene in Seedream 4.0 feels much more natural and cohesive as a single photograph.
The Halloween Invitation
Text-to-Image“Vintage gothic Halloween party invitation. Dark parchment poster, spooky border with webs and thorns, central glowing jack-o-lantern, bats, twisted trees, moody night sky. Add elegant gothic title text saying "Halloween Party Invitation", a small scroll banner saying "You are invited to a night of frights", and event details at the bottom: Date: 30.10.2026 Time: 7pm Location: The Arches, NYC Spooky but polished, cinematic lighting, square format.”
AI Judge Analysis
GPT Image 1.5
- + Perfect text rendering for all requested headings and details.
- + Strong adherence to the 'vintage gothic' style with sepia tones and parchment textures.
- + Excellent composition with a very clear thorns-and-webs border that frames the scene well.
- − The jack-o'-lantern carving is slightly more traditional and less 'cinematic' than Model B's.
Seedream 4.0
- + Impressive 3D lighting on the pumpkin and background trees.
- + High level of detail in the background silhouettes and atmospheric fog.
- + Creative torn-edge effect on the central parchment layer.
- − Text in the scroll banner is messy and barely legible compared to the other text.
- − The thorn border is less cohesive, with some thorns appearing to float or break the frame erratically.
- − Small typo in '7pm' which looks more like '7pm' with a merged 'm'.
Verdict: GPT Image 1.5 is the superior choice for an invitation as it follows the textual requirements perfectly with clean, elegant gothic typography. While Seedream 4.0 has more atmospheric lighting and a higher-quality central pumpkin, its failure to legibly render the scroll text makes it less functional for its intended purpose.
Bald man challenge
Image Editing“Give the person a full, thick head of natural hair with realistic texture, density, and a natural hairline. Preserve facial features and lighting.”
AI Judge Analysis
GPT Image 1.5
- + Natural hair texture that matches the beard style
- + Seamless integration of hair with the original facial features
- + Excellent preservation of background and lighting
- − Slightly altered the shape of the glasses frames
Seedream 4.0
- + Followed the instruction for 'thick' hair very literally
- + Maintained the original background and lighting well
- − Hair looks like a wig with an unnatural, stiff texture
- − The hairline is too low and lacks realistic transition
- − The vertical volume of the hair feels out of proportion for a natural head of hair
Verdict: GPT Image 1.5 provides a much more convincing and realistic edit, creating hair that matches the character's facial hair and sits naturally on the head. Seedream 4.0 added a very large volume of hair that lacks realistic texture and looks more like a toupee or a wig, failing the 'natural' requirement of the prompt.
Adorable Baby Animals in Sunny Meadow
Text-to-Image“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”
AI Judge Analysis
GPT Image 1.5
- + Excellent fur texture and fine detail on all four animals.
- + Expressive facial features that perfectly match the 'joyful' vibe.
- + Stronger adherence to the 'tumbling together' request with close, intertwined positioning.
- − The kitten has an anatomically strange third hind leg/paw visible at the bottom.
- − The fox's front paw has a slightly merged, undefined look.
Seedream 4.0
- + Dynamic composition that captures the 'chasing' aspect better by showing the animals in motion.
- + Beautiful bokeh and lighting effects with well-distributed dew sparkles.
- + The animals are clearly distinguishable and appropriately sized relative to one another.
- − The kitten has a very small, somewhat distorted front paw while reaching up.
- − The fox kit's red fur is slightly over-saturated, bordering on unrealistic compared to the other animals.
Verdict: Both models followed the complex prompt extremely well, including all four specific animals and the difficult lighting conditions. GPT Image 1.5 wins on fur texture and emotional expression, creating a very heartwarming close-up, though it suffers from a significant anatomical limb error on the kitten. Seedream 4.0 is preferred for its superior composition and action, capturing a more believable sense of 'playing and chasing' in a meadow without major anatomical failures.
Over-the-top cartoon caricature
Editing“Create a caricature of me and my job. Make it exaggerated and humorous, incorporating my profession as a tv show anchor and my love for dogs and hockey.”
AI Judge Analysis
GPT Image 1.5
- + Excellent caricature style with highly exaggerated features that suit the request.
- + Rich, vibrant background incorporating all requested themes (news desk, dogs, hockey game).
- + High-quality rendering with no major anatomical issues or artifacts.
- − Changes the character's clothing and eye color, losing some of the source identity.
- − Text is slightly generic compared to the visual density.
Seedream 4.0
- + Maintains the subject's original outfit and features more accurately than Model A.
- + Preserves the background elements of the source image while overlaying the news desk.
- + Cleverly combines the hockey and anchor themes through the jersey and stick placement.
- − The caricature deformation on the face has some artifacts around the mouth and eyes.
- − The composition feels a bit cluttered with the phone, microphone, and desk overlapping awkwardly.
Verdict: GPT Image 1.5 delivers a much more polished and 'exaggerated' caricature as requested, featuring a creative integration of a dog in a hockey helmet and a full news studio background. Seedream 4.0 does a better job of preserving the specific person's likeness and clothing from the source image but suffers from messy visual artifacts and a less appealing overall composition.
Studio Ghibli Anime Style
Editing“Transform this photo into a Studio Ghibli–inspired illustration. Use soft pastel colors, hand-painted textures, gentle lighting, dreamy backgrounds, and a warm, nostalgic mood”
AI Judge Analysis
GPT Image 1.5
- + Excellent preservation of the original subjects' clothing patterns and poses
- + Captures the warm, nostalgic, and dreamy lighting requested in the prompt
- + Maintains the depth of field from the source image with the foreground girl blurred
- − Art style leans more towards generic shojo anime than distinct Studio Ghibli aesthetics
- − Faces are a bit too modernized for a nostalgic Ghibli look
Seedream 4.0
- + Perfectly captures the Studio Ghibli hand-painted watercolor aesthetic
- + Highly accurate character facial expressions that match the source while adopting the target style
- + Exceptional execution of hand-painted textures and soft pastel colors
- − The dreamy background replaces the urban street with abstract colors, losing some environmental context
- − The foreground character's blur is removed, making the composition flatter than the original
Verdict: Both models successfully interpreted the meme prompt and maintained the core composition. GPT Image 1.5 did a better job of preserving the source image's layout and clothing details, but Seedream 4.0 far exceeded it in terms of artistic style accuracy, delivering a near-perfect Ghibli watercolor aesthetic that fits the 'hand-painted' and 'nostalgic' requirements precisely.
Golden Hour Stroll
Image Editing“Add dynamic motion to this photo: make hair blow in the wind, add leaves flying, energetic and lively feel.”
AI Judge Analysis
GPT Image 1.5
- + Successfully added a large volume of flying leaves for a high-energy feel.
- + The hair is dramatically windswept, effectively conveying motion.
- + Maintains high image clarity and facial details from the original.
- − The leaves appear pasted on top and don't always interact realistically with the depth of the scene.
- − Slightly altered the woman's facial features compared to the source image.
Seedream 4.0
- + Excellent preservation of the woman's original face and expression.
- + The wind effect on the hair is realistic and well-integrated.
- + Leaves are placed with a better sense of depth and motion blur.
- − Significantly lower resolution and overall sharpness compared to the source and Model A.
- − Fewer flying leaves makes the scene feel less 'energetic' than requested.
Verdict: Both models followed the instructions well, but GPT Image 1.5 produced a much sharper, high-resolution result with a more intense 'energetic' feel due to the quantity of flying leaves. While Seedream 4.0 did a better job of preserving the subject's original facial features and creating a more natural sense of depth, the significant loss in image quality makes it the less desirable output.
Vintage Cafe Logo
Text-to-Image“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”
AI Judge Analysis
GPT Image 1.5
- + Excellent classic typography with a custom feel
- + Included the grave accent on 'Caffè' correctly
- + Good use of texture and shading on the cloche
- − Ignored the request for a light background
- − Banner integration is slightly clunky compared to the text
Seedream 4.0
- + Followed all prompt instructions including the light textured background
- + Clean and balanced vector-style composition
- + Legible and well-spaced typography
- − Used an acute accent (é) instead of the requested grave accent (è)
- − The steam lines are somewhat generic
Verdict: Both models followed the core elements of the prompt well, including the specific date and cloche imagery. Seedream 4.0 followed the background instructions more accurately and feels more like a finished logo emblem, whereas GPT Image 1.5 ignored the background requirement but produced superior, more authentic typography.
Apollo 11: Journey to Tranquility
Text-to-Image“Create a clean, modern vector infographic poster about the Apollo 11 mission. NASA-inspired palette (navy, white, muted red, light gray). Flat-vector style, crisp lines, consistent iconography, subtle gradients only. Steps (stop at landing): 1. Launch (Saturn Vicon) 2. Earth Orbit (Earth + orbit ring icon) 3. Translunar (trajectory arc icon) 4. Lunar Orbit (Moon + orbit ring icon) 5. Descent (lunar module descending icon) 6. Landing (lunar module on the surface icon) Small supporting elements (minimal text): • Crew strip: three silhouette icons with only last names: Armstrong, Aldrin, Collins. • Landing site marker: Moon pin labeled "Tranquility" only. Layout constraints: generous margins, large readable labels, clean background with subtle stars. Vector-only, print-poster look, high resolution.”
AI Judge Analysis
GPT Image 1.5
- + Perfect adherence to all six requested steps with appropriate icons.
- + Exceptional text rendering for all labels including crew names.
- + Highly professional vector aesthetic with a consistent and clean layout.
- − One minor text cut-off at the very top of the image.
Seedream 4.0
- + Accurately followed the color palette and basic steps.
- + Clean, readable typography for the main title and numbered steps.
- + Included the requested 'Tranquility' location marker.
- − Confusing visual layout where the Lunar Module is used for both 'Descent' (5) and 'Landing' (6) without a clear icon change.
- − Icons are significantly less detailed and polished compared to the other model.
- − Text for 'Translunar' is slightly cramped/misaligned.
Verdict: GPT Image 1.5 is the clear winner as it provides a professional, well-balanced infographic with high-quality icons for every single requested step. Seedream 4.0 followed the instructions well but suffered from simpler graphics and a layout that didn't visually distinguish the final stages as effectively as GPT Image 1.5.
GPT Image 1.5
OpenAI's state-of-the-art image generation model with better instruction following and adherence to prompts
Seedream 4.0
ByteDance's image generation model with integrated text-to-image and image editing capabilities in a unified architecture, supporting up to 4K resolution