GPT Image 1.5 vs Seedream 4.5
Head-to-head across 16 challenges
GPT Image 1.5
66.7%
win rate
Ties
4.2%
Seedream 4.5
29.2%
win rate
Challenge Results
Magic Burger Explosion: Fiery Photorealism Challenge
Text-to-Image“Ad for 'Magic Burger'. Dynamic, exploded burger with all components (bun, patty, cheese, lettuce, tomato, sauce) suspended in mid-air. Emphasize photorealistic detail and a sense of motion. Dark, fiery background with glowing embers. Integrate text: 'MAGIC BURGER' as a prominent title, 'LIMITED TIME ONLY' as a secondary message, and '€6.99' in a starburst, all rendered with a fiery, glowing effect.”
AI Judge Analysis
GPT Image 1.5
- + Excellent photorealistic texture on the meat and bun
- + Very effective use of the fiery, glowing effect on all text elements
- + Dynamic composition with embers and sauce splatters that enhance the action
- − The 'exploded' effect is a bit vertical and static compared to Model B
- − The bottom bun remains at the very bottom of the frame rather than being fully suspended
Seedream 4.5
- + Stronger sense of motion with diagonal arrangement and motion blur trails
- + Cleaner text layout and typography
- + Creative interpretation of the cheese stretching between components
- − The textures look slightly more digital/artificial than Model A
- − The lettuce and tomato look less realistic and more like plastic props
Verdict: GPT Image 1.5 wins on photographic realism and the integration of the fiery theme into the burger's textures, making the food look more appetizing. While Seedream 4.5 has a more dynamic 'exploded' composition and cleaner text, it lacks the gritty, high-detail realism found in GPT Image 1.5's rendering of the patty and bun.
Geometric Composition
Text-to-Image“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”
AI Judge Analysis
GPT Image 1.5
- + Perfect adherence to the prompt's spatial instructions.
- + High-quality textures on the red book and wooden table.
- + Realistic reflections and refractions through the glass cube.
- − The 'small blue sphere' is relatively large compared to the cube.
Seedream 4.5
- + Captures the soft window light effect with high contrast and warmth.
- + Authentic glass refraction for the plant in the background.
- − Serious geometric errors where the blue sphere is merging into/outside the glass wall.
- − The cube shape is distorted and looks like a solid block rather than a hollow container.
- − The plant is very blurry and barely recognizable.
Verdict: GPT Image 1.5 successfully followed all spatial instructions, placing the sphere clearly inside the cube and the plant behind it. Seedream 4.5 struggled with spatial reasoning, resulting in a blue sphere that appears to be clipped through the glass wall and a distorted cube structure.
Candid Street Photography
Text-to-Image“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”
AI Judge Analysis
GPT Image 1.5
- + Excellent photorealistic texture on the jacket, skin, and bicycle.
- + Successfully incorporates the 'imperfect framing' prompt with a large car obscuring part of the scene.
- + Highly detailed and realistic rendering of a wet pavement environment.
- − The 'motion blur' on the car is quite minimal, appearing more as a static object than a moving one.
Seedream 4.5
- + Strong execution of motion blur from passing cars in the background.
- + Good adherence to the 'shallow depth of field' and 'candid' look.
- + Natural skin textures and realistic expressions.
- − The bicycle mechanics are nonsensical, with a wrench floating near a chain that isn't connected to a sprocket correctly.
- − Composition is a bit more 'posed' rather than 'candid' as the subject is looking directly at the camera.
Verdict: GPT Image 1.5 is the winner due to its superior technical accuracy; while Seedream 4.5 captures the motion blur of cars better, it fails significantly on the details of the bicycle repair. GPT Image 1.5 feels like a genuine, high-quality street photograph with realistic textures and a believable environment.
Modern Clean Menu
Text-to-Image“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”
AI Judge Analysis
GPT Image 1.5
- + Excellent text rendering with clear, legible menu items and descriptions.
- + Logical and highly professional layout that mimics a real-world menu.
- + High-quality, appetizing food photography that perfectly aligns with the text items.
Seedream 4.5
- + Clean, minimalist aesthetic with bold colored borders.
- + Follows the general section structure requested in the prompt.
- − Severely garbled and repetitive text (e.g., 'Appetizters', 'Festaurant').
- − Excessive whitespace and lack of detailed item descriptions.
- − Menu prices are unrealistically high ($79 for an appetizer).
Verdict: GPT Image 1.5 is the clear winner as it produces a fully functional, professional-grade menu with perfect text legibility and logical itemization. Seedream 4.5 follows the minimalist prompt but fails significantly on text rendering, producing nonsensical words and repetitive placeholders that make the design unusable.
Man and Car in California
Editing“Make a photo of the man driving the car down the California coastline”
AI Judge Analysis
GPT Image 1.5
- + Successfully merges the subject and car from the source images into the requested new environment.
- + Preserves the subject's facial features and distinctive hairstyle very accurately.
- + Excellent lighting and color matching between the car interior, the driver, and the coastal background.
- − The steering wheel is positioned incorrectly, appearing to grow out of the middle of the dashboard rather than being in front of the driver.
- − The subject is not actually holding the steering wheel correctly.
Seedream 4.5
- + Excellent full-body preservation of the subject's outfit, including the specific coat, scarf, pants, and boots.
- + High degree of source car preservation, including the interior details and door shape.
- + Composition shows a clearer view of both the car and the coastline.
- − The car door is wide open while driving, which is a significant logical error.
- − The subject's face is slightly altered and less accurate to the source photo compared to Model A.
Verdict: Both models do an impressive job of combining elements from two disparate source images into a single scene. GPT Image 1.5 achieves a more natural lighting and facial resemblance, but fails on the interior ergonomics with a misplaced steering wheel. Seedream 4.5 captures the most detail from the source clothing and car, but suffers from the nonsensical logic of the driver's door being fully open while the car is in motion.
Bald man challenge
Editing“Give the person a full, thick head of natural hair with realistic texture, density, and a natural hairline. Preserve facial features and lighting.”
AI Judge Analysis
GPT Image 1.5
- + Natural wavy hair texture that matches the beard style
- + Excellent preservation of the original facial features and expression
- + Seamless integration of the hairline and sideburns
Seedream 4.5
- + Matches the requested 'thick head of hair' prompt
- + Good preservation of the background and clothing
- − The hairline looks slightly artificial and overly rounded
- − Visible distortion/smudging on the right eye and eyelid compared to the source
- − The hair texture appears a bit flat and painted-on near the forehead
Verdict: GPT Image 1.5 is the winner because it provides a much more natural-looking hair texture and hairline that perfectly complements the subject's existing beard. Seedream 4.5 slightly alters the person's facial features, particularly around the eyes, and the resulting hairline looks less realistic.
Night Sky Transformation
Editing“Change the scene to night: a deep, dark sky with subtle, glistening stars visible behind the mountain.”
AI Judge Analysis
GPT Image 1.5
- + Successfully creates a convincing 'deep, dark sky' atmosphere.
- + Shows glistening stars as requested.
- + Realistic lighting adjustment where the landscape is naturally darker to match the new sky.
- − The composition is slightly zoomed in compared to the original, losing some of the peripheral mountain details.
- − Some fine details in the village are lost due to the heavy darkness.
Seedream 4.5
- + Excellent source preservation, maintaining the exact layout and details of the original image.
- + Keeps the landscape lighting more visible.
- − Fails to create a truly 'dark' sky; it looks more like twilight.
- − The lighting on the mountain peak is inconsistent with a night scene, appearing as if sunset light is still hitting it.
- − The stars are very faint and sparse compared to the prompt request.
Verdict: GPT Image 1.5 is the clear winner for its superior prompt adherence, successfully transforming both the sky and the overall lighting to a nighttime atmosphere. Seedream 4.5 preserves the original image structure perfectly but fails to fulfill the 'deep, dark sky' requirement, leaving the mountain peak illuminated by a non-existent sun.
Adorable Baby Animals in Sunny Meadow
Text-to-Image“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”
AI judge analysis unavailable for this challenge.
Over-the-top cartoon caricature
Editing“Create a caricature of me and my job. Make it exaggerated and humorous, incorporating my profession as a tv show anchor and my love for dogs and hockey.”
AI Judge Analysis
GPT Image 1.5
- + Captures all requested elements including news setting, multiple dogs, and hockey in a vibrant TV studio environment.
- + Strong artistic caricature style with exaggerated features and high energy.
- + Excellent text rendering and graphic design for the 'Breaking News' overlay.
- − Completely changes the subject's outfit and background, losing the source image context.
- − The character's hands and the objects they are holding have some anatomical and structural warping.
Seedream 4.5
- + Preserves the subject's original denim outfit and the living room background from the source image while adding the desk.
- + The hockey gear and dog are rendered with high clarity and realistic textures.
- + The caricature head-to-body ratio is well-executed for a 'bobblehead' style.
- − The microphone setup is physically disconnected from the desk, appearing to float.
- − Less 'humorous' and 'exaggerated' in its overall composition compared to Model A.
- − The desk and background integration feels slightly mismatched in terms of lighting.
Verdict: GPT Image 1.5 creates a much more cohesive and imaginative scene that feels like a professional caricature, successfully incorporating the hockey theme into the background action and the dogs. Seedream 4.5 does a better job of preserving the source image's clothing and background but suffers from technical glitches like a floating microphone and a less dynamic interpretation of the prompt.
Heroic Super Hero Portrait
Text-to-Image“Hyper-photorealistic full-body portrait of a female superhero standing triumphantly on a New York skyscraper rooftop at golden sunset, wearing a classic modest superhero costume with flowing cape, chest emblem, gloves, and boots in red and blue colors, practical design, short hair, strong determined heroic expression looking into the distance, powerful confident stance with hands on hips and cape billowing dramatically in the wind, detailed urban cityscape background, warm natural sunlight with sharp shadows and fabric highlights, ultra-sharp textures on suit, hair, and concrete, 8K masterpiece, empowering family-friendly style.”
AI Judge Analysis
GPT Image 1.5
- + Excellent skin texture and face realism.
- + High-resolution background detail featuring recognizable landmarks like the Empire State Building.
- + Accurate hand anatomy with clear finger Definition.
- − The transition between the character and the background looks slightly like a green-screen effect.
- − The lighting on the character's face doesn't perfectly match the strong backlight of the sunset.
Seedream 4.5
- + Beautiful cinematic lighting and atmosphere that blends the character perfectly with the environment.
- + Captures the 'short hair' and 'flowing cape' prompt elements with more dynamic movement.
- + Highly artistic and cohesive color palette.
- − Anatomy issues on the hands, especially the left hand on the hip which has too many/distorted fingers.
- − The facial features are slightly less sharp and detailed compared to Model A.
Verdict: GPT Image 1.5 offers superior technical clarity and facial realism, but Seedream 4.5 captures the cinematic mood and lighting of a New York sunset much more effectively. While GPT Image 1.5 feels like a high-quality studio photo, Seedream 4.5 feels more like a still from a movie, though it suffers from noticeable AI artifacts in the hands.
Neutral Expression to Genuine Smile
Editing{
"action": "image_edit",
"reference": "uploaded neutral portrait",
"change": "Warm genuine Duchenne smile: lips curved up, slight natural teeth, soft eye crinkles, subtle cheek raise",
"details": "Realistic smiling skin (dimples if present, soft cheek shadows), slightly brighter eyes; keep exact eye shape/color/iris",
"preserve_exact": "Face identity/structure, eyes/nose/lips/eyebrows, hair, skin texture/pores/freckles, makeup, clothing, head pose, background, lighting, shadows, framing",
"no_changes": "No face shape change, no new features, no gaze shift, no hair/clothing/lighting/background edits",
"style": "Ultra-photorealistic 8K portrait, sharp face focus, natural soft lighting, realistic skin glow"
}
AI Judge Analysis
GPT Image 1.5
- + Excellent preservation of specific skin texture and freckles from the original.
- + Natural execution of the Duchenne smile with believable eye crinkles.
- + Maintains original lighting and background perfectly.
- − The teeth are slightly less sharp than in the source image's overall resolution.
Seedream 4.5
- + Very high clarity and sharpness in the facial edit.
- + Clean rendering of teeth and smile architecture.
- + Preserves the subject's identity and head pose accurately.
- − Skin texture is slightly smoothed compared to the source, losing some fine freckle detail.
- − The eye crinkles are a bit more subtle than requested for a full Duchenne smile.
Verdict: Both models performed exceptionally well, maintaining nearly identical pixel-level matching for the background and clothing. GPT Image 1.5 is the winner because it better preserved the original skin texture and freckle patterns while delivering a more convincing Duchenne smile with the requested eye crinkles. Seedream 4.5 produced a very clean result but slightly smoothed the skin, losing a small amount of the source's character.
Intricate Floral Mandala
Text-to-Image“Perfectly symmetrical mandala made entirely of real flowers, petals, leaves, fruits, and seeds in vibrant natural colors, intricate layered patterns with radial symmetry, top-down view on a soft neutral background, hyper-detailed organic textures and subtle shadows, photorealistic, 8K masterpiece.”
AI Judge Analysis
GPT Image 1.5
- + Exceptional adherence to the 'perfectly symmetrical' requirement.
- + Includes a clear variety of all requested elements like seeds, fruits, and leaves.
- + Strikingly vibrant colors and high-contrast composition.
- − The textures look slightly artificial or felt-like rather than 100% photorealistic.
- − The layout feels very dense and perhaps a bit repetitive.
Seedream 4.5
- + Very soft, photorealistic lighting and depth with realistic shadows.
- + Organic elements look very natural and freshly picked.
- + Beautiful, delicate color palette.
- − Lacks the requested perfect symmetry, with many elements misaligned or mismatched across the axes.
- − The composition is significantly more chaotic and less like a traditional mandala.
Verdict: GPT Image 1.5 followed the prompt's technical requirements for perfect symmetry and specific botanical elements much more accurately, resulting in a cohesive mandala. While Seedream 4.5 captures a more realistic lighting and organic texture, its failure to maintain radial symmetry makes it less successful as a mandala. GPT Image 1.5 is the preferred choice for its precise adherence to the structural and thematic prompt instructions.
Studio Ghibli Anime Style
Editing“Transform this photo into a Studio Ghibli–inspired illustration. Use soft pastel colors, hand-painted textures, gentle lighting, dreamy backgrounds, and a warm, nostalgic mood”
AI judge analysis unavailable for this challenge.
Golden Hour Stroll
Editing“Add dynamic motion to this photo: make hair blow in the wind, add leaves flying, energetic and lively feel.”
AI Judge Analysis
GPT Image 1.5
- + Excellent preservation of the subject's face and clothing.
- + The hair-blowing effect is realistically integrated with the original pose.
- + High density of falling leaves creates a strong sense of wind.
Seedream 4.5
- + Successfully captures a more 'energetic' feel by slightly altering the pose to a jog.
- + The leaves have motion blur which enhances the sense of dynamic movement.
- + Good preservation of the background and surroundings.
- − The leash now appears to be floating unconnected to the dog's collar.
- − The subject's face has changed significantly from the source image.
- − The hand holding the leash has anatomical issues/distortion.
Verdict: GPT Image 1.5 is the winner because it successfully applied the dynamic edits while maintaining perfect consistency with the source image's subject and details. Seedream 4.5 created a more energetic composition, but at the cost of changing the woman's face and introducing a noticeable error where the leash no longer connects to the dog.
Vintage Cafe Logo
Text-to-Image“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”
AI Judge Analysis
GPT Image 1.5
- + Excellent typography with correct Italian accent usage.
- + Rich vintage texture and sophisticated vector shading.
- + Strong central composition with a classic emblem feel.
- − Ignored the request for a light background, providing a black background instead.
- − Slightly less 'minimalist' than Model B due to detailed shading.
Seedream 4.5
- + Perfect adherence to the light background and minimalist vector style.
- + Very clean typography and iconography.
- + Clear execution of the banner and steam elements.
- − The 'f' in Florian is slightly disconnected or stylized in a Way that looks like a gap.
- − Shading on the cloche is very basic compared to the artistic depth of Model A.
Verdict: Both models followed the complex text requirements perfectly. Seedream 4.5 adhered better to the background and minimalism constraints of the prompt, while GPT Image 1.5 produced a much more visually compelling and textured piece of art that failed the 'light background' instruction. Seedream 4.5 is the winner for better prompt adherence regarding color scheme and style.
Apollo 11: Journey to Tranquility
Text-to-Image“Create a clean, modern vector infographic poster about the Apollo 11 mission. NASA-inspired palette (navy, white, muted red, light gray). Flat-vector style, crisp lines, consistent iconography, subtle gradients only. Steps (stop at landing): 1. Launch (Saturn Vicon) 2. Earth Orbit (Earth + orbit ring icon) 3. Translunar (trajectory arc icon) 4. Lunar Orbit (Moon + orbit ring icon) 5. Descent (lunar module descending icon) 6. Landing (lunar module on the surface icon) Small supporting elements (minimal text): • Crew strip: three silhouette icons with only last names: Armstrong, Aldrin, Collins. • Landing site marker: Moon pin labeled "Tranquility" only. Layout constraints: generous margins, large readable labels, clean background with subtle stars. Vector-only, print-poster look, high resolution.”
AI Judge Analysis
GPT Image 1.5
- + Excellent adherence to all step-by-step landing icons
- + Strong typography and legible text for all labels
- + Dynamic and engaging vertical composition
- − The 'Launch' section is slightly cluttered by the overlapping Saturn V rocket
- − Some icons are slightly more illustrative than strictly 'flat vector'
Seedream 4.5
- + Clean, minimalist flat-vector aesthetic that matches the 'modern infographic' request
- + Perfectly legible text and clear numbering of steps
- + Accurate use of the requested NASA-inspired color palette
- − Step 5 (Descent) is represented by a generic satellite icon instead of a descending lunar module
- − The step 3 trajectory arc is disconnected from the other elements
Verdict: GPT Image 1.5 is the preferred image because it followed the specific iconography instructions for every step of the mission, including the lunar module for both descent and landing. While Seedream 4.5 captures the 'flat vector' style more accurately, it failed on step 5 by depicting a satellite instead of a descending lunar module.
GPT Image 1.5
OpenAI's state-of-the-art image generation model with better instruction following and adherence to prompts
Seedream 4.5
ByteDance's latest image generation model unifying text-to-image and image editing in a single architecture, with improved text rendering and 30-40% faster generation than v4.0