Head to head
Esc

Models · slot A

to navigate to pick

FLUX.2 [dev] Flash fal GPT Image 1.5 OpenAI

Settled by community votes across 20 shared challenges, with an AI judge weighing in on each.

FLUX.2 [dev] Flash

26.8 arena score

#7 of 48 in Text-to-Image

Skill signature · Text-to-Image

GPT Image 1.5

27.0 arena score

#5 of 48 in Text-to-Image

Vote tally

Where the votes landed

FLUX.2 [dev] Flash

36.4%

win rate

Ties

18.2%

GPT Image 1.5

45.5%

win rate

36.4% 18.2% ties 45.5%
Shared challenges 20

Challenge by challenge

The strongest take from each model on every shared challenge, with the AI judge's read.

Geometric Composition

Text-to-Image

“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”

FLUX.2 [dev] Flash
GPT Image 1.5
50% wins 17% ties 33% wins

AI Judge Analysis

FLUX.2 [dev] Flash

  • + Perfect adherence to the lighting instruction with a clear window light source on the left.
  • + Highly realistic glass textures, including subtle imperfections and reflections.
  • + Excellent depth of field and consistent perspective.
  • The sphere is slightly smaller than what might be expected from the prompt 'small', making it look a bit lost in the cube.

GPT Image 1.5

  • + Excellent color saturation and vibrant subject matter.
  • + Clean, sharp rendering of the glass cube and its internal reflection.
  • + Good composition with the plant framing the background.
  • The lighting is a bit more diffuse and less directional than the requested 'window light from the left'.
  • The glass cube has a mirrored base which wasn't requested, creating a double reflection of the sphere.

Verdict: Both models followed the prompt instructions very well. FLUX.2 [dev] Flash stands out for its superior photographic realism and more accurate rendering of the soft directional window light. GPT Image 1.5 is also very strong but looks slightly more like a 3D render and added a mirrored floor to the cube that wasn't in the prompt.

Man and Car in California

Editing
Edit instruction

“Make a photo of the man driving the car down the California coastline”

Source
FLUX.2 [dev] Flash
GPT Image 1.5

AI Judge Analysis

FLUX.2 [dev] Flash

  • + Excellent preservation of the man's facial features and specific hairstyle.
  • + Faithfully replicates the textures and patterns of the plaid coat and knit scarf.
  • + The interior dashboard maintains a premium, classic aesthetic consistent with a high-end vehicle.
  • The steering wheel logo does not match the Rolls-Royce from the source image.

GPT Image 1.5

  • + Successfully places the man in the car with a California coastline background.
  • + Correctly incorporates the cream-colored interior from the source image.
  • Fails to preserve the man's specific facial structure, making him look like a different person.
  • The clothing details (scarf and plaid coat) are simplified or lost compared to the source.
  • Poor composition with the car appearing to drive on the wrong side of the road or off the edge.

Verdict: FLUX.2 [dev] Flash is the clear winner as it maintains near-perfect consistency with the source images, capturing the man's likeness and clothing details with high fidelity. GPT Image 1.5 struggles with identity preservation, resulting in an image where the man only vaguely resembles the source, and the overall composition feels less coherent.

Candid Street Photography

Text-to-Image

“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”

FLUX.2 [dev] Flash
GPT Image 1.5

AI Judge Analysis

FLUX.2 [dev] Flash

  • + Excellent motion blur on background cars that creates a sense of movement.
  • + Highly realistic skin texture and facial features of the elderly man.
  • + Perfect adherence to 'imperfect framing' with a candid street photography feel.
  • The man's hands have structural issues, specifically the number and placement of fingers.
  • The bicycle kickstand and tools on the ground are somewhat messy and poorly defined.

GPT Image 1.5

  • + Stronger emotional 'candid' feel with the man crouching and focused on his work.
  • + Superior bicycle mechanics and tool kit detail.
  • + Excellent depiction of rain droplets on the man's jacket and cap.
  • Failed to include 'motion blur from passing cars', as the car in the background is static.
  • The depth of field is slightly deeper than requested compared to the creamy bokeh in Model A.

Verdict: FLUX.2 [dev] Flash captures the cinematic technicalities better, specifically the motion blur and the 50mm lens look, but suffers from significant hand artifacts. GPT Image 1.5 provides a more grounded, realistic scene with better attention to the physical tools and the texture of rain, though it ignored the motion blur requirement. FLUX.2 [dev] Flash is the winner for its superior 'candid street' atmosphere and adherence to all photography-specific prompts despite the hand issues.

Fantasy Warrior

Text-to-Image

“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”

FLUX.2 [dev] Flash
GPT Image 1.5
0% wins 0% ties 100% wins

AI Judge Analysis

FLUX.2 [dev] Flash

  • + Excellent adherence to the 'hair braided with small beads' detail
  • + Clean and intricate engraving on the armor
  • + Well-defined torchlight sources that balance the composition
  • The facial skin texture and blood effects look slightly flatter and less realistic than the competitor
  • The 'battle-worn' appearance feels a bit clean overall

GPT Image 1.5

  • + Exceptional skin texture with realistic dirt and sweat mapping
  • + Highly effective use of warm torchlight reflections on the metal
  • + Strong sense of material depth in the leather and cloth underlayers
  • The beads in the hair are less distinct than requested
  • A small artifact is visible on the earring/ear area

Verdict: Both models followed the prompt exceptionally well, but GPT Image 1.5 wins due to its superior rendering of textures, particularly the skin and the interaction of light with the armor. While FLUX.2 [dev] Flash handled the specific request for hair beads more clearly, GPT Image 1.5 felt more like a tangible, high-quality photograph with a more convincing 'battle-worn' atmosphere.

Modern Clean Menu

Text-to-Image

“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”

FLUX.2 [dev] Flash
GPT Image 1.5

AI Judge Analysis

FLUX.2 [dev] Flash

  • + Excellent use of vibrant, colorful accents consistent with the request.
  • + Professional food photography with consistent lighting across the grid.
  • + Good spatial balance between the heading and content.
  • Text rendering is poor with several nonsense words and spelling errors like 'CASSLAL5'.
  • The sections for Appetizers, Pizza, and Mains are poorly organized, with photos of pizza appearing under the Appetizers heading.
  • Visual artifacts present in the smaller subtext.

GPT Image 1.5

  • + Perfect text legibility with accurate spelling and relevant menu items.
  • + Logical layout where food photos correspond correctly to the adjacent text sections.
  • + Extremely clean, professional minimalist design suitable for a real-world application.
  • The 'grid' format requested for photos is split/broken across the vertical layout rather than being a unified grid.
  • Color accents are more muted compared to the 'vibrant' request in the prompt.

Verdict: While FLUX.2 [dev] Flash captures the 'vibrant' aesthetic more effectively with its colorful framing, GPT Image 1.5 is the much stronger entry due to its perfect text rendering and logical organization. GPT Image 1.5 provides a usable menu where the food photos actually match the sections (Apps, Pizza, Mains), whereas FLUX.2 shows pizza in every single photo slot regardless of the heading.

Magic Burger Explosion: Fiery Photorealism Challenge

Text-to-Image

“Ad for 'Magic Burger'. Dynamic, exploded burger with all components (bun, patty, cheese, lettuce, tomato, sauce) suspended in mid-air. Emphasize photorealistic detail and a sense of motion. Dark, fiery background with glowing embers. Integrate text: 'MAGIC BURGER' as a prominent title, 'LIMITED TIME ONLY' as a secondary message, and '€6.99' in a starburst, all rendered with a fiery, glowing effect.”

FLUX.2 [dev] Flash
GPT Image 1.5

AI Judge Analysis

FLUX.2 [dev] Flash

  • + Extremely clean and legible typography
  • + Realistic lighting and photographic depth of field
  • + Sophisticated integration of the fiery starburst element
  • The composition is a bit static compared to the 'exploded' request
  • The sauce droplets look slightly artificial

GPT Image 1.5

  • + Very high sense of motion and 'exploded' energy
  • + Intense, dramatic lighting on the food items
  • + Includes extra details like red onions that add to the visual complexity
  • The text 'MAGIC BURGER' is partially cut off at the top
  • The image feels a bit cluttered and oversaturated
  • The starburst effect looks more like a sticker than a glowing element

Verdict: FLUX.2 [dev] Flash produces a much more professional advertisement with perfect text rendering and realistic lighting. While GPT Image 1.5 captures the 'exploded' motion and fiery energy better, its composition is marred by the text being cut off and an overall busy layout.

Chalkboard Menu

Text-to-Image

“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”

FLUX.2 [dev] Flash
GPT Image 1.5

AI Judge Analysis

FLUX.2 [dev] Flash

  • + Excellent chalk texture with realistic smudging and dusting on the board.
  • + Stronger sense of depth and environmental context by showing the cafe background.
  • + Perfectly captures the requested handwriting variations and elegant cursive title.
  • The spacing between lines is slightly uneven near the bottom.
  • The dash after 'Herbs' is unnecessary.

GPT Image 1.5

  • + Perfect text alignment and uniform spacing across all menu items.
  • + Extremely clear and legible handwriting while still maintaining a chalk feel.
  • + Accurately completes the truncated prompt text with 'Cookies'.
  • Lacks the realistic cafe background environmental details of the other image.
  • The lighting feels a bit flat compared to the more atmospheric lighting in Model A.

Verdict: Both models demonstrate exceptional text rendering capabilities, perfectly following the complex instructions for specific dates, items, and styles. FLUX.2 [dev] Flash is preferred for its superior environmental storytelling and richer textures, making the blackboard feel like a real object in a physical space, whereas GPT Image 1.5 feels slightly more like a digital flat-lay.

Pose & Character Mashup

Editing
Edit instruction

“Use Image 1 as the exact pose reference and Image 2 as the character reference. Recreate the person/character from Image 2 in the exact dynamic pose and body position from Image 1. Keep the exact face, hair, clothing style/details, and expression from Image 2. Match the lighting and environment of Image 1. The final image must show the character from Image 2 performing the precise action/pose from Image 1 with perfect anatomy and natural integration.”

Source
FLUX.2 [dev] Flash
GPT Image 1.5

AI Judge Analysis

FLUX.2 [dev] Flash

  • + Successfully replicates the character's clothing and accessories
  • + Maintains the environment and lighting of the source image
  • Serious anatomical failure, merging the hair of the original woman into the new character's torso
  • Failed to recreate the bent-over pose, opting for a generic standing-while-crouched position

GPT Image 1.5

  • + Achieves a much closer approximation of the crossed-leg pose from the source image
  • + High-fidelity recreation of the character's face, sunglasses, and scarf
  • + Correctly places the character in the environment without ghostly artifacts
  • Failed to angle the head/torso downward to match the extreme 'dynamic pose' of the woman in Image 1
  • The hands and feet have minor clarity issues compared to the source

Verdict: GPT Image 1.5 performed significantly better by accurately blending the two images into a coherent person, whereas FLUX.2 [dev] Flash suffered a major hallucination by merging the woman's hair from the background into the man's stomach. While GPT Image 1.5 did not fully capture the extreme downward lean of the original pose, it maintained the crossed legs and environmental integration far more skillfully.

The Reversed Rodeo

Text-to-Image

“Horse riding astronaut in space — horse on top, not vice versa. Surreal, highly detailed, cinematic.”

FLUX.2 [dev] Flash
GPT Image 1.5

AI Judge Analysis

FLUX.2 [dev] Flash

  • + Excellent anatomical details on the horse's coat and mane.
  • + Clear and believable cinematic lighting.
  • + High contrast between the subject and the celestial background.
  • Completely failed the negative constraint to have the horse on top.
  • Severe anatomical glitch with a third hind leg appearing behind the horse.

GPT Image 1.5

  • + Dynamic composition with a sense of motion and dust.
  • + Richly textured lunar surface and complex background elements.
  • + High level of detail in the space suit and horse harness.
  • Failed the negative constraint to have the horse on top of the astronaut.
  • The scale of the rings on the gas giant in the background looks slightly disconnected from the perspective.

Verdict: Both FLUX.2 [dev] Flash and GPT Image 1.5 failed the difficult prompt instruction to invert the rider roles (horse on top). While both models delivered high-quality cinematic imagery of an astronaut riding a horse, GPT Image 1.5 is the winner because it lacks the jarring structural failure of a third leg seen in the FLUX.2 image.

Outfit Transfer Challenge

Editing
Edit instruction

“Use Image 1 as the base person. Dress them in the exact elaborate outfit from Image 2 (including all layers, accessories, jewelry, and shoes). Carefully adapt the clothing to the body shape and pose in Image 1 while maintaining realistic fabric behavior, correct proportions, and perfect lighting/shadow matching. Keep the person’s exact face, hair, and background completely unchanged.”

Source
FLUX.2 [dev] Flash
GPT Image 1.5

AI Judge Analysis

FLUX.2 [dev] Flash

  • + Successfully applied the coat and scarf style from Image 2.
  • + Maintains the overall composition and background of Image 1.
  • Horrific facial distortion with multiple eyes and noses appearing on the face.
  • Altered the person's face and skin patterns significantly.
  • Added excessive, unrequested jewelry that was not in Image 2.

GPT Image 1.5

  • + Accurately transferred the clothing, scarf, and watch from Image 2.
  • + Maintains realistic body proportions and clothing fit.
  • + Preserved the background and lighting of Image 1 well.
  • Completely cropped out the head/face of the subject, failing the 'keep face and hair unchanged' instruction.
  • The skin visible on the hands does not match the vitiligo patterns from the source image.

Verdict: Both models failed significantly on the primary preservation instructions. FLUX.2 [dev] Flash attempted to keep the face but resulted in a nightmarish anatomical mess with garbled features, while GPT Image 1.5 avoided the face entirely by cropping it out of the frame. GPT Image 1.5 is the preferred failure as its clothing transfer is clean and accurate, whereas FLUX.2 added random jewelry and produced a visually broken subject.

The Capybara Taxi Driver

Text-to-Image

“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”

FLUX.2 [dev] Flash
GPT Image 1.5

AI Judge Analysis

FLUX.2 [dev] Flash

  • + Excellent photorealism in the skin and hair textures
  • + High-quality rendering of the businesswoman with clear details
  • + Clean and professional composition with a sharp 'Taxi' sign on top
  • The businesswoman is sitting in the passenger seat rather than the back seat as requested
  • The capybara's 'paws' look a bit like primate hands or gloves

GPT Image 1.5

  • + Correctly places the businesswoman in the back seat
  • + Wonderful expression and authentic taxi driver hat design on the capybara
  • + Great lighting and bokeh effect in the background streets
  • The capybara's claws/paws are slightly distorted and merge with the steering wheel
  • Slightly more motion blur or softness on the human face compared to Model A

Verdict: Model B (GPT Image 1.5) is the winner because it followed the spatial instructions more accurately, placing the businesswoman in the back seat, whereas FLUX.2 [dev] Flash placed her in the front passenger seat. Both models produced high-quality, photorealistic textures and captured the requested 'bored' and 'professional' expressions effectively.

The Halloween Invitation

Text-to-Image

“Vintage gothic Halloween party invitation. Dark parchment poster, spooky border with webs and thorns, central glowing jack-o-lantern, bats, twisted trees, moody night sky. Add elegant gothic title text saying "Halloween Party Invitation", a small scroll banner saying "You are invited to a night of frights", and event details at the bottom: Date: 30.10.2026 Time: 7pm Location: The Arches, NYC Spooky but polished, cinematic lighting, square format.”

FLUX.2 [dev] Flash
GPT Image 1.5

AI Judge Analysis

FLUX.2 [dev] Flash

  • + Excellent text readability and crisp rendering
  • + Clean and polished graphical layout
  • + Good adherence to the requested border composed of webs and thorns
  • Includes a line of gibberish text 'Hurk: 0369c' in the event details
  • The lighting feels a bit flat compared to the cinematic request

GPT Image 1.5

  • + Atmospheric cinematic lighting with a strong vintage aesthetic
  • + Accurate and elegant rendering of all requested text strings
  • + Richly detailed background with graveyard elements and a moon
  • The thorns in the border are slightly less distinct than Model A
  • Text is a bit more integrated into the texture, which can slightly reduce legibility

Verdict: Both models followed the prompt exceptionally well, but GPT Image 1.5 is the winner because it successfully rendered all text strings correctly without adding artifacts. While FLUX.2 [dev] Flash produced very clean graphics, it included a line of hallucinated text that detracts from the professional quality of the invitation.

Bald man challenge

Image Editing
Edit instruction

“Give the person a full, thick head of natural hair with realistic texture, density, and a natural hairline. Preserve facial features and lighting.”

Before After
FLUX.2 [dev] Flash
Before After
GPT Image 1.5

AI Judge Analysis

FLUX.2 [dev] Flash

  • + Excellent preservation of the source image facial features and clothing.
  • + Realistic hair texture and natural integration with the sideburns.
  • The volume and style of the hair (afro-style) might be an unexpected interpretation of 'natural hair' for the subject's existing beard texture.

GPT Image 1.5

  • + Perfectly matches the hair texture to the existing beard texture.
  • + Exceptional preservation of the source image surroundings and identity.
  • The hairline on the forehead looks slightly sharp and lacks a few fine transitional baby hairs found in natural growth.

Verdict: Both models performed exceptionally well, maintaining the original person's features, clothing, and background almost perfectly. FLUX.2 [dev] Flash went for a very high-volume style that looks realistic but slightly overwhelming, while GPT Image 1.5 chose a more conservative style that matches the existing beard texture perfectly, making it the more seamless edit.

Isometric Miniature Diorama Scenes

Text-to-Image

“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”

FLUX.2 [dev] Flash
GPT Image 1.5

AI Judge Analysis

FLUX.2 [dev] Flash

  • + Excellent adherence to the 'minimal' requirement with a clean, focused composition.
  • + Perfect text layout and typography that matches the prompt instruction for 'JAPAN' and 'SUSHI'.
  • + High-quality soft lighting and refined textures that give a premium 3D render feel.
  • The sushi roll construction is a bit nonsensical with fish draped over a roll that already contains a complex center.

GPT Image 1.5

  • + Highly detailed environment with a complex diorama base featuring grass and multiple materials.
  • + Includes a wide variety of sushi types that are very well rendered.
  • + Strong sense of 'miniature' scale with realistic proportions for the teapot and soy sauce bottle.
  • Failed the 'minimal garnish and plate' instruction by adding many extra props.
  • The font for 'SUSHI' is smaller and less prominent compared to the requested bold style.
  • The lighting is slightly harsher and less 'gentle' than the other model.

Verdict: FLUX.2 [dev] Flash adhered much better to the specific aesthetic constraints, particularly the request for a 'minimal' setup and the specific text hierarchy. While GPT Image 1.5 produced a more complex and visually rich scene, it ignored the negative constraint of minimal garnish, resulting in a cluttered diorama compared to the ultra-clean look achieved by FLUX.2 [dev] Flash.

Over-the-top cartoon caricature

Editing
Edit instruction

“Create a caricature of me and my job. Make it exaggerated and humorous, incorporating my profession as a tv show anchor and my love for dogs and hockey.”

Source
FLUX.2 [dev] Flash
GPT Image 1.5

AI Judge Analysis

FLUX.2 [dev] Flash

  • + Excellent preservation of the subject's facial features in a caricature style.
  • + Perfect blending of all themes, including dogs wearing hockey gear on a news set.
  • + Very clean and professional illustration style with high resolution.
  • The 'TV SHOW' text is a bit generic.
  • The tiny body compared to the large head is a very traditional and perhaps less creative caricature style.

GPT Image 1.5

  • + Very creative layout that feels like a real action-packed news broadcast.
  • + Good use of text overlays to enhance the theme of the job.
  • + High level of detail in the background, such as the camera and the hockey player on screen.
  • Subject's face is slightly less recognizable compared to the source image than Model A.
  • The dogs' interaction with the hockey sticks is a bit messy (e.g., the stick goes through the dog's mouth).

Verdict: Both models did an exceptional job capturing the request. FLUX.2 [dev] Flash provides a more accurate caricature of the woman's face and creates a cohesive scene where the dogs are truly part of the hockey team. GPT Image 1.5 offers a more dynamic composition with better 'news' context and text, but it loses slightly more of the subject's likeness and has minor technical clipping issues with the dog and hockey stick.

Adorable Baby Animals in Sunny Meadow

Text-to-Image

“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”

FLUX.2 [dev] Flash
GPT Image 1.5
50% wins 0% ties 50% wins

AI Judge Analysis

FLUX.2 [dev] Flash

  • + Excellent variety of animals that match the specific prompt ingredients.
  • + High clarity and realistic lighting with clear god rays.
  • + Balanced composition that fills the frame well without feeling cluttered.
  • Includes an extra fifth animal (a second rabbit/hare) not requested in the prompt.
  • Butterflies feel a bit static and repetitious in their placement.

GPT Image 1.5

  • + Perfect adherence to the animal count and types requested.
  • + Great dynamic action with the kitten in a 'tumbling' pose.
  • + Strong rendering of 'warm golden sunrise light' and dew sparkles on the flowers.
  • The fox's eyes and snout look slightly more 'plush toy' than hyper-photorealistic.
  • The puppy's front paw has slightly odd toe definition.

Verdict: FLUX.2 [dev] Flash produces a cleaner, more technically impressive image with superior lighting, but it fails the prompt adherence check by adding a fifth animal. GPT Image 1.5 captures the 'tumbling' energy of the prompt much better and follows the numerical instructions exactly, making it the more reliable choice for this specific request.

Studio Ghibli Anime Style

Editing
Edit instruction

“Transform this photo into a Studio Ghibli–inspired illustration. Use soft pastel colors, hand-painted textures, gentle lighting, dreamy backgrounds, and a warm, nostalgic mood”

Source
FLUX.2 [dev] Flash
GPT Image 1.5

AI Judge Analysis

FLUX.2 [dev] Flash

  • + Excellent preservation of the source image's composition and character poses.
  • + Effective use of Ghibli-style character linework and watercolor textures.
  • + Maintains the distinct plaid pattern of the shirt very accurately.
  • Replaces the urban street background with a generic flowery field, failing to preserve that part of the source image.
  • The lighting is a bit flat compared to the requested 'dreamy' mood.

GPT Image 1.5

  • + Successfully captures the warm, nostalgic, and dreamy Ghibli lighting.
  • + Preserves the street background setting better than the other model.
  • + Applies a very soft, painterly texture that aligns with the prompt.
  • The faces look more like generic modern anime than the specific Ghibli aesthetic.
  • The woman in the foreground is excessively blurred, losing the 'illustration' feel.

Verdict: FLUX.2 [dev] Flash does a superior job of translating the iconic memes' characters into the Ghibli art style while keeping their exact poses, although it completely changes the background to a field. GPT Image 1.5 captures the requested 'dreamy' lighting and warm mood much better and keeps the street setting, but the character faces feel less authentic to the Ghibli inspiration. FLUX.2 [dev] Flash is the likely winner for its precise stylistic translation and character preservation.

Golden Hour Stroll

Image Editing
Edit instruction

“Add dynamic motion to this photo: make hair blow in the wind, add leaves flying, energetic and lively feel.”

Before After
FLUX.2 [dev] Flash
Before After
GPT Image 1.5

AI Judge Analysis

FLUX.2 [dev] Flash

  • + Successfully added blowing hair that flows naturally behind the subject.
  • + Added a large number of falling leaves to create the requested dynamic atmosphere.
  • + Preserved the background bridge and landscape elements well.
  • The leaf rendering is somewhat muddy and lacks clear motion blur.
  • The subject's face underwent slight changes in shape and lighting compared to the original.

GPT Image 1.5

  • + Excellent hair animation that feels energetic and well-integrated.
  • + Leaves have varied colors and realistic motion blur, enhancing the sense of movement.
  • + Maintained high fidelity to the subject's facial features and the dog's appearance.
  • The leash loop in the character's hand was slightly altered and duplicated.

Verdict: Both models followed the instructions well, but GPT Image 1.5 produced a superior result by utilizing better motion blur on the leaves, which creates a stronger sense of movement. While FLUX.2 [dev] Flash also succeeded in the hair edit, its leaves look more like static overlays and it slightly altered the subject's face.

Vintage Cafe Logo

Text-to-Image

“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”

FLUX.2 [dev] Flash
GPT Image 1.5

AI Judge Analysis

FLUX.2 [dev] Flash

  • + Excellent typography with correct accents and alignment
  • + Perfect adherence to the light background and subtle texture request
  • + Clean vector emblem style with a professional balance
  • The 'Est. 1720' text is placed outside the main banner element

GPT Image 1.5

  • + Strong vintage aesthetic with woodcut-style shading
  • + Accurate placement of 'Est. 1720' within a banner
  • + Sophisticated font choice for the main title
  • Failed to provide a light background, opting for black instead
  • The vector lines are slightly less clean and consistent than Model A

Verdict: FLUX.2 [dev] Flash followed the background and color instructions much more accurately than GPT Image 1.5, which ignored the 'light background' prompt completely. While GPT Image 1.5 captured the 'classic typography' well, FLUX.2 [dev] Flash produced a cleaner, more usable logo emblem that perfectly captured the requested warm brown and cream tones.

Apollo 11: Journey to Tranquility

Text-to-Image

“Create a clean, modern vector infographic poster about the Apollo 11 mission. NASA-inspired palette (navy, white, muted red, light gray). Flat-vector style, crisp lines, consistent iconography, subtle gradients only. Steps (stop at landing): 1. Launch (Saturn Vicon) 2. Earth Orbit (Earth + orbit ring icon) 3. Translunar (trajectory arc icon) 4. Lunar Orbit (Moon + orbit ring icon) 5. Descent (lunar module descending icon) 6. Landing (lunar module on the surface icon) Small supporting elements (minimal text): • Crew strip: three silhouette icons with only last names: Armstrong, Aldrin, Collins. • Landing site marker: Moon pin labeled "Tranquility" only. Layout constraints: generous margins, large readable labels, clean background with subtle stars. Vector-only, print-poster look, high resolution.”

FLUX.2 [dev] Flash
GPT Image 1.5
0% wins 50% ties 50% wins

AI Judge Analysis

FLUX.2 [dev] Flash

  • + Includes all requested steps and icons with clear labels.
  • + Higher details in the Lunar Module icons.
  • + Accurately renders the names of all three astronauts.
  • Composition is cluttered and disorganized.
  • Contains redundant labels and hallucinatory text (e.g., 'Sataurr' Iccòn').
  • Style leans towards 'illustrated' rather than the requested 'clean flat-vector'.

GPT Image 1.5

  • + Excellent adherence to the 'flat-vector' and 'clean infographic' aesthetic.
  • + Superior layout with logical flow and consistent iconography.
  • + Stronger adherence to the specified NASA-inspired color palette.
  • The 'Apollo 11' title is partially cut off at the top.
  • The translunar icon is simplified compared to the complex trajectory requested.
  • Includes some minor alignment issues between labels and icons.

Verdict: GPT Image 1.5 is the winner because it successfully captures the 'clean, modern vector infographic' aesthetic requested, using a clear vertical flow and consistent icons. While FLUX.2 [dev] Flash included more literal details, its composition is messy, redundant, and contains significant text hallucinations that detract from the professional poster feel.

Next steps

Explore each model