FLUX.2 [dev] Flash vs GPT Image 1 Mini
Head-to-head across 6 challenges
FLUX.2 [dev] Flash: 50.0% win rate
Ties: 25.0%
GPT Image 1 Mini: 25.0% win rate
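The page does not state how these percentages are tallied, for example whether each challenge is judged more than once. As a minimal sketch, assuming every comparison yields exactly one outcome (a win for either model or a tie), the shares could be computed as below. The function name, the outcome labels, and the example list are hypothetical and are not taken from the results on this page.

```python
from collections import Counter


def outcome_shares(outcomes):
    """Return each outcome's share of all comparisons, as a percentage."""
    counts = Counter(outcomes)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no outcomes to summarise")
    return {label: 100.0 * n / total for label, n in counts.items()}


# Hypothetical tallies for illustration only (not this page's data):
# two wins for the first model, one tie, one win for the second model.
example = ["model_a", "model_a", "tie", "model_b"]
shares = outcome_shares(example)
print(shares)
```

Counting ties as their own outcome keeps the three shares summing to 100%, which matches how the summary above reports them.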
Challenge Results
Geometric Composition
Text-to-Image: “A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”
AI Judge Analysis
FLUX.2 [dev] Flash
- + Excellent physical accuracy with the sphere resting on the bottom of the cube.
- + Highly realistic textures on the glass, including dust and fingerprints.
- + Correct distortion and refraction of the plant visible through the glass cube.
- − The sphere has a marble-like texture which might be seen as less 'pure' than a solid blue sphere.
GPT Image 1 Mini
- + Clean, minimalist aesthetic with smooth surfaces.
- + Adheres to all requested objects in the prompt.
- − The blue sphere is floating unnaturally in the center of the cube, defying gravity.
- − The plant is not visible through the glass as requested, but occupies the space behind the book/cube.
Verdict: FLUX.2 [dev] Flash produces a much more realistic image with convincing physics and light refraction; it correctly places the sphere on the bottom of the cube and shows the plant through the glass. GPT Image 1 Mini fails on physics by showing a floating sphere and misses the prompt requirement to have the plant visible through the glass.
Candid Street Photography
Text-to-Image: “A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”
AI Judge Analysis
FLUX.2 [dev] Flash
- + Excellent adherence to the 'motion blur' requirement with realistic passing cars.
- + Very high skin and clothing detail, appearing highly realistic and un-stylized.
- + Bicycle mechanics and tools on the ground add to the narrative and realism.
- − The hands have some minor structural clipping issues with the bicycle brake cable.
- − The front wheel of the bicycle is slightly disconnected from the frame's fork.
GPT Image 1 Mini
- + Strong composition with a side profile that emphasizes the man's expression.
- + Good wet pavement reflections and rainy atmosphere.
- + Natural looking pose for someone repairing a bicycle wheel.
- − Failed to include the requested 'motion blur from passing cars'.
- − The bicycle frame geometry is nonsensical, especially where the seat post and rear wheel meet.
- − The image has a slightly softer, more digital look compared to the requested 'no stylization'.
Verdict: FLUX.2 [dev] Flash followed the prompt much more closely, successfully incorporating the motion blur from passing cars which GPT Image 1 Mini ignored. While both models struggled with perfect bicycle anatomy, FLUX.2 [dev] Flash delivered a much more realistic texture and a complex, crowded street scene that feels like a genuine candid photo.
Fantasy Warrior
Text-to-Image: “Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”
AI Judge Analysis
FLUX.2 [dev] Flash
- + Excellent adherence to the 'beads in hair' prompt element.
- + Incredible detail on the engraving, leather straps, and chainmail textures.
- + Superior sharpness and lighting on the facial features.
- − The torches in the background are a bit literal and sharp, slightly distracting from the bokeh effect.
GPT Image 1 Mini
- + Strong cinematic atmosphere with very warm, believable torchlight reflections.
- + Good implementation of shallow depth of field and bokeh sparks.
- + Natural skin texture and battle-worn appearance.
- − Missed the 'beads' in the braided hair entirely.
- − The armor engraving is less sharp and detailed than FLUX.2 [dev] Flash's.
Verdict: FLUX.2 [dev] Flash is the clear winner for its superior prompt adherence, particularly the inclusion of beads in the hair, which GPT Image 1 Mini omitted. FLUX.2 [dev] Flash also provides significantly more detail in the armor engravings and leather textures, making for a much more striking high-resolution portrait.
Chalkboard Menu
Text-to-Image: “Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”
AI Judge Analysis
FLUX.2 [dev] Flash
- + Excellent chalk texture with realistic smudges and dust on the board.
- + Strong adherence to the 'elegant cursive' request for the title.
- + The text looks genuinely handwritten with natural variation in stroke weight.
- − Some minor messy overlapping of letters in the bottom section.
- − The last price line is slightly broken up by the bottom edge layout.
- − The cursive can be slightly harder to read in certain places.
GPT Image 1 Mini
- + Perfect legibility of all requested text.
- + Great layout and alignment of prices.
- + Clean, professional presentation of the menu items.
- − The text looks more like a digital font or chalk marker rather than traditional 'handwritten' chalk.
- − The title lacks the requested 'elegant cursive' style.
- − The chalk texture is too uniform and lacks realistic dust variations.
Verdict: FLUX.2 [dev] Flash captures the requested 'handwritten chalk' aesthetic much more effectively, featuring realistic chalk dust, smudges, and diverse cursive lettering. While GPT Image 1 Mini provides perfect legibility, its output feels too much like a digital chalkboard font and fails to follow the specific 'elegant cursive' instruction for the title.
The Capybara Taxi Driver
Text-to-Image: “Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”
AI Judge Analysis
FLUX.2 [dev] Flash
- + Excellent photorealism with sharp textures on the capybara's fur and the woman's clothing.
- + Captures the bored, mundane expression of the businesswoman perfectly.
- + Cleverly positioned as if the camera is outside looking through the windshield with accurate taxi branding.
- − The passenger is sitting in the front passenger seat rather than the back seat as requested.
GPT Image 1 Mini
- + Correctly positions the passenger in the back seat.
- + Accurate lighting and atmospheric 'night street' bokeh through the window.
- − The capybara only has one paw visible on the steering wheel.
- − The passenger's face is slightly out of focus and less detailed compared to the driver.
Verdict: Both models captured the surrealism of the prompt very well. FLUX.2 [dev] Flash had superior overall clarity and better followed the character expression details, but it failed the spatial instruction of placing the passenger in the back seat. GPT Image 1 Mini adhered more closely to the physical layout and seating arrangement requested, though its textures are slightly softer.
Adorable Baby Animals in Sunny Meadow
Text-to-Image: “Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”
AI Judge Analysis
FLUX.2 [dev] Flash
- + Excellent adherence to the 'dew sparkles' prompt with visible, crystalline drops on the grass.
- + Very high detail in the fur textures and butterfly wing patterns.
- + Dynamic composition that feels like a 'tumble' as requested.
- − Included an extra animal (two rabbits/rodents) which was not in the prompt.
- − The lighting feels a bit more like a digital composite rather than a natural photograph.
GPT Image 1 Mini
- + Perfectly captured the requested animal count (one of each species).
- + Better sense of motion with animals mid-air, matching the 'chasing' and 'tumbling' description.
- + Warm, natural lighting that feels cohesive across all subjects.
- − Lower contrast in the fur details compared to FLUX.2 [dev] Flash.
- − The butterflies are less detailed and fewer in number.
Verdict: Both models followed the prompt well, but GPT Image 1 Mini adhered more accurately to the specific list of animals provided, whereas FLUX.2 [dev] Flash added an extra creature. However, FLUX.2 [dev] Flash delivered superior fine details in the fur and environment, particularly with the dew sparkles and butterfly textures, making it more visually impressive.
FLUX.2 [dev] Flash
A fast, distilled version of Black Forest Labs' FLUX.2 [dev], optimized for speed and cost efficiency.
GPT Image 1 Mini
OpenAI's cost-effective image generation model for when image quality isn't the top priority.