FLUX.2 [dev] vs GPT Image 1 Mini

Head-to-head across 4 challenges

FLUX.2 [dev]

66.7%

win rate

Ties

0.0%

GPT Image 1 Mini

33.3%

win rate

66.7% 0.0% ties 33.3%

Challenge Results

Geometric Composition

Text-to-Image

“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”

FLUX.2 [dev]

GPT Image 1 Mini

50% wins 0% ties 50% wins

AI Judge Analysis

FLUX.2 [dev]

+ Excellent rendering of glass physics and refractions
+ The blue sphere has a realistic glass marble texture that fits the scene
+ Accurate lighting consistency between the window and reflections on the sphere

− The sphere appears to be floating unnaturally instead of resting on the bottom

GPT Image 1 Mini

+ Perfect adherence to all spatial instructions
+ The book has a very realistic paper texture on the side
+ Clean, minimalist composition

− The blue sphere has a matte, opaque texture that looks like foam or plastic rather than fitting the glass aesthetic
− The sphere is floating in the center of the cube, which feels gravity-defying

Verdict: Both models followed the complex spatial prompt perfectly. FLUX.2 [dev] produces a more cohesive aesthetic with beautiful glass refractions and a marble-like sphere, while GPT Image 1 Mini provides a cleaner look but with a matte sphere that feels slightly disconnected from the glass environment. FLUX.2 [dev] is preferred for its superior handling of light and transparency.

Candid Street Photography

Text-to-Image

“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”

FLUX.2 [dev]

GPT Image 1 Mini

100% wins 0% ties 0% wins

AI Judge Analysis

FLUX.2 [dev]

+ Excellent adherence to the motion blur request with realistic passing vehicles.
+ Very high skin and clothing detail that captures the 'natural texture' prompt perfectly.
+ The environment feels more authentically like a busy Japanese street.

− Anatomy and logistics are slightly messy where his hands meet the bike frame.
− The bike's physical structure is a bit nonsensical near the handlebars.

GPT Image 1 Mini

+ Clean, professional composition with a clear focus on the subject.
+ The bicycle structure is more coherent and traditional.
+ The lighting on the wet pavement is soft and atmospheric.

− Failed to include the requested motion blur from passing cars.
− The image looks slightly more like a staged portrait than a 'candid street photo'.
− Missing the 'imperfect framing' requested in the prompt.

Verdict: FLUX.2 [dev] followed the technical requirements of the prompt much better, specifically capturing the motion blur of passing cars and the 'imperfect framing' of a candid shot. GPT Image 1 Mini produced a beautiful, clean image, but it ignored several key prompt instructions regarding motion and framing, leading to a more static and staged appearance. FLUX.2 [dev] is the winner for its superior realism and adherence to the specific atmospheric cues requested.

Fantasy Warrior

Text-to-Image

“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”

FLUX.2 [dev]

GPT Image 1 Mini

AI Judge Analysis

FLUX.2 [dev]

+ Excellent adherence to the 'beads in hair' prompt element.
+ Superior textural detail on the leather straps and metal engravings.
+ Dynamic lighting with visible torch sources and realistic skin battle-damage.

− The facial scars look a bit like fresh face paint or ink rather than healed tissue.
− The bokeh sparks are a bit large and distracting.

GPT Image 1 Mini

+ Very realistic, battle-worn skin texture and subtle grime.
+ Sophisticated, muted color palette and realistic lighting integration.
+ Excellent armor engraving detail that feels historical.

− Failed to include the specific 'small beads' in the hair.
− The bokeh sparks are very subtle and almost look like noise in some areas.
− Missing the 'leather straps and cloth underlayer' mentioned in the prompt.

Verdict: FLUX.2 [dev] followed the prompt more closely, specifically including the beads in the hair and the leather straps which GPT Image 1 Mini omitted. While GPT Image 1 Mini offered a very gritty and realistic skin texture, FLUX.2 [dev] provided the comprehensive detail and adherence requested by the prompt.

Adorable Baby Animals in Sunny Meadow

Text-to-Image

“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”

FLUX.2 [dev]

GPT Image 1 Mini

AI Judge Analysis

FLUX.2 [dev]

+ Excellent fur detail and individual hair rendering
+ Superior lighting with realistic rim light and soft 'god rays'
+ Highly expressive and realistic animal anatomy

− Includes two bunnies instead of the requested 'a baby bunny'
− Animals are relatively static/posed rather than 'playfully chasing'

GPT Image 1 Mini

+ Matches the 'playfully chasing' and 'tumbling' action better than the other model
+ Follows the exact count of one animal per species requested
+ Vibrant colors and cheerful mood

− The fox's front paws are strangely formatted and lack distinct toes/claws
− The rabbit's eye appears a bit flat and less realistic
− Slightly more 'digital' feel compared to the photographic texture of Model A

Verdict: FLUX.2 [dev] produces a significantly more high-fidelity, photorealistic image with beautiful lighting and fur textures, though it fails on the exact count by adding a second rabbit. GPT Image 1 Mini captures the requested movement and action much better, showing the animals in mid-air, but it suffers from anatomical artifacts in the fox's paws and a less sophisticated lighting engine. FLUX.2 [dev] is the winner for its professional photographic quality and stunning detail.

FLUX.2 [dev]

Black Forest Labs' open-weights image generation model with frontier performance, available for non-commercial local deployment

View Model Arena

GPT Image 1 Mini

OpenAI's cost-effective image generation model for when image quality isn't the top priority

View Model Arena