FLUX.1 [schnell] FP8 vs GPT Image 2

Head-to-head across 3 challenges

FLUX.1 [schnell] FP8: 0.0% win rate

Ties: 0.0%

GPT Image 2: 100.0% win rate
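
The win rates above follow from a simple tally over the three challenges. A minimal Python sketch of that arithmetic, assuming each challenge records a single winner; the `results` mapping, the `win_rates` helper, and the `"tie"` label are hypothetical names chosen for illustration, not part of the arena's own tooling:

```python
from collections import Counter

# Winner of each challenge, as reported in the verdicts on this page.
# A tied challenge would be recorded as "tie" (an assumed convention).
results = {
    "Chalkboard Menu": "GPT Image 2",
    "The Reversed Rodeo": "GPT Image 2",
    "The Halloween Invitation": "GPT Image 2",
}


def win_rates(results, models):
    """Return the percentage of challenges won by each model, plus ties."""
    counts = Counter(results.values())
    total = len(results)
    return {name: 100.0 * counts.get(name, 0) / total for name in models + ["tie"]}


rates = win_rates(results, ["FLUX.1 [schnell] FP8", "GPT Image 2"])
for name, rate in rates.items():
    print(f"{name}: {rate:.1f}%")
# FLUX.1 [schnell] FP8: 0.0%
# GPT Image 2: 100.0%
# tie: 0.0%
```

With only three challenges, each result moves a win rate by about 33 percentage points, so the 100.0% figure reflects a small sample.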

Challenge Results

Chalkboard Menu

Text-to-Image

“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”

[Image outputs: FLUX.1 [schnell] FP8, GPT Image 2]

AI Judge Analysis

FLUX.1 [schnell] FP8

+ Very high image clarity and sharpness.
+ Realistic lighting and wood texture on the frame.

− Severely failed text rendering with repeated words and nonsensical phrases.
− Incorrect prices and layout compared to the prompt.
− Failed to render cursive for the title as requested.

GPT Image 2

+ Perfect text rendering of all requested items and dates.
+ Excellent adherence to the 'chalk texture' and 'cursive title' instructions.
+ Consistent and realistic handwriting style throughout the board.

− Slightly lower overall resolution/sharpness compared to the other model.
− The lighting is a bit dim in the lower corners.

Verdict: GPT Image 2 is the clear winner as it followed every text and styling instruction perfectly, including complex menu items and specific dates. FLUX.1 [schnell] FP8 struggled significantly with the text, producing repetitive gibberish and failing to follow the price or cursive requirements.

The Reversed Rodeo

Text-to-Image

“Horse riding astronaut in space — horse on top, not vice versa. Surreal, highly detailed, cinematic.”

[Image outputs: FLUX.1 [schnell] FP8, GPT Image 2]

FLUX.1 [schnell] FP8: 0% wins, Ties: 0%, GPT Image 2: 100% wins

AI Judge Analysis

FLUX.1 [schnell] FP8

+ Excellent cinematic lighting and color palette.
+ High visual clarity and artistic composition with the Earth background.

− Fails the prompt requirement of a horse riding an astronaut.
− Anatomical horror with a horse head emerging from another horse's back.

GPT Image 2

+ Perfect adherence to the complex prompt instruction of the horse on top.
+ Incredibly high detail on the space suit, NASA logo, and lunar surface.
+ Accurate reflection in the astronaut's visor.

− Minor leather strap clipping through the horse's front leg.

Verdict: GPT Image 2 followed the specific and difficult 'horse on top' instruction perfectly, showing a horse literally riding an astronaut on all fours. FLUX.1 [schnell] FP8 failed the core concept of the prompt, instead generating a surreal two-headed horse that ignores the 'riding astronaut' requirement entirely.

The Halloween Invitation

Text-to-Image

“Vintage gothic Halloween party invitation. Dark parchment poster, spooky border with webs and thorns, central glowing jack-o-lantern, bats, twisted trees, moody night sky. Add elegant gothic title text saying "Halloween Party Invitation", a small scroll banner saying "You are invited to a night of frights", and event details at the bottom: Date: 30.10.2026 Time: 7pm Location: The Arches, NYC Spooky but polished, cinematic lighting, square format.”

[Image outputs: FLUX.1 [schnell] FP8, GPT Image 2]

AI Judge Analysis

FLUX.1 [schnell] FP8

+ Strong cinematic lighting with a vibrant center glow.
+ Clean layout for the main title text.

− Numerous spelling errors in the banner and the event details.
− The border feels abstract and lacks the requested web and thorn detail.
− Redundant and confusing time information.

GPT Image 2

+ Perfect text rendering for all requested titles and event details.
+ Intricate gothic border involving webs, thorns, and skulls.
+ Highly detailed background containing the bridges (arches) and NYC skyline.

− The parchment texture is very busy, slightly obscuring smaller details.
− The pumpkin light is a bit flatter compared to the glow effect in the FLUX.1 [schnell] FP8 image.

Verdict: GPT Image 2 is the clear winner for its superior text accuracy, following every specific instruction for dates and locations without spelling errors. It also captures the 'Vintage Gothic' aesthetic much better with its detailed thorns, webs, and thematic background, whereas FLUX.1 [schnell] FP8 struggled significantly with the spelling and the complexity of the border.

FLUX.1 [schnell] FP8

FP8-quantized variant of Black Forest Labs' FLUX.1 [schnell] model, offering ~2x faster inference at reduced precision while maintaining high-quality image generation in 4 steps.


GPT Image 2

OpenAI's state-of-the-art image generation model, with arbitrary resolution up to 4K and strong instruction following.
