Black Forest Labs' open-weights multimodal flow transformer for in-context image generation and editing, available for non-commercial use with character consistency and style transfer capabilities
Settled by community votes across 1 shared challenge, with an AI judge weighing in on each.
FLUX.1 Kontext [dev]
#43 of 44 in Text-to-Image
Not enough comparable category data
The chart appears once both models have ratings across at least three shared arena categories.
GPT Image 1.5
#7 of 44 in Text-to-Image
Where the votes landed
FLUX.1 Kontext [dev]
0.0%
win rate
Ties
0.0%
GPT Image 1.5
100.0%
win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Magic Burger Explosion: Fiery Photorealism Challenge
Text-to-Image“Ad for 'Magic Burger'. Dynamic, exploded burger with all components (bun, patty, cheese, lettuce, tomato, sauce) suspended in mid-air. Emphasize photorealistic detail and a sense of motion. Dark, fiery background with glowing embers. Integrate text: 'MAGIC BURGER' as a prominent title, 'LIMITED TIME ONLY' as a secondary message, and '€6.99' in a starburst, all rendered with a fiery, glowing effect.”
AI Judge Analysis
FLUX.1 Kontext [dev]
- + Excellent typography rendering for 'MAGIC BURGER'.
- + Good use of the starburst element for the price.
- + High resolution and clean composition.
- − Failed the main request for an 'exploded' burger, showing a fully assembled one instead.
- − Spelling error in 'LIMITED TIME ONLY' (rendered as 'LNHLY').
- − The background coals look a bit generic and static compared to the dynamic request.
GPT Image 1.5
- + Perfect adherence to the 'exploded' burger request with dynamic suspended components.
- + High level of texture detail on the patty, bun, and vegetables.
- + Successfully applied the fiery, glowing effect to both the background and the text.
- − The pricing starburst is a bit jagged and overlaps the burger.
- − Some visual clutter from the excessive ember sparks near the top text.
Verdict: GPT Image 1.5 is the clear winner as it followed the complex 'exploded burger' instruction and maintained high realism across all food components. FLUX.1 Kontext [dev] failed the primary layout request by showing a standard assembled burger and included a spelling typo in the secondary text.
Explore each model
OpenAI's state-of-the-art image generation model with better instruction following and adherence to prompts