Black Forest Labs' open-weights multimodal flow transformer for in-context image generation and editing, available for non-commercial use, with character consistency and style transfer capabilities.
Settled by community votes across 1 shared challenge, with an AI judge weighing in on each.
FLUX.1 Kontext [dev]
#43 of 44 in Text-to-Image
Grok Imagine Image
#19 of 44 in Text-to-Image
Where the votes landed
FLUX.1 Kontext [dev]: 0.0% win rate
Ties: 0.0%
Grok Imagine Image: 100.0% win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Magic Burger Explosion: Fiery Photorealism Challenge
Text-to-Image: “Ad for 'Magic Burger'. Dynamic, exploded burger with all components (bun, patty, cheese, lettuce, tomato, sauce) suspended in mid-air. Emphasize photorealistic detail and a sense of motion. Dark, fiery background with glowing embers. Integrate text: 'MAGIC BURGER' as a prominent title, 'LIMITED TIME ONLY' as a secondary message, and '€6.99' in a starburst, all rendered with a fiery, glowing effect.”
AI Judge Analysis
FLUX.1 Kontext [dev]
- + Excellent text legibility and clean graphic design
- + High-quality rendering of the charcoal and flame base
- − Completely failed the 'exploded burger' requirement; the burger is fully assembled
- − Spelling error in 'ONLY' (rendered as 'LNHLY')
Grok Imagine Image
- + Perfectly captured the 'exploded' effect with suspended components
- + All text is spelled correctly and features the requested fiery glow
- + Strong sense of motion with sauce splashes and flying ingredients
- − Composition is slightly cluttered with sauce droplets
- − The starburst design is a bit generic compared to the high-detail background
Verdict: Grok Imagine Image is the clear winner as it followed the complex 'exploded burger' instruction perfectly, whereas FLUX.1 Kontext [dev] generated a standard assembled burger. Grok also maintained perfect spelling across all text elements, while FLUX.1 had a typo in the secondary message.
Explore each model
An image generation model by xAI designed to generate highly aesthetic images from text descriptions.