Black Forest Labs' open-weights image generation model with frontier performance, available for non-commercial local deployment
Settled by community votes across 4 shared challenges, with an AI judge weighing in on each.
FLUX.2 [dev]
#17 of 44 in Text-to-Image
Grok Imagine Image Pro
#14 of 44 in Text-to-Image
Where the votes landed
FLUX.2 [dev]: 100.0% win rate
Ties: 0.0%
Grok Imagine Image Pro: 0.0% win rate
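For readers who want to reproduce figures like these, the sketch below shows one plausible way to turn raw ballots into the three percentages. It assumes each share is simply that outcome's count divided by the total number of ballots, which is consistent with the three figures summing to 100% but is not confirmed by the page. The function name `vote_shares`, the `"tie"` label, and the sample ballots are all hypothetical; the real vote counts are not published here.

```python
from collections import Counter


def vote_shares(votes, model_a, model_b):
    """Return the percentage of ballots won by model_a, tied, and won by model_b.

    Assumption: each share is count / total ballots, rounded to one decimal.
    """
    total = len(votes)
    if total == 0:
        raise ValueError("no votes to summarize")
    counts = Counter(votes)  # missing keys count as 0
    return {
        model_a: round(100.0 * counts[model_a] / total, 1),
        "Ties": round(100.0 * counts["tie"] / total, 1),
        model_b: round(100.0 * counts[model_b] / total, 1),
    }


# Hypothetical ballots (not the arena's real data): every vote goes to one model.
votes = ["FLUX.2 [dev]"] * 4
shares = vote_shares(votes, "FLUX.2 [dev]", "Grok Imagine Image Pro")
print(shares)
```

With a small number of ballots, a 100.0% to 0.0% split is easy to reach, so the percentages are best read alongside the challenge-by-challenge notes.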
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Geometric Composition
Text-to-Image: “A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”
AI Judge Analysis
FLUX.2 [dev]
- + Excellent rendering of the glass cube with realistic thickness and edge reflections.
- + The blue sphere has a beautiful translucent, marble-like quality.
- + The soft window light is masterfully handled, creating a gentle bloom from the left.
- − The blue sphere appears to be floating unnaturally rather than resting on the bottom of the cube.
- − The plant in the background is quite blurry, making it less distinct than requested.
Grok Imagine Image Pro
- + Perfect adherence to the plant placement, with leaves clearly visible through the glass.
- + The wooden table has a rich, weathered texture that feels very realistic.
- + The sphere is correctly grounded on the base of the cube.
- − The glass cube has strange, wavy optical distortions that don't match the straight exterior edges.
- − The reflection of the sphere on the right side of the glass looks like a second physical object rather than a reflection.
Verdict: Both models followed the complex spatial instructions perfectly. FLUX.2 [dev] produced a more aesthetically pleasing image with superior lighting and glass materials, though the sphere seems to float. Grok Imagine Image Pro handled the background plant and the physical placement of the sphere better, but suffered from odd reflections and internal glass distortions that detract from the realism.
Candid Street Photography
Text-to-Image: “A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”
AI Judge Analysis
FLUX.2 [dev]
- + Excellent execution of motion blur from passing cars
- + Realistic skin texture and candid feeling
- + Accurate depiction of a wet pavement with high-contrast reflections
- − Physical logic of the bicycle frame is distorted/glitched in the center
- − The man's lower body is merged awkwardly with the bike
Grok Imagine Image Pro
- + Clearer, more coherent bicycle anatomy and tool usage
- + Strong composition with a good sense of depth
- + Very natural skin and clothing textures
- − Fails to include the requested motion blur for passing cars
- − The rain is barely visible compared to the prompt's intent
Verdict: FLUX.2 [dev] followed the technical details of the prompt much better, specifically capturing the dynamic motion blur and the gritty, candid atmosphere of a rainy street. However, Grok Imagine Image Pro produced a much more physically coherent image with far fewer anatomical glitches, though it ignored the specific request for motion blur.
Fantasy Warrior
Text-to-Image: “Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”
AI Judge Analysis
FLUX.2 [dev]
- + Natural skin texture and believable scar rendering
- + Excellent implementation of warm torchlight reflections
- + Lifelike, expressive eyes with a candid portrait feel
- − The leather straps look slightly flat compared to the metalwork
- − The braid/bead arrangement is a bit chaotic and asymmetric
Grok Imagine Image Pro
- + Exceptional detail on the engraved armor including legible Latin text
- + High-contrast lighting creates a dramatic atmosphere
- + Perfectly realized beads in the hair according to the prompt
- − Skin texture appears slightly more 'digital' or smoothed than FLUX.2 [dev]'s
- − The bokeh sparks feel a bit like a flat overlay rather than integrated depth
Verdict: Both models followed the prompt exceptionally well, but Grok Imagine Image Pro stands out for the incredible intricacy of its armor engravings and the addition of legible thematic text. While FLUX.2 [dev] produced a more natural and lifelike facial portrait, Grok’s technical execution of specific material textures like the leather and metal makes it the stronger 'battle-worn paladin' image.
Adorable Baby Animals in Sunny Meadow
Text-to-Image: “Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”
AI Judge Analysis
FLUX.2 [dev]
- + Superior photorealism with realistic fur textures and lighting interactions.
- + Excellent adherence to the 'god rays' and 'dew sparkles' prompt requirements.
- + High-quality composition with a focused, cinematic field of view.
- − Included two bunnies instead of one.
- − The animals are relatively static rather than 'tumbling together' as requested.
Grok Imagine Image Pro
- + Better captures the action of 'playfully chasing' and 'tumbling together'.
- + Highly vibrant colors and clear, expressive eyes for all animals.
- + Good placement of butterflies throughout the frame.
- − Included two kittens instead of one.
- − The image has a more 'digital illustration' feel compared to the requested hyper-photorealism.
- − Lower depth of field makes the background look flatter than in FLUX.2 [dev]'s image.
Verdict: FLUX.2 [dev] produces a significantly more realistic and atmospheric image with beautiful lighting and textures, though it is a bit more static. Grok Imagine Image Pro does a better job of capturing the playful movement described in the prompt but fails to achieve the requested hyper-photorealism, appearing more like a clean digital composite. FLUX.2 [dev] is the winner for its superior visual quality and adherence to the cinematic lighting requirements.
Explore each model
xAI's premium image generation model offering higher fidelity output and stronger performance on single-image editing benchmarks compared to the standard Grok Imagine model