Head to head
Esc

Models · slot A

to navigate to pick

FLUX.2 [dev] Black Forest Labs Grok Imagine Image Pro xAI

Settled by community votes across 4 shared challenges, with an AI judge weighing in on each.

FLUX.2 [dev]

24.5 arena score

#17 of 44 in Text-to-Image

Skill signature

Not enough comparable category data

The chart appears once both models have ratings across at least three shared arena categories.

Grok Imagine Image Pro

24.8 arena score

#14 of 44 in Text-to-Image

Vote tally

Where the votes landed

FLUX.2 [dev]

100.0%

win rate

Ties

0.0%

Grok Imagine Image Pro

0.0%

win rate

100.0% 0.0% ties 0.0%
Shared challenges 4

Challenge by challenge

The strongest take from each model on every shared challenge, with the AI judge's read.

Geometric Composition

Text-to-Image

“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”

FLUX.2 [dev]
Grok Imagine Image Pro

AI Judge Analysis

FLUX.2 [dev]

  • + Excellent rendering of the glass cube with realistic thickness and edge reflections.
  • + The blue sphere has a beautiful translucent, marble-like quality.
  • + The soft window light is masterfully handled, creating a gentle bloom from the left.
  • The blue sphere appears to be floating unnaturally rather than resting on the bottom of the cube.
  • The plant in the background is quite blurry, making it less distinct than requested.

Grok Imagine Image Pro

  • + Perfect adherence to the plant placement, with leaves clearly visible through the glass.
  • + The wooden table has a rich, weathered texture that feels very realistic.
  • + The sphere is correctly grounded on the base of the cube.
  • The glass cube has strange, wavy optical distortions that don't match the straight exterior edges.
  • The reflection of the sphere on the right side of the glass looks like a second physical object rather than a reflection.

Verdict: Both models followed the complex spatial instructions perfectly. FLUX.2 [dev] produced a more aesthetically pleasing image with superior lighting and glass materials, though the sphere seems to float. Grok Imagine Image Pro handled the background plant and the physical placement of the sphere better, but suffered from odd reflections and internal glass distortions that detract from the realism.

Candid Street Photography

Text-to-Image

“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”

FLUX.2 [dev]
Grok Imagine Image Pro

AI Judge Analysis

FLUX.2 [dev]

  • + Excellent execution of motion blur from passing cars
  • + Realistic skin texture and candid feeling
  • + Accurate depiction of a wet pavement with high-contrast reflections
  • Physical logic of the bicycle frame is distorted/glitched in the center
  • The man's lower body is merged awkwardly with the bike

Grok Imagine Image Pro

  • + Clearer, more coherent bicycle anatomy and tool usage
  • + Strong composition with a good sense of depth
  • + Very natural skin and clothing textures
  • Fails to include the requested motion blur for passing cars
  • The rain is barely visible compared to the prompt's intent

Verdict: FLUX.2 [dev] followed the technical details of the prompt much better, specifically capturing the dynamic motion blur and the gritty, candid atmosphere of a rainy street. However, Grok Imagine Image Pro produced a much more physically coherent image with far fewer anatomical glitches, though it ignored the specific request for motion blur.

Fantasy Warrior

Text-to-Image

“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”

FLUX.2 [dev]
Grok Imagine Image Pro
100% wins 0% ties 0% wins

AI Judge Analysis

FLUX.2 [dev]

  • + Natural skin texture and believable scar rendering
  • + Excellent implementation of warm torchlight reflections
  • + Lifelike, expressive eyes with a candid portrait feel
  • The leather straps look slightly flat compared to the metalwork
  • The braid/bead arrangement is a bit chaotic and asymmetric

Grok Imagine Image Pro

  • + Exceptional detail on the engraved armor including legible Latin text
  • + High-contrast lighting creates a dramatic atmosphere
  • + Perfectly realized beads in the hair according to the prompt
  • Skin texture appears slightly more 'digital' or smoothed than Model A
  • The bokeh sparks feel a bit like a flat overlay rather than integrated depth

Verdict: Both models followed the prompt exceptionally well, but Model B (Grok Imagine Image Pro) stands out due to the incredible intricacy of the armor engravings and the addition of legible thematic text. While FLUX.1 [dev] produced a more natural and lifelike facial portrait, Grok’s technical execution of the specific material textures like the leather and metal makes it the stronger 'battle-worn paladin' image.

Adorable Baby Animals in Sunny Meadow

Text-to-Image

“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”

FLUX.2 [dev]
Grok Imagine Image Pro

AI Judge Analysis

FLUX.2 [dev]

  • + Superior photorealism with realistic fur textures and lighting interactions.
  • + Excellent adherence to the 'god rays' and 'dew sparkles' prompt requirements.
  • + High-quality composition with a focused, cinematic field of view.
  • Included two bunnies instead of one.
  • The animals are relatively static rather than 'tumbling together' as requested.

Grok Imagine Image Pro

  • + Better captures the action of 'playfully chasing' and 'tumbling together'.
  • + Highly vibrant colors and clear, expressive eyes for all animals.
  • + Good placement of butterflies throughout the frame.
  • Included two kittens instead of one.
  • The image has a more 'digital illustration' feel compared to the requested hyper-photorealism.
  • Lower depth of field makes the background look flatter than Image A.

Verdict: FLUX.2 [dev] produces a significantly more realistic and atmospheric image with beautiful lighting and textures, though it is a bit more static. Grok Imagine Image Pro does a better job of capturing the playful movement described in the prompt but fails to achieve the requested hyper-photorealism, appearing more like a clean digital composite. FLUX.2 is the winner for its superior visual quality and adherence to the cinematic lighting requirements.

Next steps

Explore each model