Head to head

FLUX.2 [klein] 9B (Black Forest Labs) vs. Grok Imagine Image (xAI)

Settled by community votes across 4 shared challenges, with an AI judge weighing in on 3 of them.

FLUX.2 [klein] 9B

21.3 arena score

#7 of 23 in Image Editing

Skill signature

Not enough comparable category data

The chart appears once both models have ratings across at least three shared arena categories.

Grok Imagine Image

24.1 arena score

#19 of 44 in Text-to-Image

Vote tally

Where the votes landed

FLUX.2 [klein] 9B

50.0%

win rate

Ties

0.0%

Grok Imagine Image

50.0%

win rate

Shared challenges 4
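
The page does not say how the overall win rates are derived from the per-challenge votes. One plausible reading is that votes are pooled across all shared challenges. The sketch below illustrates that reading only: the `ChallengeVotes` structure, the `tally` helper, and the vote counts are all hypothetical (the page shows percentages, not counts).

```python
from dataclasses import dataclass


@dataclass
class ChallengeVotes:
    """Raw vote counts for one shared challenge (hypothetical structure)."""
    name: str
    a_wins: int  # votes for slot A (FLUX.2 [klein] 9B)
    ties: int
    b_wins: int  # votes for slot B (Grok Imagine Image)


def tally(challenges):
    """Pool votes across challenges; return (a_rate, tie_rate, b_rate) in percent."""
    a = sum(c.a_wins for c in challenges)
    t = sum(c.ties for c in challenges)
    b = sum(c.b_wins for c in challenges)
    total = a + t + b
    if total == 0:
        # No votes cast yet: report zeros rather than dividing by zero.
        return (0.0, 0.0, 0.0)
    return tuple(round(100 * n / total, 1) for n in (a, t, b))


# Assumed counts: one vote on each of the two challenges that display a split,
# none on the two that display no split. These numbers are illustrative only.
votes = [
    ChallengeVotes("Magic Burger Explosion", 0, 0, 0),
    ChallengeVotes("Chalkboard Menu", 0, 0, 1),
    ChallengeVotes("Pose & Character Mashup", 1, 0, 0),
    ChallengeVotes("Outfit Transfer Challenge", 0, 0, 0),
]
rates = tally(votes)
print(rates)  # (50.0, 0.0, 50.0)
```

Under these assumed counts the pooled result matches the 50.0% / 0.0% / 50.0% split shown above, but other vote counts and other aggregation rules (for example, averaging per-challenge rates) could produce the same figures.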

Challenge by challenge

The strongest take from each model on every shared challenge, with the AI judge's read.

Magic Burger Explosion: Fiery Photorealism Challenge

Text-to-Image

“Ad for 'Magic Burger'. Dynamic, exploded burger with all components (bun, patty, cheese, lettuce, tomato, sauce) suspended in mid-air. Emphasize photorealistic detail and a sense of motion. Dark, fiery background with glowing embers. Integrate text: 'MAGIC BURGER' as a prominent title, 'LIMITED TIME ONLY' as a secondary message, and '€6.99' in a starburst, all rendered with a fiery, glowing effect.”

FLUX.2 [klein] 9B
Grok Imagine Image

AI Judge Analysis

FLUX.2 [klein] 9B

  • + Excellent typography with a glowing fire texture inside the letters.
  • + Superior photorealistic textures on the beef patties and bun.
  • + Balanced composition that feels like a professional advertising poster.
  • - The 'exploded' effect is more of a floating stack than a truly deconstructed burger.
  • - Does not include tomato slices as requested in the prompt.

Grok Imagine Image

  • + Better adherence to the 'exploded' request with components widely separated.
  • + Distinct and vibrant fire and smoke effects on the background and text.
  • + Includes all requested ingredients including sliced tomatoes.
  • - The pricing starburst has an inconsistent, flat graphic style compared to the realistic burger.
  • - Lower quality text rendering on the €6.99 price compared to the rest of the image.

Verdict: FLUX.2 [klein] produces a much more polished and photorealistic advertisement with high-end typography, whereas Grok Imagine better captures the 'exploded' motion requested in the prompt by deconstructing every individual layer. FLUX.2 is the overall winner for its professional finish and higher texture quality, despite missing the tomatoes.

Chalkboard Menu

Text-to-Image

“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”

FLUX.2 [klein] 9B
Grok Imagine Image
Votes: FLUX.2 [klein] 9B 0% wins · 0% ties · Grok Imagine Image 100% wins

AI Judge Analysis

FLUX.2 [klein] 9B

  • + Excellent authentic chalk texture with dusting and eraser marks
  • + Accurately completed the partial prompt for the cookie item
  • + Includes a realistic wooden frame and cafe background
  • - Spelling error in 'Mushroom' (spelled 'Mushrropom')
  • - The handwriting style is a bit inconsistent between the large and small text

Grok Imagine Image

  • + Perfect spelling on all menu items including 'Mushroom'
  • + Highly consistent and realistic handwriting style throughout the entire board
  • + Excellent lighting and atmosphere fitting a 'cozy café'
  • - The chalk texture is slightly cleaner/more digital compared to FLUX.2 [klein] 9B
  • - The '2026' text has a slight alignment wobble

Verdict: Grok Imagine Image is the winner because it successfully spelled all menu items correctly while maintaining a highly consistent and attractive handwriting style. While FLUX.2 [klein] 9B had a slightly more realistic 'smudged' chalk texture, the significant spelling error in 'Mushrropom' makes it less usable for a high-quality prompt adherence task.

Pose & Character Mashup

Editing
Edit instruction

“Use Image 1 as the exact pose reference and Image 2 as the character reference. Recreate the person/character from Image 2 in the exact dynamic pose and body position from Image 1. Keep the exact face, hair, clothing style/details, and expression from Image 2. Match the lighting and environment of Image 1. The final image must show the character from Image 2 performing the precise action/pose from Image 1 with perfect anatomy and natural integration.”

Source
FLUX.2 [klein] 9B
Grok Imagine Image
Votes: FLUX.2 [klein] 9B 100% wins · 0% ties · Grok Imagine Image 0% wins

AI judge analysis unavailable for this challenge.

Outfit Transfer Challenge

Editing
Edit instruction

“Use Image 1 as the base person. Dress them in the exact elaborate outfit from Image 2 (including all layers, accessories, jewelry, and shoes). Carefully adapt the clothing to the body shape and pose in Image 1 while maintaining realistic fabric behavior, correct proportions, and perfect lighting/shadow matching. Keep the person’s exact face, hair, and background completely unchanged.”

Source
FLUX.2 [klein] 9B
Grok Imagine Image

AI Judge Analysis

FLUX.2 [klein] 9B

  • + Perfectly replicates the specific clothing items (coat, scarf, jeans) from Image 2.
  • + Maintains the person's exact face, hair, and vitiligo patterns with high accuracy.
  • + Successfully adapts the pose to fit the hands in the coat pockets.
  • - Added extraneous heavy gold beaded necklaces that were not present in Image 2.

Grok Imagine Image

  • + Maintains the subject's face and background reasonably well.
  • + High level of detail on the fabric textures.
  • - Completely ignored the clothing in Image 2, substituting it with a generic royal costume.
  • - Modified the lighting and skin tone of the subject significantly.

Verdict: FLUX.2 [klein] 9B followed the prompt instructions much more accurately, successfully transferring the specific pea coat, plaid scarf, and denim from Image 2 onto the subject while preserving his likeness. Grok Imagine Image failed the core task by generating an entirely different 'elaborate' outfit that bore no resemblance to the provided source image, although it did maintain the background elements well.
