Head to head
Esc

Models · slot A

to navigate to pick

FLUX.2 [dev] Turbo fal Grok Imagine Image Pro xAI

Settled by community votes across 12 shared challenges, with an AI judge weighing in on each.

FLUX.2 [dev] Turbo

27.4 arena score

#4 of 44 in Text-to-Image

Skill signature

Not enough comparable category data

The chart appears once both models have ratings across at least three shared arena categories.

Grok Imagine Image Pro

24.8 arena score

#14 of 44 in Text-to-Image

Vote tally

Where the votes landed

FLUX.2 [dev] Turbo

75.0%

win rate

Ties

0.0%

Grok Imagine Image Pro

25.0%

win rate

75.0% 0.0% ties 25.0%
Shared challenges 12

Challenge by challenge

The strongest take from each model on every shared challenge, with the AI judge's read.

Geometric Composition

Text-to-Image

“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”

FLUX.2 [dev] Turbo
Grok Imagine Image Pro

AI Judge Analysis

FLUX.2 [dev] Turbo

  • + Perfect adherence to all spatial instructions
  • + Excellent photographic realism with natural glass textures and dust
  • + Sophisticated lighting and reflections
  • The plant appears to be both behind and inside the glass simultaneously due to visual overlap

Grok Imagine Image Pro

  • + Clean, minimalist aesthetic
  • + Vibrant colors on the sphere and book
  • The sphere is duplicated or has a nonsensical solid reflection on the right
  • The plant is mostly above/behind rather than 'partially visible through the glass'
  • The glass walls have inconsistent thickness and wavy distortion

Verdict: FLUX.2 [dev] Turbo followed the prompt perfectly, capturing the complex physics of a sphere inside a glass box with a book on top and a plant behind. Grok Imagine Image Pro struggled with the internal contents of the cube, creating a strange duplicate of the blue sphere and failing to show the plant through the glass as requested.

Candid Street Photography

Text-to-Image

“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”

FLUX.2 [dev] Turbo
Grok Imagine Image Pro

AI Judge Analysis

FLUX.2 [dev] Turbo

  • + Excellent adherence to the 'imperfect framing' prompt with a tight, gritty composition.
  • + Highly realistic skin textures and age spots on the man's hands and face.
  • + Very detailed mechanical components and tools scattered on the wet pavement.
  • The front wheel of the bicycle is clipping significantly through the pavement.
  • The rain effect is less visible compared to the reflections.

Grok Imagine Image Pro

  • + Clearer depiction of light rain falling through the air.
  • + Stronger 'motion blur' on the passing cars as requested in the prompt.
  • + Clean composition with a nice balance between the subject and the street depth.
  • The wrench in the man's hand is poorly rendered and melting into the bicycle frame.
  • The man's hands look slightly too smooth for an 'elderly' man with 'natural skin texture'.
  • Missing the 'imperfect framing' requested, appearing more like a staged professional shot.

Verdict: FLUX.2 [dev] Turbo captures the requested 'candid' and 'imperfect' aesthetic much better, providing grit and hyper-realistic skin textures that match the elderly description perfectly. While Grok Imagine Image Pro handles the motion blur of the cars and the rain particles better, it suffers from significant AI artifacts in the hands and tools, making FLUX.2 the more believable image despite the clipping wheel.

Fantasy Warrior

Text-to-Image

“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”

FLUX.2 [dev] Turbo
Grok Imagine Image Pro
100% wins 0% ties 0% wins

AI Judge Analysis

FLUX.2 [dev] Turbo

  • + Excellent realism in the facial features and skin texture, including subtle pores and believable scarring.
  • + Superior rendering of hair physics and integration of the small colorful beads.
  • + High level of detail on the leather straps and metal engravings with naturalistic lighting.
  • The torch in the background has a slightly soft, digital look compared to the foreground.
  • The bokeh sparks are a bit uniform in some areas.

Grok Imagine Image Pro

  • + Impressive text rendering on the gorget ('Lux in tenebris') which adds to the paladin theme.
  • + Strong, cinematic lighting with high contrast and vibrant orange highlights.
  • + Very intricate armor engraving and creative use of bone or stone beads in the hair.
  • The facial features and skin look slightly more 'digitally painted' or smoothed compared to the realism of Model A.
  • The hair braids appear a bit stiff and symmetrical, lacking natural variation.
  • Some of the bokeh sparks look like flat digital overlays.

Verdict: Both models followed the prompt exceptionally well, but FLUX.2 [dev] Turbo takes the lead due to its superior photographic realism, particularly in the skin textures and the natural fall of the hair. While Grok Imagine Image Pro included impressive thematic touches like the Latin text on the armor, its overall look is slightly more stylized and less lifelike than the FLUX image.

Modern Clean Menu

Text-to-Image

“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”

FLUX.2 [dev] Turbo
Grok Imagine Image Pro

AI Judge Analysis

FLUX.2 [dev] Turbo

  • + Excellent layout that resembles a functional, professional menu with prices and descriptions.
  • + Strong typography with bold sans-serif headers and legible body text.
  • + Vibrant colorful accents at the bottom add a playful, modern feel.
  • The photo grid is a bit cluttered and the images are largely variations of the same pizza.
  • Some text elements like the secondary header are garbled.

Grok Imagine Image Pro

  • + Perfectly clean grid layout that emphasizes the minimalist aesthetic.
  • + High-quality, distinct food photography for each category.
  • + Great adherence to the request for specific categories (appetizers, pizza, mains).
  • Lacks item names, descriptions, and prices, making it look more like a photo collage than a menu design.
  • The composition is a bit too repetitive with three identical columns.

Verdict: FLUX.2 [dev] Turbo produces a much more realistic menu design that includes professional typographic hierarchy, prices, and a more complex layout. Grok Imagine Image Pro creates a very clean, minimalist grid of beautiful food photos, but it fails to include the functional elements of a menu design like item names and descriptions, making it feel less like a finished design product.

Chalkboard Menu

Text-to-Image

“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”

FLUX.2 [dev] Turbo
Grok Imagine Image Pro

AI Judge Analysis

FLUX.2 [dev] Turbo

  • + Features a very authentic chalk texture with realistic smudges and dust.
  • + Excellent composition that includes a café background for context.
  • + Captures natural variations in handwriting slant and pressure.
  • The pricing for the Truffle Mushroom Risotto is redundant and slightly messy.
  • Technical spacing issues between words in the lower section.

Grok Imagine Image Pro

  • + Exceptional text clarity and perfect adherence to the dictated menu items.
  • + Consistent and elegant cursive script that remains highly legible.
  • + Clean layout that fills the board space effectively.
  • The chalk texture is a bit too uniform, appearing slightly digital in some strokes.
  • Lacks the atmospheric background depth found in the competitor.

Verdict: Both models followed the complex text instructions perfectly, including the specific date and pricing. FLUX.2 [dev] Turbo offers a more realistic and atmospheric scene with authentic chalk smudging, whereas Grok Imagine Image Pro provides superior legibility and a cleaner, more professional-looking menu layout. Grok Imagine Image Pro is a slight winner for its flawless transcription of the multi-line prompt.

Pose & Character Mashup

Editing
Edit instruction

“Use Image 1 as the exact pose reference and Image 2 as the character reference. Recreate the person/character from Image 2 in the exact dynamic pose and body position from Image 1. Keep the exact face, hair, clothing style/details, and expression from Image 2. Match the lighting and environment of Image 1. The final image must show the character from Image 2 performing the precise action/pose from Image 1 with perfect anatomy and natural integration.”

Source
FLUX.2 [dev] Turbo
Grok Imagine Image Pro

AI Judge Analysis

FLUX.2 [dev] Turbo

  • + Excellent character transference including the sweater details, scarf, and facial features.
  • + Accurately replicates the lighting and background environment of Image 1.
  • + Successfully incorporates the specific pose, adapting the character from Image 2 to the dynamic position.
  • Anatomy error with the right foot appearing as a knee/stump on the box.
  • Inherited the long flowing hair from the original person in Image 1 rather than keeping the short hair from Image 2.
  • Missing the sunglasses from the character reference.

Grok Imagine Image Pro

  • + Preserves the exact pose and environment of Image 1 perfectly.
  • Failed the character reference task entirely, keeping the woman from Image 1.
  • Did not apply any of the clothing or physical attributes (face, gender) from Image 2.
  • Small artifacts on the hands compared to the original.

Verdict: FLUX.2 [dev] Turbo successfully attempted the complex task of merging the character from Image 2 into the pose of Image 1, capturing the clothing and facial likeness reasonably well despite some anatomical glitches and hair length issues. Grok Imagine Image Pro essentially ignored the character reference and just recreated Image 1 with minor variations. FLUX.2 is the clear winner for actually performing the requested edit.

The Reversed Rodeo

Text-to-Image

“Horse riding astronaut in space — horse on top, not vice versa. Surreal, highly detailed, cinematic.”

FLUX.2 [dev] Turbo
Grok Imagine Image Pro

AI Judge Analysis

FLUX.2 [dev] Turbo

  • + Excellent photorealism and cinematic lighting
  • + High level of detail in the space suit and horse textures
  • + Natural-looking integration of the subjects with the background environment
  • Failed the negative constraint; the astronaut is riding the horse instead of vice versa

Grok Imagine Image Pro

  • + Successfully followed the difficult spatial constraint of 'horse on top'
  • + Vibrant colors and a more surreal composition as requested
  • + Creative interpretation of the horse 'riding' the astronaut in zero gravity
  • Lower realism compared to Model A
  • The planet in the background is a stylized hybrid of Saturn and Jupiter that looks a bit generic

Verdict: This challenge highlights a classic prompt adherence test. FLUX.2 [dev] Turbo produced a much more visually impressive and realistic image, but it completely ignored the specific instruction for the horse to be on top. Grok Imagine Image Pro successfully interpreted the surreal 'horse on top' request, making it the winner for following the complex prompt logic even if the technical rendering is slightly less cinematic.

The Capybara Taxi Driver

Text-to-Image

“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”

FLUX.2 [dev] Turbo
Grok Imagine Image Pro

AI Judge Analysis

FLUX.2 [dev] Turbo

  • + Excellent fur texture and lighting on the capybara.
  • + The passenger's facial expression perfectly captures the 'bored' requirement.
  • + Dynamic city lighting through the windshield improves atmosphere.
  • The passenger is sitting in the front passenger seat rather than the requested back seat.
  • The 'TAX' light on top of the car is weirdly positioned and partially clipped.

Grok Imagine Image Pro

  • + Correctly places the passenger in the back seat as requested.
  • + The 'NYC TLC' text on the capybara's hat is very realistic and relevant.
  • + Captures the full interior of the taxi, enhancing the 'scene inside' composition.
  • The capybara's face is slightly less detailed and more static than the other model.
  • The background lighting is a bit more generic.

Verdict: While FLUX.2 [dev] Turbo produced a more high-definition image with better character expressions, it failed to place the passenger in the back seat. Grok Imagine Image Pro followed the spatial instructions perfectly, placing the businesswoman in the back seat and adding great local details like the TLC medallion text on the hat.

Isometric Miniature Diorama Scenes

Text-to-Image

“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”

FLUX.2 [dev] Turbo
Grok Imagine Image Pro
0% wins 0% ties 100% wins

AI Judge Analysis

FLUX.2 [dev] Turbo

  • + Excellent text rendering with clean, professional typography.
  • + Higher texture detail on the fish and rice grains for a realistic PBR look.
  • + Stronger adherence to the 'isometric' diorama request with the square raised base.
  • The placement of the sushi rolls on the plate is slightly off-center relative to the plate's surface.
  • The garnish flower on top looks a bit plasticky compared to the fish.

Grok Imagine Image Pro

  • + Great variety of sushi types (nigiri and maki) arranged nicely on the plate.
  • + Clean, soft lighting that fits the 'miniature 3D cartoon' aesthetic well.
  • + Accurate text and flag icon placement.
  • The textures are a bit too smooth and simplified, losing the 'realistic PBR' quality requested.
  • The perspective is more of a standard 3D render angle than a true 45° isometric view.
  • The wood texture on the base is somewhat stretched and less detailed.

Verdict: FLUX.2 [dev] Turbo followed the prompt more accurately by providing a true isometric perspective and a raised square diorama base. While Grok Imagine Pro offered more variety in the sushi itself, FLUX.2 exhibited superior material textures and cleaner typography, making it feel more like a high-end 3D asset.

Adorable Baby Animals in Sunny Meadow

Text-to-Image

“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”

FLUX.2 [dev] Turbo
Grok Imagine Image Pro

AI Judge Analysis

FLUX.2 [dev] Turbo

  • + Perfect adherence to the requested animal count and types.
  • + Excellent integration of lighting with dew sparkles and god rays.
  • + Very natural fur textures and expressive eye details.
  • The fox kit has a slightly feline-looking face structure.
  • The kitten's pose is a bit awkward with its floating front paw.

Grok Imagine Image Pro

  • + Strong dynamic composition with the fox kit rolling back.
  • + Vibrant colors and clear god rays from the sunrise.
  • + Good variety in the wildflower types.
  • Failed to follow animal count by including two kittens instead of one.
  • The puppy's anatomy looks slightly distorted during the leap.
  • The rabbit feels a bit static and disconnected from the action.

Verdict: FLUX.2 [dev] Turbo followed the prompt more accurately by including exactly one of each animal requested, whereas Grok Imagine Image Pro included an extra kitten. FLUX.2 also achieved a more photorealistic look with better texture on the fur and more realistic dew effects compared to the slightly more stylized and saturated approach of Grok Imagine Image Pro.

Vintage Cafe Logo

Text-to-Image

“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”

FLUX.2 [dev] Turbo
Grok Imagine Image Pro
100% wins 0% ties 0% wins

AI Judge Analysis

FLUX.2 [dev] Turbo

  • + Excellent adherence to the vintage minimalist aesthetic with a weathered texture.
  • + Perfect typography rendering for both the main name and the 'Est. 1720' banner.
  • + Superior color palette featuring accurate warm brown and cream tones as requested.
  • The steam lines are slightly asymmetrical, though this fits the hand-drawn vintage style.

Grok Imagine Image Pro

  • + Clean vector style with clear line work.
  • + Correct text and date inclusion within a circular emblem format.
  • The 'cloche' is silver/gray, missing the 'warm brown and cream' color instruction for the primary elements.
  • The steam is a single, somewhat awkward thick line that lacks the elegance expected of a logo.
  • Lacks the 'subtle texture' requested, looking more like a modern clip-art graphic.

Verdict: FLUX.2 [dev] Turbo produced a sophisticated, high-quality logo that perfectly captures the vintage, textured aesthetic and color palette described in the prompt. Grok Imagine Image Pro followed the basic layout instructions but failed to incorporate the requested warm tones for the cloche and lacked the professional design finish seen in the other model.

Apollo 11: Journey to Tranquility

Text-to-Image

“Create a clean, modern vector infographic poster about the Apollo 11 mission. NASA-inspired palette (navy, white, muted red, light gray). Flat-vector style, crisp lines, consistent iconography, subtle gradients only. Steps (stop at landing): 1. Launch (Saturn Vicon) 2. Earth Orbit (Earth + orbit ring icon) 3. Translunar (trajectory arc icon) 4. Lunar Orbit (Moon + orbit ring icon) 5. Descent (lunar module descending icon) 6. Landing (lunar module on the surface icon) Small supporting elements (minimal text): • Crew strip: three silhouette icons with only last names: Armstrong, Aldrin, Collins. • Landing site marker: Moon pin labeled "Tranquility" only. Layout constraints: generous margins, large readable labels, clean background with subtle stars. Vector-only, print-poster look, high resolution.”

FLUX.2 [dev] Turbo
Grok Imagine Image Pro

AI Judge Analysis

FLUX.2 [dev] Turbo

  • + Excellent detailed illustrations of the Saturn V and Lunar Module.
  • + Includes the names of the three astronauts correctly.
  • + High-quality NASA-inspired dark color palette.
  • The layout is cluttered and non-linear, making the 'steps' hard to follow.
  • Contains significant text artifacts like 'Saturn Vicon' (concatenated) and duplicate 'Translunar' and 'Landing site' labels.
  • Uses photo-realistic textures for the Earth and Moon instead of the requested flat-vector style.

Grok Imagine Image Pro

  • + Strictly follows the requested 'flat-vector style' with clean, consistent iconography.
  • + Highly intuitive vertical timeline layout that clearly defines the six requested steps.
  • + Very clean typography and excellent source preservation of the NASA color palette.
  • Small spelling error in 'Tranquility Base' (missing an 'L' for US spelling, though 'Tranquility' is correct in the prompt).
  • Icons are smaller and less detailed than those in the competing model.

Verdict: Grok Imagine Image Pro is the clear winner as it perfectly captures the aesthetic and functional requirements of a 'modern vector infographic' with a logical flow. While FLUX.2 [dev] Turbo provides more detailed individual illustrations, its layout is chaotic and it fails to deliver the requested flat-vector style, opting for textured globes instead.

Next steps

Explore each model