Distilled version of Black Forest Labs' FLUX.2 [dev] outperforming it at a cheaper price. Developed by fal.ai.
Settled by community votes across 12 shared challenges, with an AI judge weighing in on each.
FLUX.2 [dev] Turbo
#4 of 44 in Text-to-Image
Not enough comparable category data
The chart appears once both models have ratings across at least three shared arena categories.
Grok Imagine Image Pro
#14 of 44 in Text-to-Image
Where the votes landed
FLUX.2 [dev] Turbo
75.0%
win rate
Ties
0.0%
Grok Imagine Image Pro
25.0%
win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Geometric Composition
Text-to-Image“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”
AI Judge Analysis
FLUX.2 [dev] Turbo
- + Perfect adherence to all spatial instructions
- + Excellent photographic realism with natural glass textures and dust
- + Sophisticated lighting and reflections
- − The plant appears to be both behind and inside the glass simultaneously due to visual overlap
Grok Imagine Image Pro
- + Clean, minimalist aesthetic
- + Vibrant colors on the sphere and book
- − The sphere is duplicated or has a nonsensical solid reflection on the right
- − The plant is mostly above/behind rather than 'partially visible through the glass'
- − The glass walls have inconsistent thickness and wavy distortion
Verdict: FLUX.2 [dev] Turbo followed the prompt perfectly, capturing the complex physics of a sphere inside a glass box with a book on top and a plant behind. Grok Imagine Image Pro struggled with the internal contents of the cube, creating a strange duplicate of the blue sphere and failing to show the plant through the glass as requested.
Candid Street Photography
Text-to-Image“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”
AI Judge Analysis
FLUX.2 [dev] Turbo
- + Excellent adherence to the 'imperfect framing' prompt with a tight, gritty composition.
- + Highly realistic skin textures and age spots on the man's hands and face.
- + Very detailed mechanical components and tools scattered on the wet pavement.
- − The front wheel of the bicycle is clipping significantly through the pavement.
- − The rain effect is less visible compared to the reflections.
Grok Imagine Image Pro
- + Clearer depiction of light rain falling through the air.
- + Stronger 'motion blur' on the passing cars as requested in the prompt.
- + Clean composition with a nice balance between the subject and the street depth.
- − The wrench in the man's hand is poorly rendered and melting into the bicycle frame.
- − The man's hands look slightly too smooth for an 'elderly' man with 'natural skin texture'.
- − Missing the 'imperfect framing' requested, appearing more like a staged professional shot.
Verdict: FLUX.2 [dev] Turbo captures the requested 'candid' and 'imperfect' aesthetic much better, providing grit and hyper-realistic skin textures that match the elderly description perfectly. While Grok Imagine Image Pro handles the motion blur of the cars and the rain particles better, it suffers from significant AI artifacts in the hands and tools, making FLUX.2 the more believable image despite the clipping wheel.
Fantasy Warrior
Text-to-Image“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”
AI Judge Analysis
FLUX.2 [dev] Turbo
- + Excellent realism in the facial features and skin texture, including subtle pores and believable scarring.
- + Superior rendering of hair physics and integration of the small colorful beads.
- + High level of detail on the leather straps and metal engravings with naturalistic lighting.
- − The torch in the background has a slightly soft, digital look compared to the foreground.
- − The bokeh sparks are a bit uniform in some areas.
Grok Imagine Image Pro
- + Impressive text rendering on the gorget ('Lux in tenebris') which adds to the paladin theme.
- + Strong, cinematic lighting with high contrast and vibrant orange highlights.
- + Very intricate armor engraving and creative use of bone or stone beads in the hair.
- − The facial features and skin look slightly more 'digitally painted' or smoothed compared to the realism of Model A.
- − The hair braids appear a bit stiff and symmetrical, lacking natural variation.
- − Some of the bokeh sparks look like flat digital overlays.
Verdict: Both models followed the prompt exceptionally well, but FLUX.2 [dev] Turbo takes the lead due to its superior photographic realism, particularly in the skin textures and the natural fall of the hair. While Grok Imagine Image Pro included impressive thematic touches like the Latin text on the armor, its overall look is slightly more stylized and less lifelike than the FLUX image.
Modern Clean Menu
Text-to-Image“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”
AI Judge Analysis
FLUX.2 [dev] Turbo
- + Excellent layout that resembles a functional, professional menu with prices and descriptions.
- + Strong typography with bold sans-serif headers and legible body text.
- + Vibrant colorful accents at the bottom add a playful, modern feel.
- − The photo grid is a bit cluttered and the images are largely variations of the same pizza.
- − Some text elements like the secondary header are garbled.
Grok Imagine Image Pro
- + Perfectly clean grid layout that emphasizes the minimalist aesthetic.
- + High-quality, distinct food photography for each category.
- + Great adherence to the request for specific categories (appetizers, pizza, mains).
- − Lacks item names, descriptions, and prices, making it look more like a photo collage than a menu design.
- − The composition is a bit too repetitive with three identical columns.
Verdict: FLUX.2 [dev] Turbo produces a much more realistic menu design that includes professional typographic hierarchy, prices, and a more complex layout. Grok Imagine Image Pro creates a very clean, minimalist grid of beautiful food photos, but it fails to include the functional elements of a menu design like item names and descriptions, making it feel less like a finished design product.
Chalkboard Menu
Text-to-Image“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”
AI Judge Analysis
FLUX.2 [dev] Turbo
- + Features a very authentic chalk texture with realistic smudges and dust.
- + Excellent composition that includes a café background for context.
- + Captures natural variations in handwriting slant and pressure.
- − The pricing for the Truffle Mushroom Risotto is redundant and slightly messy.
- − Technical spacing issues between words in the lower section.
Grok Imagine Image Pro
- + Exceptional text clarity and perfect adherence to the dictated menu items.
- + Consistent and elegant cursive script that remains highly legible.
- + Clean layout that fills the board space effectively.
- − The chalk texture is a bit too uniform, appearing slightly digital in some strokes.
- − Lacks the atmospheric background depth found in the competitor.
Verdict: Both models followed the complex text instructions perfectly, including the specific date and pricing. FLUX.2 [dev] Turbo offers a more realistic and atmospheric scene with authentic chalk smudging, whereas Grok Imagine Image Pro provides superior legibility and a cleaner, more professional-looking menu layout. Grok Imagine Image Pro is a slight winner for its flawless transcription of the multi-line prompt.
Pose & Character Mashup
Editing“Use Image 1 as the exact pose reference and Image 2 as the character reference. Recreate the person/character from Image 2 in the exact dynamic pose and body position from Image 1. Keep the exact face, hair, clothing style/details, and expression from Image 2. Match the lighting and environment of Image 1. The final image must show the character from Image 2 performing the precise action/pose from Image 1 with perfect anatomy and natural integration.”
AI Judge Analysis
FLUX.2 [dev] Turbo
- + Excellent character transference including the sweater details, scarf, and facial features.
- + Accurately replicates the lighting and background environment of Image 1.
- + Successfully incorporates the specific pose, adapting the character from Image 2 to the dynamic position.
- − Anatomy error with the right foot appearing as a knee/stump on the box.
- − Inherited the long flowing hair from the original person in Image 1 rather than keeping the short hair from Image 2.
- − Missing the sunglasses from the character reference.
Grok Imagine Image Pro
- + Preserves the exact pose and environment of Image 1 perfectly.
- − Failed the character reference task entirely, keeping the woman from Image 1.
- − Did not apply any of the clothing or physical attributes (face, gender) from Image 2.
- − Small artifacts on the hands compared to the original.
Verdict: FLUX.2 [dev] Turbo successfully attempted the complex task of merging the character from Image 2 into the pose of Image 1, capturing the clothing and facial likeness reasonably well despite some anatomical glitches and hair length issues. Grok Imagine Image Pro essentially ignored the character reference and just recreated Image 1 with minor variations. FLUX.2 is the clear winner for actually performing the requested edit.
The Reversed Rodeo
Text-to-Image“Horse riding astronaut in space — horse on top, not vice versa. Surreal, highly detailed, cinematic.”
AI Judge Analysis
FLUX.2 [dev] Turbo
- + Excellent photorealism and cinematic lighting
- + High level of detail in the space suit and horse textures
- + Natural-looking integration of the subjects with the background environment
- − Failed the negative constraint; the astronaut is riding the horse instead of vice versa
Grok Imagine Image Pro
- + Successfully followed the difficult spatial constraint of 'horse on top'
- + Vibrant colors and a more surreal composition as requested
- + Creative interpretation of the horse 'riding' the astronaut in zero gravity
- − Lower realism compared to Model A
- − The planet in the background is a stylized hybrid of Saturn and Jupiter that looks a bit generic
Verdict: This challenge highlights a classic prompt adherence test. FLUX.2 [dev] Turbo produced a much more visually impressive and realistic image, but it completely ignored the specific instruction for the horse to be on top. Grok Imagine Image Pro successfully interpreted the surreal 'horse on top' request, making it the winner for following the complex prompt logic even if the technical rendering is slightly less cinematic.
The Capybara Taxi Driver
Text-to-Image“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”
AI Judge Analysis
FLUX.2 [dev] Turbo
- + Excellent fur texture and lighting on the capybara.
- + The passenger's facial expression perfectly captures the 'bored' requirement.
- + Dynamic city lighting through the windshield improves atmosphere.
- − The passenger is sitting in the front passenger seat rather than the requested back seat.
- − The 'TAX' light on top of the car is weirdly positioned and partially clipped.
Grok Imagine Image Pro
- + Correctly places the passenger in the back seat as requested.
- + The 'NYC TLC' text on the capybara's hat is very realistic and relevant.
- + Captures the full interior of the taxi, enhancing the 'scene inside' composition.
- − The capybara's face is slightly less detailed and more static than the other model.
- − The background lighting is a bit more generic.
Verdict: While FLUX.2 [dev] Turbo produced a more high-definition image with better character expressions, it failed to place the passenger in the back seat. Grok Imagine Image Pro followed the spatial instructions perfectly, placing the businesswoman in the back seat and adding great local details like the TLC medallion text on the hat.
Isometric Miniature Diorama Scenes
Text-to-Image“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”
AI Judge Analysis
FLUX.2 [dev] Turbo
- + Excellent text rendering with clean, professional typography.
- + Higher texture detail on the fish and rice grains for a realistic PBR look.
- + Stronger adherence to the 'isometric' diorama request with the square raised base.
- − The placement of the sushi rolls on the plate is slightly off-center relative to the plate's surface.
- − The garnish flower on top looks a bit plasticky compared to the fish.
Grok Imagine Image Pro
- + Great variety of sushi types (nigiri and maki) arranged nicely on the plate.
- + Clean, soft lighting that fits the 'miniature 3D cartoon' aesthetic well.
- + Accurate text and flag icon placement.
- − The textures are a bit too smooth and simplified, losing the 'realistic PBR' quality requested.
- − The perspective is more of a standard 3D render angle than a true 45° isometric view.
- − The wood texture on the base is somewhat stretched and less detailed.
Verdict: FLUX.2 [dev] Turbo followed the prompt more accurately by providing a true isometric perspective and a raised square diorama base. While Grok Imagine Pro offered more variety in the sushi itself, FLUX.2 exhibited superior material textures and cleaner typography, making it feel more like a high-end 3D asset.
Adorable Baby Animals in Sunny Meadow
Text-to-Image“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”
AI Judge Analysis
FLUX.2 [dev] Turbo
- + Perfect adherence to the requested animal count and types.
- + Excellent integration of lighting with dew sparkles and god rays.
- + Very natural fur textures and expressive eye details.
- − The fox kit has a slightly feline-looking face structure.
- − The kitten's pose is a bit awkward with its floating front paw.
Grok Imagine Image Pro
- + Strong dynamic composition with the fox kit rolling back.
- + Vibrant colors and clear god rays from the sunrise.
- + Good variety in the wildflower types.
- − Failed to follow animal count by including two kittens instead of one.
- − The puppy's anatomy looks slightly distorted during the leap.
- − The rabbit feels a bit static and disconnected from the action.
Verdict: FLUX.2 [dev] Turbo followed the prompt more accurately by including exactly one of each animal requested, whereas Grok Imagine Image Pro included an extra kitten. FLUX.2 also achieved a more photorealistic look with better texture on the fur and more realistic dew effects compared to the slightly more stylized and saturated approach of Grok Imagine Image Pro.
Vintage Cafe Logo
Text-to-Image“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”
AI Judge Analysis
FLUX.2 [dev] Turbo
- + Excellent adherence to the vintage minimalist aesthetic with a weathered texture.
- + Perfect typography rendering for both the main name and the 'Est. 1720' banner.
- + Superior color palette featuring accurate warm brown and cream tones as requested.
- − The steam lines are slightly asymmetrical, though this fits the hand-drawn vintage style.
Grok Imagine Image Pro
- + Clean vector style with clear line work.
- + Correct text and date inclusion within a circular emblem format.
- − The 'cloche' is silver/gray, missing the 'warm brown and cream' color instruction for the primary elements.
- − The steam is a single, somewhat awkward thick line that lacks the elegance expected of a logo.
- − Lacks the 'subtle texture' requested, looking more like a modern clip-art graphic.
Verdict: FLUX.2 [dev] Turbo produced a sophisticated, high-quality logo that perfectly captures the vintage, textured aesthetic and color palette described in the prompt. Grok Imagine Image Pro followed the basic layout instructions but failed to incorporate the requested warm tones for the cloche and lacked the professional design finish seen in the other model.
Apollo 11: Journey to Tranquility
Text-to-Image“Create a clean, modern vector infographic poster about the Apollo 11 mission. NASA-inspired palette (navy, white, muted red, light gray). Flat-vector style, crisp lines, consistent iconography, subtle gradients only. Steps (stop at landing): 1. Launch (Saturn Vicon) 2. Earth Orbit (Earth + orbit ring icon) 3. Translunar (trajectory arc icon) 4. Lunar Orbit (Moon + orbit ring icon) 5. Descent (lunar module descending icon) 6. Landing (lunar module on the surface icon) Small supporting elements (minimal text): • Crew strip: three silhouette icons with only last names: Armstrong, Aldrin, Collins. • Landing site marker: Moon pin labeled "Tranquility" only. Layout constraints: generous margins, large readable labels, clean background with subtle stars. Vector-only, print-poster look, high resolution.”
AI Judge Analysis
FLUX.2 [dev] Turbo
- + Excellent detailed illustrations of the Saturn V and Lunar Module.
- + Includes the names of the three astronauts correctly.
- + High-quality NASA-inspired dark color palette.
- − The layout is cluttered and non-linear, making the 'steps' hard to follow.
- − Contains significant text artifacts like 'Saturn Vicon' (concatenated) and duplicate 'Translunar' and 'Landing site' labels.
- − Uses photo-realistic textures for the Earth and Moon instead of the requested flat-vector style.
Grok Imagine Image Pro
- + Strictly follows the requested 'flat-vector style' with clean, consistent iconography.
- + Highly intuitive vertical timeline layout that clearly defines the six requested steps.
- + Very clean typography and excellent source preservation of the NASA color palette.
- − Small spelling error in 'Tranquility Base' (missing an 'L' for US spelling, though 'Tranquility' is correct in the prompt).
- − Icons are smaller and less detailed than those in the competing model.
Verdict: Grok Imagine Image Pro is the clear winner as it perfectly captures the aesthetic and functional requirements of a 'modern vector infographic' with a logical flow. While FLUX.2 [dev] Turbo provides more detailed individual illustrations, its layout is chaotic and it fails to deliver the requested flat-vector style, opting for textured globes instead.
Explore each model
xAI's premium image generation model offering higher fidelity output and stronger performance on single-image editing benchmarks compared to the standard Grok Imagine model