Fast distilled version of Black Forest Labs' FLUX.2 [dev] optimized for speed and cost efficiency.
Settled by community votes across 8 shared challenges, with an AI judge weighing in on each.
FLUX.2 [dev] Flash
#5 of 44 in Text-to-Image
Grok Imagine Image Pro
#14 of 44 in Text-to-Image
Where the votes landed
FLUX.2 [dev] Flash: 75.0% win rate
Ties: 0.0%
Grok Imagine Image Pro: 25.0% win rate
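As a rough illustration of how percentages like these can arise, the sketch below converts a win/tie/loss tally into shares. The 6-0-2 split over 8 challenges is an assumption chosen because it reproduces the published figures; the page does not state the underlying counts or whether rates are computed per challenge or per vote, and the function name `win_rates` is hypothetical.

```python
def win_rates(wins_a: int, ties: int, wins_b: int) -> dict[str, float]:
    """Return percentage shares for model A wins, ties, and model B wins."""
    total = wins_a + ties + wins_b
    if total == 0:
        raise ValueError("at least one decided challenge is required")
    return {
        "a": round(100 * wins_a / total, 1),
        "ties": round(100 * ties / total, 1),
        "b": round(100 * wins_b / total, 1),
    }


# Assumed tally (not published on the page): 6 wins, 0 ties, 2 losses.
rates = win_rates(6, 0, 2)
print(rates)
```

Under that assumed tally the shares come out to 75.0%, 0.0%, and 25.0%, matching the figures shown for the two models.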
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Geometric Composition
Text-to-Image: “A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”
AI Judge Analysis
FLUX.2 [dev] Flash
- + Excellent photographic realism with natural dust and refraction on the glass.
- + Correctly places the plant behind the cube so it is visible through the glass.
- + The lighting is soft and directional as requested, creating a cohesive scene.
- − The sphere is slightly smaller than what might be expected for 'small', though still accurate.
Grok Imagine Image Pro
- + Strong colors and clean composition.
- + Good textural contrast between the wood, matte sphere, and glass.
- − Creates a strange hallucination where a second half-sphere appears on the right edge of the glass.
- − The plant is mostly to the side rather than 'behind' the cube as requested.
- − The glass appears unnaturally thick and distorted in a way that looks less realistic.
Verdict: FLUX.2 [dev] Flash followed the spatial instructions perfectly, placing the plant behind the glass cube and rendering the physics of light through glass very realistically. Grok Imagine Image Pro struggled with the internal contents of the cube, resulting in a redundant half-sphere artifact and failing to position the plant correctly according to the prompt.
Candid Street Photography
Text-to-Image: “A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”
AI Judge Analysis
FLUX.2 [dev] Flash
- + Excellent depiction of motion blur with passing cars
- + High realism and skin texture detail
- + Successfully followed 'imperfect framing' with a slightly awkward, candid-style composition
- − Internal logic of the bike is messy (spokes and handlebars are jumbled)
- − The man's hands appear slightly merged with the bike's cables
Grok Imagine Image Pro
- + Natural-looking interaction with the bike using a wrench
- + Better background depth and lighting on the wet pavement
- + More coherent bike structure
- − Cars in the background lack the 'motion blur' requested
- − The framing feels too centered and professional, missing the 'imperfect framing' prompt
- − Face texture is slightly softer/less detailed than the competitor
Verdict: FLUX.2 [dev] Flash captured the specific photography technicalities of the prompt much better, including the motion blur and the requested 'imperfect' candid framing. Grok Imagine Image Pro produced a cleaner subject and more coherent bicycle, but failed to incorporate the motion blur and the spontaneous feel of a street photo.
Fantasy Warrior
Text-to-Image: “Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”
AI Judge Analysis
FLUX.2 [dev] Flash
- + Excellent skin texture with realistic blood and dirt
- + Highly detailed and intricate metal engravings
- + Very lifelike, clear eye rendering
- − The braids are a bit messy and merge into the background in spots
- − The color palette is slightly muted
Grok Imagine Image Pro
- + Stronger visual storytelling with the Latin inscription 'Lux in tenebris' on the gorget
- + Very distinctive and well-defined hair beads
- + Better lighting contrast with stronger warm reflections
- − The facial scars look a bit like digital paint strokes rather than skin damage
- − Some minor aliasing on the bokeh sparks
Verdict: Both models followed the prompt exceptionally well, but FLUX.2 [dev] Flash produces a more natural, lifelike face with superior skin texture. Grok Imagine Image Pro has slightly better composition and lighting contrast, and its addition of thematic text on the armor adds a nice touch, though its facial details feel slightly more 'rendered' than those of FLUX.2 [dev] Flash.
Chalkboard Menu
Text-to-Image: “Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”
AI Judge Analysis
FLUX.2 [dev] Flash
- + Excellent chalk texture with realistic smudging and dusting on the board.
- + Superior handwriting style that looks authentically human and elegant.
- + Very high-quality background and lighting within the café setting.
- − Slightly messy layout for the second menu item where 'Herbs' is oddly placed below a dash.
Grok Imagine Image Pro
- + Perfect text accuracy including the final menu item 'Cookies'.
- + Clean, legible layout with consistent spacing between items.
- + Adheres well to the cursive requirement for the title.
- − The chalk texture looks a bit too sharp and digital compared to FLUX.2 [dev] Flash.
- − The handwriting looks slightly more uniform and less like natural chalk strokes.
Verdict: Both models followed the complex prompt with nearly 100% accuracy in text rendering. FLUX.2 [dev] Flash produces a more atmospheric and realistic image with authentic chalk dust and human-like variations in the script, while Grok Imagine Image Pro provides a cleaner, more readable board with slightly more precise letterforms. FLUX.2 [dev] Flash is the winner for its superior 'chalkboard' feel and artistic composition.
The Capybara Taxi Driver
Text-to-Image: “Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”
AI Judge Analysis
FLUX.2 [dev] Flash
- + Excellent texture on the capybara fur and the jacket.
- + The capybara's head is sized more naturally for the space, filling the driver's area.
- + Good depth of field with the blurred Manhattan background.
- − The placement of the capybara's front paws on the steering wheel looks slightly distorted.
- − The passenger's face is a bit blurry and less detailed than the driver's.
Grok Imagine Image Pro
- + High level of detail in the dash instruments, including a realistic taxi meter showing a fare.
- + The passenger's expression and clothing are very high-quality and realistic.
- + Vibrant, recognizable Manhattan backdrop with neon lights and building textures.
- − The capybara appears a bit small in the driver's seat, making its arms look unnaturally long.
- − The hat is a simple baseball cap rather than a traditional taxi driver service cap.
Verdict: Both models followed the complex prompt exceptionally well, capturing the surreal scenario with high photorealism. Grok Imagine Image Pro stands out for its superior background detail and the addition of a functional-looking taxi meter, while FLUX.2 [dev] Flash provides a more convincing capybara appearance and a better interpretation of the requested chauffeur-style cap.
Isometric Miniature Diorama Scenes
Text-to-Image: “Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”
AI Judge Analysis
FLUX.2 [dev] Flash
- + Perfect adherence to the isometric perspective and square diorama base.
- + High-quality text rendering and placement.
- + Excellent material textures, especially on the fish and wood grain.
- − The hybrid sushi roll/nigiri logic is a bit anatomically strange for sushi.
- − Slightly less variety in the dish compared to Grok Imagine Image Pro.
Grok Imagine Image Pro
- + Great variety of sushi types including nigiri and maki rolls.
- + Excellent clay-like 'cartoon' aesthetic that feels very polished.
- + Clear and well-centered text with a nice shadow effect.
- − The circular plate on a circular base feels a bit redundant compared to a standard diorama aesthetic.
- − Materials are slightly more stylized/plastic than the requested PBR realism.
Verdict: Both models followed the complex prompt very well, including the specific text and flag requirements. FLUX.2 [dev] Flash delivered a more technically accurate isometric diorama with superior texture work on the wood and fish, while Grok Imagine Image Pro provided a more appealing variety of sushi and a charming 'cartoon' 3D style that feels more cohesive as a miniature scene.
Adorable Baby Animals in Sunny Meadow
Text-to-Image: “Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”
AI Judge Analysis
FLUX.2 [dev] Flash
- + Excellent fur texture and lighting integration on all animals.
- + Dynamic and playful composition with animals looking toward the butterflies.
- + Beautiful rendering of dew drops and god rays contributing to the '8K masterpiece' vibe.
- − The kitten/bunny hybrid in the bottom center is a significant anatomical error.
- − The fox's front right leg is missing or poorly placed behind the head.
Grok Imagine Image Pro
- + Perfectly captures the 'tumbling' motion with the fox kit on its back.
- + Clearer distinction between the specific animal types requested.
- + Strong adherence to the 'lush wildflower meadow' with a variety of flower types and colors.
- − The lighting on the animals feels slightly flat and 'cut-out' compared to the background.
- − The kitten has an oddly elongated neck and stiff pose.
- − Duplicated the kitten (two instead of one) which wasn't requested.
Verdict: FLUX.2 [dev] Flash produces a more aesthetically pleasing image with superior lighting and texture, but it suffers from a major anatomical failure by merging a kitten and a rabbit into one creature. Grok Imagine Image Pro follows the spirit of the prompt's action much better and avoids hybrid animals, though its lighting feels less photorealistic and it unnecessarily doubles the number of kittens. Grok is the preferred winner here simply for maintaining creature logic, which is essential for the prompt's requirements.
Apollo 11: Journey to Tranquility
Text-to-Image: “Create a clean, modern vector infographic poster about the Apollo 11 mission. NASA-inspired palette (navy, white, muted red, light gray). Flat-vector style, crisp lines, consistent iconography, subtle gradients only. Steps (stop at landing): 1. Launch (Saturn V icon) 2. Earth Orbit (Earth + orbit ring icon) 3. Translunar (trajectory arc icon) 4. Lunar Orbit (Moon + orbit ring icon) 5. Descent (lunar module descending icon) 6. Landing (lunar module on the surface icon) Small supporting elements (minimal text): • Crew strip: three silhouette icons with only last names: Armstrong, Aldrin, Collins. • Landing site marker: Moon pin labeled "Tranquility" only. Layout constraints: generous margins, large readable labels, clean background with subtle stars. Vector-only, print-poster look, high resolution.”
AI Judge Analysis
FLUX.2 [dev] Flash
- + Captures a more complex, illustrative 'poster' feel with a deep navy background.
- + Higher level of detail in the lunar module and planetary renders.
- + Includes creative extras like the crew silhouettes and a surface horizon.
- − Text is cluttered and contains significant spelling errors ('Sataurr' Iccòn', 'Tranquility' vs 'Tranquilty').
- − The layout is chaotic with overlapping labels and redundant icons (multiple landing/descent steps).
- − Fails the 'flat-vector' style requirement by including highly textured 3D-like elements.
Grok Imagine Image Pro
- + Perfectly adheres to the 'clean, flat-vector' style and requested NASA color palette.
- + Numbered steps are logical, organized, and follow the requested sequence exactly.
- + Text is legible and spelling is largely accurate with a clean, professional typeface.
- − The composition is a bit simple/vertical, leaving a lot of empty gray space on the sides.
- − The 'Translunar' icon is just a red arc without an integrated trajectory visual.
Verdict: Grok Imagine Image Pro is the clear winner as it strictly followed the stylistic 'vector' and 'flat' constraints of the prompt, resulting in a professional-looking infographic. While FLUX.2 [dev] Flash attempted a more ambitious illustration, it failed on typography, layout organization, and the specific 'flat' style requested, leading to a cluttered and messy final product.
Explore each model
xAI's premium image generation model, offering higher-fidelity output and stronger performance on single-image editing benchmarks than the standard Grok Imagine model.