FLUX.2 [pro] vs Grok Imagine Image

Head-to-head across 15 challenges

FLUX.2 [pro]

50.0%

win rate

Ties

0.0%

Grok Imagine Image

50.0%

win rate

Challenge Results

Modern Clean Menu

Text-to-Image

“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”

FLUX.2 [pro]
Grok Imagine Image
25% wins 0% ties 75% wins

AI Judge Analysis

FLUX.2 [pro]

  • + Clean, professional layout that closely aligns with high-end graphic design standards.
  • + Excellent font clarity and relatively logical sectioning.
  • + Consistent use of color coding for different sections.
  • Contains several spelling errors like 'MINS' instead of 'Mains'.
  • Repeats the exact same dish names (Garlic Bread, Margherita Pizza) across different functional categories.

Grok Imagine Image

  • + Dynamic and creative composition with food elements overlapping the white space.
  • + Individually high-quality food photography with vibrant colors.
  • + Comprehensive menu content featuring a wide variety of dishes.
  • The layout is more of a collage than a structured grid as requested.
  • Significant text repetition and nonsensical typos (e.g., repeating 'Steak Frites' multiple times).

Verdict: FLUX.2 [pro] followed the 'grid' prompt much better, resulting in a cleaner, more realistic menu layout that feels professional. While Grok Imagine Image has more artistic food photography, its layout is cluttered and the text rendering is less consistent than FLUX.2 [pro].

Candid Street Photography

Text-to-Image

“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”

FLUX.2 [pro]
Grok Imagine Image

AI Judge Analysis

FLUX.2 [pro]

  • + Excellent anatomical detail in the hands and facial features.
  • + Superior rendering of complex bicycle mechanics like the derailleur and chain.
  • + Masterful handling of wet pavement textures and diffuse reflections.

Grok Imagine Image

  • + Stronger adherence to the 'motion blur' requirement for background traffic.
  • + Effective 'imperfect framing' that feels like a genuine candid snapshot.
  • + Realistic film-like grain and color grading.
  • The subject's face is obscured and less detailed than requested.
  • The bicycle's structure becomes messy and illogical near the rear wheel.

Verdict: FLUX.2 [pro] excels in technical execution, providing a crisp, high-detail image with realistic skin textures and a perfectly rendered bicycle. Grok Imagine Image better captures the specific 'candid' and 'motion blur' atmosphere requested, but fails on technical details like the anatomy of the subject and the bicycle's mechanical structure.

Fantasy Warrior

Text-to-Image

“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”

FLUX.2 [pro]
Grok Imagine Image

AI Judge Analysis

FLUX.2 [pro]

  • + Excellent depiction of warm torchlight reflecting off the metal surface.
  • + Highly detailed texture on the leather straps and the burlap-style cloth underlayer.
  • + Lifelike eye texture and realistic skin imperfections including grit and scars.

Grok Imagine Image

  • + Intricate engraving on the plate armor that covers a larger portion of the frame.
  • + Clear interpretation of the braided hair with beads and bokeh spark effects.
  • + Good balance of cool ambient light and warm focal light.
  • The 'battle-worn' scars look more like thin surface-level scratches than healed or significant wounds.
  • The skin texture is a bit too smooth and airbrushed for the requested grit.
  • The metal reflections feel less like organic torchlight compared to FLUX.2 [pro].

Verdict: FLUX.2 [pro] followed the prompt more effectively by delivering a truly 'battle-worn' appearance with believable skin textures, deep scars, and authentic lighting reflections. While Grok Imagine Image produced beautiful armor engravings and a high-quality composition, it lacked the specific grit and realistic material textures (leather and cloth) clearly visible in the FLUX.2 [pro] output.

Man and Car in California

Editing
Edit instruction

“Make a photo of the man driving the car down the California coastline”

Source
FLUX.2 [pro]
Grok Imagine Image

AI Judge Analysis

FLUX.2 [pro]

  • + Successfully preserved the identity of the specific man from the source image
  • + Maintained the exact car model and design from the reference image
  • + Perfectly implemented the requested California coastline background with motion blur on the wheels
  • The transition between the man's head and the car seat has a slight masking artifact

Grok Imagine Image

  • + Very high visual quality with beautiful lighting and scenery
  • + Dynamic composition with effective use of motion blur
  • Completely failed to use the specific man provided in the source image
  • Changed the car model to a newer version of the Rolls-Royce convertible
  • Failed the core 'editing' task by generating a new scene instead of combining source elements

Verdict: FLUX.2 [pro] followed the editing instructions perfectly by accurately placing the specific man and car from the source images into the requested coastline setting. Grok Imagine Image failed the task by generating a generic man and a different car model, ignoring the specific visual data provided in the source images.

Geometric Composition

Text-to-Image

“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”

FLUX.2 [pro]
Grok Imagine Image
0% wins 0% ties 100% wins

AI Judge Analysis

FLUX.2 [pro]

  • + Excellent photographic realism and lighting
  • + Highly accurate cube geometry and refraction
  • + Clean and professional composition
  • The sphere is resting on the bottom rather than being centered
  • The plant in the background is quite blurred

Grok Imagine Image

  • + Successfully positions the sphere in the center of the cube
  • + The plant is more distinctly visible through the glass
  • + Good adherence to the window lighting instruction
  • The red book appears to be floating slightly above the cube
  • The blue sphere is levitating without any physical support, which looks unnatural
  • The cube edges are slightly rounded and less like a sharp glass cube

Verdict: FLUX.2 [pro] produced a much more realistic image with superior lighting and materials, whereas Grok Imagine Image struggled with the physical interaction between objects, resulting in a floating book and sphere. While Grok Imagine Image followed the spatial placement of the sphere more creatively, the overall quality and believability of FLUX.2 [pro] make it the better image.

Bald man challenge

Editing
Edit instruction

“Give the person a full, thick head of natural hair with realistic texture, density, and a natural hairline. Preserve facial features and lighting.”

FLUX.2 [pro]
Grok Imagine Image
0% wins 0% ties 100% wins

AI Judge Analysis

FLUX.2 [pro]

  • + Successfully added a very full, thick head of hair as requested.
  • + The texture and lighting of the added hair blend reasonably well with the face.
  • + Completely changes the person's look while maintaining the overall scene context.
  • Failed to preserve facial features, significantly altering the face shape and mustache/beard color.
  • The new face looks like an entirely different person (older, different nose and brow).

Grok Imagine Image

  • + Excellent preservation of original facial features, beard, and lighting.
  • + The added hair is very realistic and matches the existing beard color and texture.
  • + Perfect adherence to the instruction of preserving the original subject.
  • The hair isn't quite as 'full and thick' as in the FLUX.2 [pro] image, opting for a more standard cut.

Verdict: Grok Imagine Image followed the edit instructions perfectly, adding a realistic head of hair while keeping the subject's face identical to the source image. FLUX.2 [pro] failed the preservation task by generating a completely different face that merely shares the same glasses and jacket as the original.

Night Sky Transformation

Editing
Edit instruction

“Change the scene to night: a deep, dark sky with subtle, glistening stars visible behind the mountain.”

FLUX.2 [pro]
Grok Imagine Image
0% wins 0% ties 100% wins

AI Judge Analysis

FLUX.2 [pro]

  • + Perfect source preservation of the village and mountain structure.
  • + Realistic night lighting that appropriately darkens the landscape.
  • + Subtle, realistic star density as requested.
  • The sky feels slightly flat compared to the complex clouds of the original.

Grok Imagine Image

  • + Excellent sky detail with varied star magnitudes.
  • + Maintains the structural integrity of the mountain and village layout.
  • + Captures the 'glistening' aspect of the stars very well.
  • Slightly more aggressive re-rendering of the village texture compared to the source.

Verdict: Both models performed excellent image-to-image translations, preserving the complex village and mountain geometry of the source. FLUX.2 [pro] followed the prompt more literally with 'subtle' stars and a deeper black sky, whereas Grok Imagine Image created a more visually striking sky with higher star density and more atmospheric depth.

Over-the-top cartoon caricature

Editing
Edit instruction

“Create a caricature of me and my job. Make it exaggerated and humorous, incorporating my profession as a tv show anchor and my love for dogs and hockey.”

Source
FLUX.2 [pro]
Grok Imagine Image
80% wins 0% ties 20% wins

AI Judge Analysis

FLUX.2 [pro]

  • + Excellent cartoon/caricature style that is consistent across all elements.
  • + Highly creative integration of dogs into the scene, including them as co-anchors in chairs.
  • + Clear and fun adherence to hockey themes with pucks and sticks.
  • The facial features are less recognizable as the woman from the source image due to the heavy cartoonization.
  • The eyes feel slightly generic for a custom caricature.

Grok Imagine Image

  • + Strong facial likeness, maintaining the specific features and expression of the woman from the source image.
  • + Perfectly follows the caricature trope of a large head on a small body.
  • + Clever details like the dog in a hockey helmet and the skating dog in the background.
  • Mixed art styles: the face is photorealistic while the dogs and background are illustrated.
  • The hand on the left is missing fingers where it touches the dog.

Verdict: Both models successfully captured all elements of the prompt (TV anchor, dogs, and hockey). FLUX.2 [pro] created a more cohesive and humorous illustration, but Grok Imagine Image won on the 'caricature' aspect by maintaining a much stronger facial resemblance to the user while using the classic big-head/small-body style. Grok Imagine Image is preferred for its ability to transform the specific person provided into the caricature.

Adorable Baby Animals in Sunny Meadow

Text-to-Image

“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”

FLUX.2 [pro]
Grok Imagine Image
100% wins 0% ties 0% wins

AI Judge Analysis

FLUX.2 [pro]

  • + Excellent composition with dynamic posing and interaction between the animals.
  • + Superior rendering of lighting, including soft god rays and realistic dew sparkles on the grass.
  • + High level of anatomical detail and texture in the fur and paw pads.
  • Failed to include the baby bunny, showing only three of the four requested animals.
  • More of a digital art aesthetic than 'hyper-photorealistic'.

Grok Imagine Image

  • + Included all four requested animals: puppy, kitten, fox, and bunny.
  • + Very bold, dramatic god rays that emphasize the sunrise theme.
  • + Captured the 'big expressive eyes' requested in the prompt for all subjects.
  • Static, 'posed' composition lacks the 'playfully chasing' and 'tumbling' action requested.
  • Distinct 'AI look' with overly smooth fur textures and less realistic butterfly/insect shapes.
  • Proportions are a bit inconsistent, especially the kitten's face.

Verdict: Both models struggled with the full set of requirements: FLUX.2 [pro] produced a much higher quality, more dynamic image with beautiful lighting, but missed the bunny. Grok Imagine Image included all four animals, but the image is static and lacks the realistic texture and sophisticated composition found in the FLUX.2 [pro] image. FLUX.2 [pro] is preferred for its artistic merit and successful execution of the 'playful' atmosphere, despite the missing subject.

Heroic Super Hero Portrait

Text-to-Image

“Hyper-photorealistic full-body portrait of a female superhero standing triumphantly on a New York skyscraper rooftop at golden sunset, wearing a classic modest superhero costume with flowing cape, chest emblem, gloves, and boots in red and blue colors, practical design, short hair, strong determined heroic expression looking into the distance, powerful confident stance with hands on hips and cape billowing dramatically in the wind, detailed urban cityscape background, warm natural sunlight with sharp shadows and fabric highlights, ultra-sharp textures on suit, hair, and concrete, 8K masterpiece, empowering family-friendly style.”

FLUX.2 [pro]
Grok Imagine Image
100% wins 0% ties 0% wins

AI Judge Analysis

FLUX.2 [pro]

  • + Excellent realism with high-detail skin textures and fabric weaving.
  • + Superior background detail with a recognizable, complex New York skyline.
  • + Great lighting and shadows that accurately reflect the golden sunset.
  • The suit leans more toward armored 'movie' design rather than a classic modest costume.

Grok Imagine Image

  • + Follows the 'hands on hips' prompt more closely on one side, though the other hand is a fist.
  • + Captures the 'short hair' requirement very well with a classic pixie cut.
  • + The cape has a very dramatic, clean silhouette against the sky.
  • Lower overall resolution and detail in the cityscape compared to FLUX.2 [pro].
  • The lighting is somewhat flat and lacks the high-fidelity texture seen in the competitor.
  • The 'S' emblem on the chest is distorted and less integrated into the suit fabric.

Verdict: FLUX.2 [pro] is the clear winner due to its significantly higher level of detail, realism, and superior environment rendering. While Grok Imagine Image captured the hair style and cape shape well, it lacked the photographic clarity and complex city background that FLUX.2 [pro] provided.

Studio Ghibli Anime Style

Editing
Edit instruction

“Transform this photo into a Studio Ghibli–inspired illustration. Use soft pastel colors, hand-painted textures, gentle lighting, dreamy backgrounds, and a warm, nostalgic mood”

Source
FLUX.2 [pro]
Grok Imagine Image
83% wins 0% ties 17% wins

AI Judge Analysis

FLUX.2 [pro]

  • + Captures the Studio Ghibli art style perfectly with watercolor textures and soft line art.
  • + Preserves the positions and expressions of all three characters very accurately.
  • + The warm, desaturated palette and lighting create the requested nostalgic mood.
  • The character faces are slightly generic compared to the distinct features of the original people.

Grok Imagine Image

  • + Maintains excellent structural fidelity to the original source image.
  • + Vibrant blue sky adds a pleasing pop of color to the composition.
  • + Effective translation of the background into a hand-painted architectural style.
  • The style leans more toward generic anime/digital illustration than the specific 'hand-painted' watercolor look of Ghibli.
  • The central man's face is rendered with more realistic shading that clashes slightly with the other two more stylized characters.

Verdict: FLUX.2 [pro] is the winner because it successfully captured the specific aesthetic of Studio Ghibli, including the hazy lighting and textured watercolor paper effect. While Grok Imagine Image produced a high-quality illustration, it felt more like a modern digital anime style than the requested dreamy, nostalgic hand-painted feel.

Neutral Expression to Genuine Smile

Editing
Edit instruction
{
  "action": "image_edit",
  "reference": "uploaded neutral portrait",
  "change": "Warm genuine Duchenne smile: lips curved up, slight natural teeth, soft eye crinkles, subtle cheek raise",
  "details": "Realistic smiling skin (dimples if present, soft cheek shadows), slightly brighter eyes; keep exact eye shape/color/iris",
  "preserve_exact": "Face identity/structure, eyes/nose/lips/eyebrows, hair, skin texture/pores/freckles, makeup, clothing, head pose, background, lighting, shadows, framing",
  "no_changes": "No face shape change, no new features, no gaze shift, no hair/clothing/lighting/background edits",
  "style": "Ultra-photorealistic 8K portrait, sharp face focus, natural soft lighting, realistic skin glow"
}
FLUX.2 [pro]
Grok Imagine Image
25% wins 0% ties 75% wins

AI Judge Analysis

FLUX.2 [pro]

  • + Excellent Duchenne smile with realistic eye crinkling.
  • + High visual quality with natural skin texture and shading around the mouth.
  • + Maintains character identity and hair appearance well.
  • The teeth look slightly too perfect and uniform compared to the source's natural aesthetic.
  • Subtle smoothing of some skin freckles from the original.

Grok Imagine Image

  • + Near-perfect preservation of skin texture, freckles, and lighting from the source image.
  • + More natural teeth shape that aligns better with the source's realism.
  • + Excellent adherence to eye preservation requirements.
  • The eye crinkles are slightly less pronounced than requested for a full Duchenne smile.
  • The smile is a bit more restrained compared to the 'warm genuine' request.

Verdict: Both models performed exceptionally well at preserving the identity and details of the original photograph. FLUX.2 [pro] produced a more expressive and convincing smile (better eye-crinkling), though Grok Imagine Image was superior at maintaining every single pore, freckle, and the natural irregularities of the teeth from the source.

Golden Hour Stroll

Editing
Edit instruction

“Add dynamic motion to this photo: make hair blow in the wind, add leaves flying, energetic and lively feel.”

Source
FLUX.2 [pro]
Grok Imagine Image
0% wins 0% ties 100% wins

AI Judge Analysis

FLUX.2 [pro]

  • + Excellent preservation of the subject's face and the dog's features
  • + High-quality, realistic leaves with natural depth of field
  • + Natural hair movement that follows the implied wind direction
  • The denim jacket flap edit looks slightly stiff and unnatural

Grok Imagine Image

  • + Great sense of volume in the hair movement
  • + The dog's ears are adjusted to show movement
  • + Significant amount of leaves added to fill the scene
  • Leaves look like flat 2D overlays and lack realistic lighting/shadows
  • Noticeable changes to the woman's face (nose and mouth shape differ from source)

Verdict: FLUX.2 [pro] is the winner because it successfully adds the requested dynamic effects while perfectly preserving the identity of the woman and the dog from the source image. While Grok Imagine Image adds more movement to the dog's ears, its leaf effects look like low-quality stickers and it unnecessarily alters the facial features of the woman.

Vintage Cafe Logo

Text-to-Image

“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”

FLUX.2 [pro]
Grok Imagine Image
100% wins 0% ties 0% wins

AI Judge Analysis

FLUX.2 [pro]

  • + Perfect adherence to text, including the accent in 'Caffè'.
  • + Balanced circular framing creating a professional emblem feel.
  • + Clean vector style with sophisticated hatching for texture.
  • The steam is a bit literal and less stylized than the dome.
  • The 'Est. 1720' text is slightly small within the banner.

Grok Imagine Image

  • + Strong, bold visual hierarchy with high contrast.
  • + Creative integration of a coffee cup shape and spoon behind the cloche.
  • + The steam has a more fluid, organic design.
  • Repeats 'Est. 1720' twice, which was not requested.
  • The spoon and cup handles make the silhouette a bit cluttered for a minimalist logo.
  • Missing the banner for the date as requested in the prompt.

Verdict: FLUX.2 [pro] followed the prompt more precisely, including the specific requested banner and the correct accent on 'Caffè'. Grok Imagine Image created a more visually striking illustration but failed on composition by repeating text and ignoring the banner requirement. FLUX.2 [pro] is preferred for its adherence to the vector emblem style and accurate text rendering.

Apollo 11: Journey to Tranquility

Text-to-Image

“Create a clean, modern vector infographic poster about the Apollo 11 mission. NASA-inspired palette (navy, white, muted red, light gray). Flat-vector style, crisp lines, consistent iconography, subtle gradients only. Steps (stop at landing): 1. Launch (Saturn V icon) 2. Earth Orbit (Earth + orbit ring icon) 3. Translunar (trajectory arc icon) 4. Lunar Orbit (Moon + orbit ring icon) 5. Descent (lunar module descending icon) 6. Landing (lunar module on the surface icon) Small supporting elements (minimal text): • Crew strip: three silhouette icons with only last names: Armstrong, Aldrin, Collins. • Landing site marker: Moon pin labeled "Tranquility" only. Layout constraints: generous margins, large readable labels, clean background with subtle stars. Vector-only, print-poster look, high resolution.”

FLUX.2 [pro]
Grok Imagine Image

AI Judge Analysis

FLUX.2 [pro]

  • + Excellent flat-vector aesthetic that looks professional and cohesive.
  • + Strong composition with a clear top-to-bottom flow of mission phases.
  • + Clean typography for headers and accurate names for the crew members.
  • The placeholder body text below headers is illegible gibberish.
  • Merged two steps (Descent and Landing) into a single visual area without distinct labels.

Grok Imagine Image

  • + Excellent adherence to the numbered steps, clearly illustrating all 6 requested phases.
  • + Text rendering for main titles is very clear and includes the NASA logo for 'NASA-inspired'.
  • + Accurately represents the crew names and the specific landing location 'Tranquility'.
  • The 'Translunar' graphic is a bit cluttered with redundant text like '3rajoory'.
  • The iconography style is slightly less consistent than FLUX.2 [pro]'s, with varying levels of detail.

Verdict: Grok Imagine Image followed the specific instructions regarding the six steps much more accurately than FLUX.2 [pro], providing a clearer educational layout. While FLUX.2 [pro] has a more sophisticated and professional graphic design aesthetic, Grok Imagine Image succeeded in delivering a complete infographic that includes all requested stages and accurate labels.

FLUX.2 [pro]

Black Forest Labs' state-of-the-art image generation model with maximum quality and speed, supporting text-to-image and multi-reference image editing with up to 4MP output.

Grok Imagine Image

An image generation model by xAI designed to generate highly aesthetic images from text descriptions.