Head to head
Esc

Models · slot A

to navigate to pick

Grok Imagine Image xAI Recraft V4 Recraft AI

Settled by community votes across 11 shared challenges, with an AI judge weighing in on each.

Grok Imagine Image

24.1 arena score

#19 of 44 in Text-to-Image

Skill signature

Not enough comparable category data

The chart appears once both models have ratings across at least three shared arena categories.

Recraft V4

26.4 arena score

#8 of 44 in Text-to-Image

Vote tally

Where the votes landed

Grok Imagine Image

75.0%

win rate

Ties

0.0%

Recraft V4

25.0%

win rate

75.0% 0.0% ties 25.0%
Shared challenges 11

Challenge by challenge

The strongest take from each model on every shared challenge, with the AI judge's read.

Geometric Composition

Text-to-Image

“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”

Grok Imagine Image
Recraft V4

AI Judge Analysis

Grok Imagine Image

  • + Excellent adherence to lighting instructions with a clear soft glow from the left.
  • + Realistic surface textures on the wooden table and book.
  • + Good composition with the plant pot visible through the glass.
  • The glass object is slightly rectangular rather than a perfect cube.
  • The blue sphere appears to be floating in mid-air inside the glass without a clear means of support.

Recraft V4

  • + The glass object is a more accurate geometric cube.
  • + The blue sphere has realistic internal reflections and refraction.
  • + The book has a more aged, realistic texture on the pages.
  • The lighting is a bit flat compared to Model A.
  • The blue sphere is quite large, pushing the definition of a 'small' sphere.

Verdict: Both models followed the spatial instructions perfectly, placing all objects in the requested positions. Grok Imagine achieved better lighting and atmosphere, while Recraft V4 produced a more accurate cube shape and more complex glass refractions. Grok Imagine is slightly preferred for its superior handling of the 'soft window light' which creates a more convincing photographic scene.

Candid Street Photography

Text-to-Image

“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”

Grok Imagine Image
Recraft V4
100% wins 0% ties 0% wins

AI Judge Analysis

Grok Imagine Image

  • + Excellent execution of car motion blur in the background
  • + Captures an authentic Japanese urban feel with the background signage and face mask
  • + The lighting on the wet pavement is highly realistic
  • The man's hands are anatomically confused where they meet the bicycle frame
  • The framing is slightly too tight, cutting off parts of the bicycle

Recraft V4

  • + Beautiful rain texture and reflections on the pavement
  • + Strong sense of environmental atmosphere and depth
  • + The man's posture and clothing feel natural to the prompt
  • The front wheel of the bicycle is disconnected from the frame
  • Motion blur on the cars is missing, they appear largely static despite the lights
  • The overall composition feels a bit more posed than 'candid'

Verdict: Both models captured the mood of the prompt effectively. Grok Imagine followed the specific request for motion blur on passing cars much better and felt like a true candid snapshot, whereas Recraft V4 created a more atmospheric scene but failed on technical bicycle anatomy and the motion blur requirement.

Fantasy Warrior

Text-to-Image

“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”

Grok Imagine Image
Recraft V4
0% wins 0% ties 100% wins

AI Judge Analysis

Grok Imagine Image

  • + Extremely intricate engraving on the armor plating
  • + Beautiful lighting with high-contrast bokeh sparks
  • + Clear, sharp textures on skin and hair beads
  • The 'battle-worn' aesthetic is very clean and stylized rather than gritty
  • Facial hair/dirt looks more like faint makeup than actual battle grime

Recraft V4

  • + Excellent adherence to the 'battle-worn' prompt with realistic dirt and texture
  • + Realistic skin quality with natural imperfections and intense expression
  • + Great depiction of the cloth underlayer and leather straps
  • The sparks have a slightly synthetic, linear look in some areas
  • The armor engraving is slightly less detailed compared to Model A

Verdict: Both models followed the prompt excellently, but they offer different styles. Grok Imagine creates a more fantastical, high-polish cinematic look with stunning armor details, while Recraft V4 captures a more grounded, realistic, and gritty atmosphere that better reflects the 'battle-worn' instruction. Recraft V4 is the winner for its superior texture on the skin and more authentic characterization.

Modern Clean Menu

Text-to-Image

“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”

Grok Imagine Image
Recraft V4

AI Judge Analysis

Grok Imagine Image

  • + Features a more realistic complex layout with columns and dividers
  • + Captures the 'vibrant accents' request with green and blue color pops
  • + Follows a dense, professional menu structure
  • Lacks readable body text and prices
  • Includes numerous spelling errors in large headings
  • Food images are overlapping or clipped by text in a messy way

Recraft V4

  • + Excellent typography with perfectly legible text and pricing
  • + Very clean, high-quality food photography consistent with a modern minimalist aesthetic
  • + Displays a literal and organized grid that is easy to navigate
  • Layout is a bit sparse for a full restaurant menu
  • The grid feels more like an app interface than a physical print menu

Verdict: Recraft V4 is the clear winner as it produces a functional, legible menu with perfect spelling and high-quality food imagery. While Grok Imagine captures a more traditional print layout, its text is largely gibberish and its organization is cluttered compared to the clean professionalism of Recraft V4.

Magic Burger Explosion: Fiery Photorealism Challenge

Text-to-Image

“Ad for 'Magic Burger'. Dynamic, exploded burger with all components (bun, patty, cheese, lettuce, tomato, sauce) suspended in mid-air. Emphasize photorealistic detail and a sense of motion. Dark, fiery background with glowing embers. Integrate text: 'MAGIC BURGER' as a prominent title, 'LIMITED TIME ONLY' as a secondary message, and '€6.99' in a starburst, all rendered with a fiery, glowing effect.”

Grok Imagine Image
Recraft V4

AI Judge Analysis

Grok Imagine Image

  • + Excellent typography integrated with fiery effects.
  • + Highly vibrant and appetizing food rendering with good motion.
  • + Strong adherence to all prompt elements including the starburst.
  • The 'exploded' effect is a bit cluttered compared to commercial photography.

Recraft V4

  • + Clean, more realistic food textures and lighting.
  • + Good 'blown apart' composition that feels airy.
  • + Accurate text rendering and placement.
  • The background is less 'fiery' and dramatic than requested.
  • The starburst graphic feels like a flat overlay rather than integrated.

Verdict: Grok Imagine delivers a more commercially ready advertisement with striking fiery typography and intense atmosphere. Recraft V4 offers superior photorealism on the burger ingredients themselves but falls short on the 'fiery background' and font integration requirements.

Chalkboard Menu

Text-to-Image

“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”

Grok Imagine Image
Recraft V4

AI Judge Analysis

Grok Imagine Image

  • + Excellent chalk texture within the letters
  • + Accurately rendered all requested text with natural handwriting variations
  • + Convincing smudge marks and chalk dust for realism
  • The title is in all-caps rather than the requested elegant cursive
  • The font style is somewhat consistent, losing some of the 'natural variation' requested in the prompt

Recraft V4

  • + The visual composition of the café background is high quality
  • + Includes creative illustrations of the menu items as a bonus
  • The text looks like a digital font rather than authentic handwriting
  • The text lacks the requested chalky texture and variation
  • The letter 'c' in Octopus is poorly formed

Verdict: Grok Imagine followed the text instructions much more effectively, producing a board that genuinely looks handwritten with realistic chalk texture. While Recraft V4 created a nice scene and added illustrations, the text rendering feels like a digital overlay and fails to capture the 'natural handwriting' requirement specified in the prompt.

The Capybara Taxi Driver

Text-to-Image

“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”

Grok Imagine Image
Recraft V4

AI Judge Analysis

Grok Imagine Image

  • + Excellent photorealism and sharpness throughout the scene.
  • + Perfectly captures the 'bored' expression of the businesswoman.
  • + Highly detailed rendering of the capybara's fur and the taxi interior.
  • The passenger is seated in the front seat instead of the back seat as requested.
  • The steering wheel placement is slightly awkward relative to the capybara's body.

Recraft V4

  • + Successfully placed the businesswoman in the back seat as requested.
  • + Includes accurate text on the driver hat ('NYC TAXI').
  • + Great side-profile composition that feels very candid.
  • The passenger's face is slightly blurry and less detailed than the driver.
  • The capybara's paws are somewhat small and less distinct on the wheel.

Verdict: Grok Imagine Image produces a more high-fidelity, polished image with better facial expressions, but it fails the spatial instruction to place the passenger in the back seat. Recraft V4 adheres more closely to the prompt's layout by placing the businesswoman in the rear, even if the overall image is slightly grainier. Recraft V4 is the winner for following all complex instructions while maintaining a strong cinematic aesthetic.

Isometric Miniature Diorama Scenes

Text-to-Image

“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”

Grok Imagine Image
Recraft V4

AI Judge Analysis

Grok Imagine Image

  • + Excellent adherence to the '3D cartoon' and 'soft refined texture' style keywords.
  • + Clean, bold text rendering with perfect professional layout.
  • + High-clarity isometric composition that feels cohesive and intentional.
  • The white fish pieces have a slightly plastic, less 'food-like' appearance compared to the salmon.

Recraft V4

  • + Highly realistic PBR materials, especially on the salmon texture and the plate.
  • + Good interpretation of the 'diorama base' by using a salt/ice block.
  • + Accurate text and flag placement.
  • Completely ignored the '3D cartoon' style request in favor of photorealism.
  • The composition feels slightly more cluttered with the addition of ginger and a textured base.
  • The text is thin and lacks the requested 'large bold' presence.

Verdict: Grok Imagine followed the stylistic cues much better, perfectly capturing the 3D cartoon miniature aesthetic with bold, clean typography. While Recraft V4 produced highly realistic textures, it missed the specific cartoon style requested and the bold impact of the text. Grok Imagine is the winner for its superior layout and closer adherence to the overall visual prompt.

Adorable Baby Animals in Sunny Meadow

Text-to-Image

“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”

Grok Imagine Image
Recraft V4
100% wins 0% ties 0% wins

AI Judge Analysis

Grok Imagine Image

  • + Strong composition with a sense of forward motion
  • + Vibrant, high-contrast colors and professional lighting effects
  • Lacks the butterflies explicitly mentioned in the prompt
  • Illustrative, 'Disney-style' aesthetic rather than the requested hyper-photorealistic scene

Recraft V4

  • + Excellent adherence to all prompt elements including butterflies and specific animals
  • + Achieves a high level of photorealistic detail in fur and environment
  • + Beautiful interpretation of god rays and dew sparkles
  • The fox's front right leg has an anatomically awkward bend

Verdict: Recraft V4 followed the prompt much more accurately by including the butterflies and delivering a photorealistic style, whereas Grok Imagine produced more of a stylized digital illustration. Recraft V4 also captured higher-quality fine details like dew and realistic fur textures, making it the superior version for this specific request.

Vintage Cafe Logo

Text-to-Image

“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”

Grok Imagine Image
Recraft V4

AI Judge Analysis

Grok Imagine Image

  • + Excellent typography rendering with the correct accent on 'Caffè'
  • + Superior vector-style shading and subtle texture on the cloche
  • + Successfully includes the specific banner element requested in the prompt
  • Includes an unrequested spoon and cup handle attached to the cloche
  • The 'Est. 1720' text is redundant, appearing both on the banner and below the main text

Recraft V4

  • + Elegant and unique 'classic typography' that fits a vintage aesthetic well
  • + Clean minimalist composition that adheres better to the 'minimalist' keyword
  • + Accurate representation of woodcut-style shading on the cloche dome
  • Failed to include the requested 'banner' for the 'Est. 1720' text
  • The accent on the 'è' is positioned more like an apostrophe than a grave accent

Verdict: Grok Imagine Image followed the structural prompt requirements more closely by including the specific banner element and 'Est. 1720' placement, though it added unnecessary extra graphics like a spoon. Recraft V4 produced a much more sophisticated typographic design that better captures the 'vintage minimalist' feel, but missed the banner requirement entirely. Grok Imagine is the winner for better prompt adherence regarding specific layout elements.

Apollo 11: Journey to Tranquility

Text-to-Image

“Create a clean, modern vector infographic poster about the Apollo 11 mission. NASA-inspired palette (navy, white, muted red, light gray). Flat-vector style, crisp lines, consistent iconography, subtle gradients only. Steps (stop at landing): 1. Launch (Saturn Vicon) 2. Earth Orbit (Earth + orbit ring icon) 3. Translunar (trajectory arc icon) 4. Lunar Orbit (Moon + orbit ring icon) 5. Descent (lunar module descending icon) 6. Landing (lunar module on the surface icon) Small supporting elements (minimal text): • Crew strip: three silhouette icons with only last names: Armstrong, Aldrin, Collins. • Landing site marker: Moon pin labeled "Tranquility" only. Layout constraints: generous margins, large readable labels, clean background with subtle stars. Vector-only, print-poster look, high resolution.”

Grok Imagine Image
Recraft V4
100% wins 0% ties 0% wins

AI Judge Analysis

Grok Imagine Image

  • + Excellent adherence to the requested NASA-inspired color palette.
  • + Includes all six requested steps with appropriate icons for each.
  • + Higher visual polish in the vector art style.
  • Several spelling errors in the labels (e.g., 'Translujory', 'Moom').
  • Layout is a bit cluttered with large stars and extra text elements.

Recraft V4

  • + Perfect text rendering for all labels and names.
  • + Clean, professional minimalist layout with plenty of whitespace.
  • + Accurate icons for the trajectory and lunar modules.
  • Light gray background makes the infographics feel slightly washed out compared to a navy theme.
  • The icon for Lunar Orbit is a bit simplistic compared to the Earth icon.

Verdict: Recraft V4 is the winner due to its flawless text rendering and superior layout, which is critical for an infographic. While Grok Imagine captures the NASA aesthetic and dark theme more vividly, its significant spelling errors and cluttered design make it less functional as a piece of graphic design.

Next steps

Explore each model