Head to head
Esc

Models · slot A

to navigate to pick

Grok Imagine Image xAI Recraft V4 Pro Recraft AI

Settled by community votes across 11 shared challenges, with an AI judge weighing in on each.

Grok Imagine Image

24.1 arena score

#19 of 44 in Text-to-Image

Skill signature

Not enough comparable category data

The chart appears once both models have ratings across at least three shared arena categories.

Recraft V4 Pro

24.3 arena score

#18 of 44 in Text-to-Image

Vote tally

Where the votes landed

Grok Imagine Image

0.0%

win rate

Ties

0.0%

Recraft V4 Pro

100.0%

win rate

0.0% 0.0% ties 100.0%
Shared challenges 11

Challenge by challenge

The strongest take from each model on every shared challenge, with the AI judge's read.

Geometric Composition

Text-to-Image

“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”

Grok Imagine Image
Recraft V4 Pro

AI Judge Analysis

Grok Imagine Image

  • + Excellent adherence to the 'plant behind the cube visible through glass' instruction
  • + Realistic soft lighting consistency from the left side
  • + Good textures on the wooden table and red book cover
  • The blue sphere is floating physically impossible in the center of the cube
  • The cube looks more like a rectangular glass frame than a solid cube

Recraft V4 Pro

  • + The blue sphere realistically sits on the bottom of the cube
  • + Strong photographic quality with realistic reflections on the glass and sphere
  • + Clear spatial relationship between the book and the cube
  • The plant is more 'beside' the cube than 'behind' it, missing the refraction effect through the glass
  • The sphere is quite large compared to the 'small' sphere requested in the prompt

Verdict: Grok Imagine followed the complex spatial instruction regarding the plant being viewed through the glass much better than Recraft V4 Pro. However, Recraft V4 Pro produced a more physically grounded image where the sphere actually rests on the bottom of the cube, whereas Grok Imagine's sphere floats unnaturally.

Candid Street Photography

Text-to-Image

“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”

Grok Imagine Image
Recraft V4 Pro

AI Judge Analysis

Grok Imagine Image

  • + Excellent authentic captures of motion blur from passing cars.
  • + Highly realistic lighting and pavement reflections that feel like a snapshot.
  • + Achieves a convincing 'imperfect framing' requested in the prompt.
  • The subject's face is obscured and his hands are distorted.
  • Minor anatomical issues with the bicycle frame integration.

Recraft V4 Pro

  • + Beautiful rain effects and atmospheric lighting.
  • + The subject's aged skin texture is highly detailed and realistic.
  • + Superior character and bicycle composition with clearer focus.
  • Fails to include the requested motion blur from passing cars.
  • The framing is a bit too 'perfectly' centered for a requested candid street photo.

Verdict: Grok Imagine followed the technical aspects of the prompt more closely, successfully including the motion blur and the requested imperfect snapshot aesthetic. However, Recraft V4 Pro produced a much more visually striking image with superior rendering of the subject and rain, even though it missed the specific car motion blur requirement.

Fantasy Warrior

Text-to-Image

“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”

Grok Imagine Image
Recraft V4 Pro
0% wins 0% ties 100% wins

AI Judge Analysis

Grok Imagine Image

  • + Exquisite intricate engraving on the plate armor
  • + Beautiful warm lighting with cinematic bokeh
  • + Exceptional crispness in the eye and skin textures
  • The character looks more like a model than a 'battle-worn' warrior
  • The hair beads are somewhat generic compared to the intricate prompt

Recraft V4 Pro

  • + Excellent interpretation of 'battle-worn' with much more realistic dirt and grime
  • + Includes colorful, specific beads in the hair braids as requested
  • + More authentic and gritty facial expression
  • Lighting is a bit flat compared to Model A
  • The armor engraving is slightly less detailed and more muted

Verdict: Grok Imagine provides a more visually stunning, cinematic image with incredible armor detail and lighting, but Recraft V4 Pro captures the 'battle-worn' atmosphere much more effectively with realistic dirt and a grittier character. While Grok is more 'beautiful,' Recraft is more faithful to the specific mood and texture descriptions of the prompt.

Modern Clean Menu

Text-to-Image

“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”

Grok Imagine Image
Recraft V4 Pro

AI Judge Analysis

Grok Imagine Image

  • + Excellent typography style with bold headers
  • + Very creative and integrated layout of food photography
  • + High visual appeal and professional aesthetic for a casual menu
  • Text is largely illegible gibberish
  • The grid layout is slightly disjointed and non-linear

Recraft V4 Pro

  • + Perfectly legible and coherent text in English
  • + Strict adherence to the requested grid layout
  • + Very clean and functional minimalist design
  • Composition is a bit generic/standard
  • Less artistic flair compared to the other model

Verdict: Recraft V4 Pro is the clear winner for a design task because it produces fully legible, logical, and usable text, whereas Grok Imagine fills the design with placeholder gibberish. While Grok Imagine has a more creative and visually striking artistic layout, Recraft V4 Pro follows the grid prompt more accurately and provides a practical, production-ready menu.

Magic Burger Explosion: Fiery Photorealism Challenge

Text-to-Image

“Ad for 'Magic Burger'. Dynamic, exploded burger with all components (bun, patty, cheese, lettuce, tomato, sauce) suspended in mid-air. Emphasize photorealistic detail and a sense of motion. Dark, fiery background with glowing embers. Integrate text: 'MAGIC BURGER' as a prominent title, 'LIMITED TIME ONLY' as a secondary message, and '€6.99' in a starburst, all rendered with a fiery, glowing effect.”

Grok Imagine Image
Recraft V4 Pro

AI Judge Analysis

Grok Imagine Image

  • + Excellent text integration with a true fiery, glowing effect as requested
  • + Highly vibrant and appetizing color palette
  • + Superb explosive motion through the use of sauce droplets and sparks
  • The starburst for the price is a bit simplistic/clipart-like in style
  • Some components like the lettuce look slightly stylized compared to the patty

Recraft V4 Pro

  • + Extremely photorealistic burger textures, particularly the buns and patty
  • + Good adherence to the 'exploded' layout with many flying ingredients
  • + Sophisticated starburst design around the price
  • The text doesn't feel 'fiery' or 'glowing' as requested, but rather just has a flame texture overlay
  • The 'LIMITED TIME ONLY' text is less legible and loses the glowing impact

Verdict: Grok Imagine followed the stylistic instructions for the text much better, creating a cohesive fiery aesthetic that feels like a real advertisement. While Recraft V4 Pro achieved higher realism on the burger itself, its text treatment was flat, and it missed the 'glowing' requirement for the typography.

Chalkboard Menu

Text-to-Image

“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”

Grok Imagine Image
Recraft V4 Pro

AI Judge Analysis

Grok Imagine Image

  • + Excellent chalk texture with realistic dusty residue.
  • + Perfect adherence to the 'handwritten cursive' and 'natural variation' requirement.
  • + Authentic blackboard composition with a wooden frame and smudges.
  • The date is slightly less 'elegant' cursive than the title might suggest but still very high quality.

Recraft V4 Pro

  • + Clean layout and easy to read text.
  • + Successfully completed the truncated text from the prompt for the cookies.
  • Text looks like a digital font rather than genuine chalk handwriting.
  • The text lacks the requested chalk texture and natural slant variation.
  • The title is in a blocky sans-serif style rather than the requested elegant cursive.

Verdict: Grok Imagine is the clear winner as it masterfully captured the requested chalk texture and handwritten aesthetic, creating a believable and cozy cafe menu. Recraft V4 Pro produced text that appears too clean and digital, failing to meet the core requirement for natural handwriting and chalk variation.

The Capybara Taxi Driver

Text-to-Image

“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”

Grok Imagine Image
Recraft V4 Pro

AI Judge Analysis

Grok Imagine Image

  • + Excellent photorealism with sharp textures on the capybara's fur and the taxi dashboard.
  • + Perfect adherence to the 'front paws on steering wheel' and 'bored expression' instructions for the woman.
  • + High cinematic quality with realistic depth of field and urban lighting.
  • The passenger is sitting in the front seat next to the driver instead of the back seat.
  • The perspective is from 'below' the dashboard looking up, which is a slightly unusual angle for a taxi ride.

Recraft V4 Pro

  • + Correctly places the woman in the back seat as requested in the prompt.
  • + Atmospheric rainy night lighting provides a moody and realistic NYC aesthetic.
  • + The capybara's cap includes 'NYC' text which adds to the theme.
  • The woman's face is slightly blurry and less detailed than in Model A.
  • The side-view composition feels less intimate and less like an 'inside the taxi' shot compared to the front-on view.

Verdict: Both models captured the surreal nature of the prompt very well, but they succeeded in different areas. Grok Imagine produced a much more detailed and polished image with better facial expressions, but failed to put the passenger in the back seat. Recraft V4 Pro correctly positioned the passenger in the back seat and captured the NYC rainy atmosphere beautifully, making it the winner for better prompt adherence regarding composition.

Isometric Miniature Diorama Scenes

Text-to-Image

“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”

Grok Imagine Image
Recraft V4 Pro
0% wins 0% ties 100% wins

AI Judge Analysis

Grok Imagine Image

  • + Excellent 3D cartoon aesthetic with soft, appealing textures
  • + Perfectly follows the 45° isometric perspective requirement
  • + Very clean and bold text rendering that matches the style
  • The white fish textures are slightly plasticky

Recraft V4 Pro

  • + Features more realistic PBR material textures on the plate and fish
  • + Includes accurate Japanese flag icon and text spacing
  • + Good variety of sushi types presented
  • The diorama base made of loose rice looks slightly unappealing/messy
  • The 'cartoon' style is less consistent than Model A
  • The text is smaller and has less visual weight

Verdict: Both models followed the complex prompt requirements for text, flags, and isometric diorama composition very well. Grok Imagine (Image A) is the winner as it better captures the '3D cartoon miniature' aesthetic with a much cleaner and more professional-looking base, whereas Recraft V4 Pro (Image B) has a strangely textured base that detracts from the overall appeal.

Adorable Baby Animals in Sunny Meadow

Text-to-Image

“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”

Grok Imagine Image
Recraft V4 Pro

AI Judge Analysis

Grok Imagine Image

  • + Features a very vibrant and cheerful color palette
  • + Successfully captures all animal types in a cohesive group
  • Lighting and textures look more like a digital illustration than a hyper-photorealistic photo
  • Fails to include the butterflies requested in the prompt

Recraft V4 Pro

  • + Achieves a high level of photorealism in fur textures and lighting
  • + Includes all requested elements including butterflies and god rays
  • The kitten and fox anatomy appears slightly distorted in their posing
  • The background hills look a bit flat compared to the foreground

Verdict: Recraft V4 Pro better adheres to the 'hyper-photorealistic' instruction by using natural lighting and realistic textures, whereas Grok Imagine delivers a stylized digital art look. Recraft V4 Pro also successfully includes the butterflies and dew sparkles which are missing in the other output.

Vintage Cafe Logo

Text-to-Image

“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”

Grok Imagine Image
Recraft V4 Pro

AI Judge Analysis

Grok Imagine Image

  • + Excellent typography rendering with the correct accent on 'Caffè'.
  • + Clean vector aesthetic that fits the 'emblem style' prompt.
  • + Includes all requested elements including the cloche, steam, and multiple instances of the established date.
  • The 'Est. 1720' is displayed twice and not in the requested banner format.
  • Includes strange artifacts sticking out of the right side of the cloche that look like a broken spoon or handle.

Recraft V4 Pro

  • + Successfully includes the 'Est. 1720' banner as requested.
  • + Features a more elegant, handwritten vibe with high-quality line work on the cloche.
  • + Stronger composition with a more balanced vertical flow.
  • The accent over the 'e' in 'Caffè' is stylized into the letter rather than being a distinct diacritic.
  • The steam appears below the cloche rather than rising from it, which is physically counter-intuitive.

Verdict: Grok Imagine delivers a cleaner vector emblem that is highly legible, though it contains some nonsensical graphical artifacts on the right side of the icon. Recraft V4 Pro follows the specific prompt details better by including the requested banner, and its hand-drawn vintage style feels more authentic to a historical restaurant logo. Recraft V4 Pro is the winner for its superior artistic composition and adherence to the banner requirement.

Apollo 11: Journey to Tranquility

Text-to-Image

“Create a clean, modern vector infographic poster about the Apollo 11 mission. NASA-inspired palette (navy, white, muted red, light gray). Flat-vector style, crisp lines, consistent iconography, subtle gradients only. Steps (stop at landing): 1. Launch (Saturn Vicon) 2. Earth Orbit (Earth + orbit ring icon) 3. Translunar (trajectory arc icon) 4. Lunar Orbit (Moon + orbit ring icon) 5. Descent (lunar module descending icon) 6. Landing (lunar module on the surface icon) Small supporting elements (minimal text): • Crew strip: three silhouette icons with only last names: Armstrong, Aldrin, Collins. • Landing site marker: Moon pin labeled "Tranquility" only. Layout constraints: generous margins, large readable labels, clean background with subtle stars. Vector-only, print-poster look, high resolution.”

Grok Imagine Image
Recraft V4 Pro

AI Judge Analysis

Grok Imagine Image

  • + Excellent adherence to the dark navy NASA palette and space-themed background.
  • + Successful rendering of all six specific steps requested with matching icons.
  • + Includes extra details like the astronaut names and a stylized NASA logo.
  • Several spelling errors in the labels (e.g., 'Translunery', '3rajcoory').
  • The lunar module icons for Descent and Landing look more like helmets or robots than the actual module.

Recraft V4 Pro

  • + Exceptional text legibility and perfect spelling throughout the infographic.
  • + Superior modern vector aesthetic with clean linework and professional font choices.
  • + The icons for the lunar module are more recognizable and correctly styled.
  • The Saturn V rocket is elongated with incorrect proportions.
  • Uses a light background which differs slightly from the traditional navy 'NASA-inspired' dark palette expectation.

Verdict: Both models followed the prompt instructions very well, correctly identifying the six steps of the mission. However, Recraft V4 Pro is the clear winner due to its professional, clean vector style and perfect text rendering, whereas Grok Imagine suffered from significant spelling errors and less polished iconography despite having a more appealing background color.

Next steps

Explore each model