xAI's premium image generation model offering higher fidelity output and stronger performance on single-image editing benchmarks compared to the standard Grok Imagine model
Settled by community votes across 11 shared challenges, with an AI judge weighing in on each.
Grok Imagine Image Pro
#14 of 44 in Text-to-Image
Not enough comparable category data
The chart appears once both models have ratings across at least three shared arena categories.
Recraft V4
#8 of 44 in Text-to-Image
Where the votes landed
Grok Imagine Image Pro
40.0%
win rate
Ties
40.0%
Recraft V4
20.0%
win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Geometric Composition
Text-to-Image“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”
AI Judge Analysis
Grok Imagine Image Pro
- + Excellent reflection and refraction physics within the glass cube.
- + High-quality text rendering on the book spine.
- + Realistic lighting and texture on the wooden table.
- − The glass cube has an open top rather than being a solid or fully enclosed cube structure.
Recraft V4
- + Successfully captured the 'sphere inside a cube' concept with a solid glass aesthetic.
- + Good depth of field with the plant clearly positioned behind the glass.
- − The sphere appears to be floating unnaturally in the center of the cube.
- − The perspective of the cube is slightly skewed, looking less like a perfect cube and more like a thick-walled block.
Verdict: Grok Imagine Image Pro produced a more aesthetically pleasing and realistic image with superior lighting and texture, though it interpreted the cube more as a glass 'case' without a lid. Recraft V4 followed the literal prompt of placing the sphere 'inside' the volume of the glass more closely, but the levitation of the sphere and the lack of top-down perspective made it feel less grounded. Grok Imagine Image Pro is the winner for its impressive handling of complex reflections and shadows.
Candid Street Photography
Text-to-Image“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”
AI Judge Analysis
Grok Imagine Image Pro
- + Excellent anatomical detail in the hands and face
- + Superior technical execution of the bicycle geometry
- + Realistic skin texture and garment materials
- − The rain effect is very subtle compared to the prompt
- − Motion blur on cars is relatively limited
Recraft V4
- + Excellent depiction of heavy rain and wet surface reflections
- + Stronger sense of motion blur and atmosphere
- + Dynamic 'imperfect' framing that feels more like a candid street photo
- − Significant anatomical issues where the hands meet the bicycle frame
- − Distorted bicycle wheel and fork geometry
- − Lower resolution in the subject's facial details
Verdict: Grok Imagine Image Pro produces a much more technically sound image with clean anatomy and realistic object geometry, making it the more professional-looking output. Recraft V4 captures the rainy atmosphere and 'candid' motion better, but it suffers from severe distortions in the man's hands and the structure of the red bicycle.
Fantasy Warrior
Text-to-Image“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”
AI Judge Analysis
Grok Imagine Image Pro
- + Exceptional textural detail on the engraved plate armor and fabric under-layers.
- + Very sharp, lifelike eyes with realistic light reflections.
- + Excellent adherence to the 'beads' and 'braids' prompt with distinct, high-quality jewelry.
- − The facial features and 'battle-worn' elements appear slightly too clean and Hollywood-stylized.
- − The depth of field is a bit flat across the character's shoulders.
Recraft V4
- + Conveys a more realistic sense of grit and 'battle-worn' exhaustion through skin texture and lighting.
- + The lighting is more cinematic, with a stronger orange-to-blue contrast.
- + Good inclusion of various leather strap textures and cloth elements.
- − The armor engraving is less detailed and lacks the high-fidelity crispness seen in the other model.
- − The hair braids and beads are a bit messy and less defined.
- − Anatomical integration near the neck/gorget area is slightly awkward.
Verdict: Grok Imagine Image Pro creates a visually stunning image with high-frequency detail in the armor and clear, beautiful eyes, though it looks a bit like a high-budget video game character. Recraft V4 captures the 'battle-worn' mood and lighting more effectively but falls short on the fine engraving and bead details requested in the prompt. Grok Imagine Image Pro is the preferred choice for its superior technical clarity and meticulous adherence to the material requests.
Modern Clean Menu
Text-to-Image“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”
AI Judge Analysis
Grok Imagine Image Pro
- + Excellent photo quality with rich detail and vibrant colors.
- + Strict adherence to the requested grid layout with distinct sections.
- + Includes realistic footer information like address and phone number.
- − Text rendering is shaky with some spelling errors (e.g., 'Salmom', 'Pepperani').
- − Body text is repetitive, using the same description for multiple items.
Recraft V4
- + Near-perfect text legibility and correct spelling across all items.
- + Clean, minimalist aesthetic that feels like a modern brand identity.
- + Food images have consistent lighting and a professional cut-out look.
- − The 'grid' is a bit loose with uneven spacing between items.
- − Food photos are less visually 'lush' compared to the 3D realism of the competitor.
Verdict: Recraft V4 is the clear winner for a design challenge because it produces perfectly legible, correctly spelled text that is actually usable in a professional context. While Grok Imagine Image Pro offers more vibrant food photography, its text rendering is messy and the repetitive descriptions break the illusion of a real menu.
Chalkboard Menu
Text-to-Image“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”
AI Judge Analysis
Grok Imagine Image Pro
- + Excellent chalk texture throughout the board
- + Highly realistic variation in letter size and baseline slant
- + The handwriting looks genuinely human-made rather than a font
- − The 'Today's Specials' title is in print-style rather than the requested elegant cursive
- − The board occupies almost the entire frame with very little café context visible
Recraft V4
- + Beautiful composition and depth of field showing a cozy café background
- + Excellent prompt adherence for the cursive style requested in the title
- + Very clean and readable handwriting
- − The chalk texture is a bit too uniform, appearing slightly digital in some places
- − The handwriting is almost too perfect, missing some of the 'natural variations' requested in the prompt
Verdict: Both models followed the complex text requirements perfectly, which is impressive. Grok Imagine Image Pro has superior chalk texture and more realistic 'imperfect' handwriting, while Recraft V4 followed the specific stylistic instruction for a cursive title and provided a much more aesthetically pleasing café background scene.
The Reversed Rodeo
Text-to-Image“Horse riding astronaut in space — horse on top, not vice versa. Surreal, highly detailed, cinematic.”
AI Judge Analysis
Grok Imagine Image Pro
- + Excellent adherence to the 'horse on top' spatial instruction
- + Vibrant, cinematic colors and a high-quality nebula background
- + Creative and surreal composition that matches the requested vibe
- − The horse is floating above the astronaut rather than physically 'riding' him
- − Some anatomical awkwardness in the horse's back leg
Recraft V4
- + High technical detail and realistic lighting
- + Very cinematic atmospheric effects with dust and asteroids
- − Failed the primary negative constraint: the astronaut is riding the horse
- − The prompt specifically asked for the horse to be on top
Verdict: Grok Imagine Image Pro successfully followed the specific and difficult instruction to have the horse riding the astronaut, creating a truly surreal image. Recraft V4 ignored the specific 'not vice versa' instruction and generated a standard astronaut riding a horse, which is a much common training data pattern. Grok is the clear winner for prompt adherence.
The Capybara Taxi Driver
Text-to-Image“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”
AI Judge Analysis
Grok Imagine Image Pro
- + Excellent text rendering on the hat with specific NYC TLC Medallion details.
- + Superior photographic clarity and realistic lighting on the subjects.
- + Accurately captures the 'bored' expression requested for the passenger.
- − The perspective from the dashboard feels a bit cramped compared to a side view.
- − The capybara's hands look slightly more like paws with claws than natural limb placement.
Recraft V4
- + The side-profile composition effectively shows both the driver and the passenger in a balanced way.
- + Detailed textures on the passenger's coat and the taxi seats.
- + Good atmospheric rain effect on the car windows.
- − The capybara's hat is simple and lacks the requested 'taxi driver' branding or text.
- − The lighting is somewhat flat and less cinematically 'photorealistic' than Model A.
- − Perspective issues with the passenger's feet and the car floor.
Verdict: Grok Imagine Image Pro wins due to its superior photorealistic quality and precise adherence to the text details on the taxi cap. While Recraft V4 offers an interesting side-profile composition, Grok's output feels much more like a high-end film still with better facial expressions and lighting.
Isometric Miniature Diorama Scenes
Text-to-Image“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”
AI Judge Analysis
Grok Imagine Image Pro
- + Excellent 3D cartoon aesthetic with soft, pillowy textures.
- + Perfectly executed text and flag icon placement.
- + Clean isometric composition with a high-quality wood grain base.
- − The rice grains are a bit large and stylized, leaning more towards 'toy' than 'realistic PBR'.
Recraft V4
- + Features more realistic PBR textures on the fish and rice.
- + Interesting use of a crystal/ice diorama base.
- + Accurate text rendering and centered composition.
- − The 'JAPAN' text is slightly thinner and less impactful than requested.
- − The lighting transition on the base creates a harsh shadow that detracts from the 'gentle lighting' prompt.
Verdict: Grok Imagine Image Pro perfectly captured the 3D cartoon miniature aesthetic with a clean, professional finish and superior typography. While Recraft V4 offered more realistic textures for the food itself, it failed to harmonize the 'cartoon' and 'realistic' elements as effectively as Grok.
Adorable Baby Animals in Sunny Meadow
Text-to-Image“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”
AI Judge Analysis
Grok Imagine Image Pro
- + Excellent depiction of a wide variety of meadow flowers
- + Playful interaction between the fox and the group
- + Distinct and realistic god rays from the sun
- − Generated two kittens instead of one
- − The puppy's paw appears slightly disconnected or anatomically awkward due to the angle
- − Lighting feels a bit more like a digital illustration than a photorealistic shot
Recraft V4
- + Perfect adherence to the requested animal count (one of each)
- + Incredible fur texture and ultra-realistic lighting with dew sparkles
- + Dynamic composition with animals running toward the viewer
- − The bunny is positioned somewhat oddly in the air behind the others
- − Some butterflies are very small and look like simple flecks of color
Verdict: Recraft V4 is the winner as it accurately followed the prompt's request for one kitten, whereas Grok Imagine Image Pro added a second kitten. Recraft V4 also achieved a higher level of photorealism with its lighting and fur textures, whereas Grok's output leaned slightly more toward a high-end digital painting style.
Vintage Cafe Logo
Text-to-Image“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”
AI Judge Analysis
Grok Imagine Image Pro
- + Excellent typography rendering with perfect spelling of 'Caffè Florian'.
- + Strong vector emblem composition with a well-integrated banner.
- + Accurate colors and subtle cloth-like background texture.
- − The steam swirl is a bit generic and thin compared to the thicker vector lines.
- − Composition is a bit safe/traditional compared to more creative font choices.
Recraft V4
- + Elegant custom typography with a sophisticated retro feel.
- + Great use of engraving-style linework on the cloche for texture.
- + Clean minimalist aesthetic that feels like a modern luxury brand.
- − Missed the 'Est. 1720' banner requirement; it is placed in a semi-circle instead.
- − The steam element is very small and lacks the 'vintage' feel of the rest of the icon.
- − Overall layout is less 'emblem style' than requested.
Verdict: Grok Imagine Image Pro followed the prompt instructions more precisely, including the specific banner detail and the round emblem style. While Recraft V4 produced more stylish and high-end typography, its failure to include the requested banner makes Grok Imagine Image Pro the winner for accuracy.
Apollo 11: Journey to Tranquility
Text-to-Image“Create a clean, modern vector infographic poster about the Apollo 11 mission. NASA-inspired palette (navy, white, muted red, light gray). Flat-vector style, crisp lines, consistent iconography, subtle gradients only. Steps (stop at landing): 1. Launch (Saturn Vicon) 2. Earth Orbit (Earth + orbit ring icon) 3. Translunar (trajectory arc icon) 4. Lunar Orbit (Moon + orbit ring icon) 5. Descent (lunar module descending icon) 6. Landing (lunar module on the surface icon) Small supporting elements (minimal text): • Crew strip: three silhouette icons with only last names: Armstrong, Aldrin, Collins. • Landing site marker: Moon pin labeled "Tranquility" only. Layout constraints: generous margins, large readable labels, clean background with subtle stars. Vector-only, print-poster look, high resolution.”
AI Judge Analysis
Grok Imagine Image Pro
- + Excellent adherence to the vertical timeline structure requested.
- + Perfect text rendering for all mission phases and crew names.
- + Highly consistent iconography within a unified design system.
- − The 'Descent' icon includes a large red rocket flame which feels slightly less 'flat-vector' than the rest of the design.
- − Small spelling error in 'Tranquility' (missing one 'i').
Recraft V4
- + Modern, dynamic layout with high-quality vector illustrations.
- + Beautifully rendered Saturn V and Lunar Module icons with crisp lines.
- + Strong use of the specified NASA-inspired color palette.
- − The 'Lunar Orbit' icon mistakenly uses an Earth-like planet instead of a Moon.
- − The 'Earth Orbit' illustration is missing the orbit ring requested.
- − Text rendering is slightly inconsistent, with some labels floating without clear alignment.
Verdict: Grok Imagine Image Pro followed the specific infographic structure much better, providing all six steps in a clear, logical vertical timeline with perfect text. Recraft V4 produced more visually ambitious vector illustrations, but failed on major details like using the wrong planet for the lunar orbit and missing the specific orbit ring requests.
Explore each model
Recraft's latest text-to-image generation model with high-quality output, supporting various aspect ratios and custom color palettes