GPT Image 2 vs Recraft V4 Pro

Head-to-head across 7 challenges

GPT Image 2

66.7%

win rate

Ties

0.0%

Recraft V4 Pro

33.3%

win rate

66.7% 0.0% ties 33.3%

Challenge Results

Modern Clean Menu

Text-to-Image

“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”

GPT Image 2
Recraft V4 Pro

AI Judge Analysis

GPT Image 2

  • + Excellent typography and layout design that looks like a professionally produced menu.
  • + High-quality, appetizing food photography with consistent lighting.
  • + Perfectly legible text including descriptions, prices, and contact information.
  • The 'Sirloin Steak' image contains some minor blurry artifacts in the top left corner of the plate.

Recraft V4 Pro

  • + Strict adherence to the 'grid' request with a clean 3x3 layout.
  • + Accurate spelling and legible text throughout the design.
  • + Good implementation of bold sans-serif fonts as requested.
  • The layout is very rigid and lacks the branding elements of a real-world menu.
  • Food images are slightly inconsistent in crop and background style.
  • The Spaghetti Bolognese image appears slightly low-contrast and less appetizing compared to others.

Verdict: GPT Image 2 produces a significantly more professional and realistic menu design with branding, color accents, and a dynamic yet organized layout. While Recraft V4 Pro follows the grid prompt strictly, it feels like a template rather than a finished design. GPT Image 2 is the preferred choice for its superior visual appeal and comprehensive execution of the 'modern minimalist' aesthetic.

Magic Burger Explosion: Fiery Photorealism Challenge

Text-to-Image

“Ad for 'Magic Burger'. Dynamic, exploded burger with all components (bun, patty, cheese, lettuce, tomato, sauce) suspended in mid-air. Emphasize photorealistic detail and a sense of motion. Dark, fiery background with glowing embers. Integrate text: 'MAGIC BURGER' as a prominent title, 'LIMITED TIME ONLY' as a secondary message, and '€6.99' in a starburst, all rendered with a fiery, glowing effect.”

GPT Image 2
Recraft V4 Pro

AI Judge Analysis

GPT Image 2

  • + Excellent integration of text with a fiery, glowing effect that perfectly matches the prompt.
  • + Higher level of photorealistic detail in the food textures, particularly the patty and buns.
  • + Dynamic and well-balanced composition that fills the frame effectively.
  • The background is quite busy, which slightly reduces the contrast of the smaller embers.

Recraft V4 Pro

  • + Clean layout with clear separation between the burger and the background.
  • + Correct inclusion of all requested elements including the starburst shape for the price.
  • The text rendering is less cohesive, with the fire texture appearing as a flat fill rather than a glow.
  • The food components look somewhat flatter and less appetizing compared to the other model.
  • Minimal sense of motion or 'explosion' despite the prompt requirements.

Verdict: GPT Image 2 followed the prompt's aesthetic requirements much more closely, particularly regarding the 'fiery, glowing effect' for the text which looks integrated into the scene. While Recraft V4 Pro correctly included all text elements, the overall visual execution feels more like a basic composite, whereas GPT Image 2 delivers a professional, high-impact advertisement look.

Chalkboard Menu

Text-to-Image

“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”

GPT Image 2
Recraft V4 Pro
100% wins 0% ties 0% wins

AI Judge Analysis

GPT Image 2

  • + Excellent chalk texture with natural graininess and pressure variations
  • + Perfectly captures the authentic 'handwritten' aesthetic requested
  • + High resolution for all text characters
  • The date '2026' has some slight character distortion on the 6

Recraft V4 Pro

  • + Clean layout with great spacing between menu items
  • + Excellent environment lighting and composition
  • The text looks like a digital comic-sans style font rather than actual chalk handwriting
  • The title fails to use the requested elegant cursive style
  • The chalkboard texture on the board surface is visible, but the text itself lacks chalk artifacts

Verdict: GPT Image 2 is the superior response as it captures the 'handwritten chalk' nuance perfectly, providing realistic texture and variation in letterforms. Recraft V4 Pro produces text that looks like a clean digital overlay, failing to meet the prompt's stylistic requirements for authentic handwriting and cursive titles.

The Reversed Rodeo

Text-to-Image

“Horse riding astronaut in space — horse on top, not vice versa. Surreal, highly detailed, cinematic.”

GPT Image 2
Recraft V4 Pro

AI Judge Analysis

GPT Image 2

  • + Excellent adherence to the specific 'horse on top' role-reversal instruction
  • + High level of texture detail in the spacesuit and lunar ground
  • + Creative use of a saddle on the astronaut's back
  • The horse's front legs and harness connection are anatomically confusing
  • Very centered and static composition compared to Model B

Recraft V4 Pro

  • + Stunning cinematic lighting and dynamic composition
  • + Exceptional visual quality and atmospheric effects
  • + High level of realism in the horse and astronaut renders
  • Completely failed the negative constraint to put the horse on top
  • Portrays a standard horse riding trope instead of the requested surreal reversal

Verdict: GPT Image 2 is the clear winner because it followed the specific, difficult instruction to reverse the roles and put the horse on top of the astronaut. Recraft V4 Pro produced a much more beautiful and cinematic image, but it failed the primary prompt constraint by placing the astronaut on top of the horse.

The Capybara Taxi Driver

Text-to-Image

“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”

GPT Image 2
Recraft V4 Pro
100% wins 0% ties 0% wins

AI Judge Analysis

GPT Image 2

  • + Excellent close-up detail on the capybara's fur and the texture of its jacket.
  • + Cinematic lighting that accurately reflects a city at night.
  • + Stronger 'professional' expression on the capybara's face.
  • The capybara's paws lack clear anatomical definition against the steering wheel.
  • The passenger is slightly out of focus, making it harder to judge her expression.

Recraft V4 Pro

  • + Great composition showing more of the taxi exterior and the rainy Manhattan atmosphere.
  • + Both front paws are clearly visible and placed on the steering wheel as requested.
  • + The passenger's bored expression and use of the phone are very clear.
  • The capybara's jacket looks a bit more like a human-sized shirt that doesn't fit well.
  • The lighting on the capybara is slightly flat compared to the background.

Verdict: Both models followed the prompt exceptionally well, capturing the surreal humor of the scene with high photorealism. GPT Image 2 has superior textures and lighting for a cinematic feel, while Recraft V4 Pro offers a better composition that clearly shows both characters and the rainy city environment. Recraft V4 Pro is the likely winner for its clearer storytelling and precise adherence to the pose and background details.

The Halloween Invitation

Text-to-Image

“Vintage gothic Halloween party invitation. Dark parchment poster, spooky border with webs and thorns, central glowing jack-o-lantern, bats, twisted trees, moody night sky. Add elegant gothic title text saying "Halloween Party Invitation", a small scroll banner saying "You are invited to a night of frights", and event details at the bottom: Date: 30.10.2026 Time: 7pm Location: The Arches, NYC Spooky but polished, cinematic lighting, square format.”

GPT Image 2
Recraft V4 Pro

AI Judge Analysis

GPT Image 2

  • + Excellent typography with a true vintage gothic aesthetic.
  • + Superior composition that feels like a completed, high-end invitation.
  • + Deeper adherence to the 'dark parchment' and 'thorns' prompt elements.
  • The text 'Invitation' overlaps slightly with the background moon.
  • Specific requested location 'The Arches' is represented by a bridge, which is a bit literal but works.

Recraft V4 Pro

  • + Crisp and highly legible text at the bottom and top.
  • + Clear, vibrant glowing jack-o-lantern center.
  • The layout is less like a 'poster' and more like a standard photo with text overlays.
  • The 'banner' for the secondary text is much too small and lacks the asked-for scroll detail.
  • The border is very thin and lacks the intricate details requested.

Verdict: GPT Image 2 (Model A) is the clear winner as it successfully captures the 'vintage gothic' and 'parchment poster' aesthetic, creating a cohesive design. In contrast, Recraft V4 Pro (Model B) feels like a modern photograph with basic text overlays, failing to integrate the border, banner, and vintage texture effectively.

Vintage Cafe Logo

Text-to-Image

“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”

GPT Image 2
Recraft V4 Pro
0% wins 0% ties 100% wins

AI Judge Analysis

GPT Image 2

  • + Excellent typography with perfect spelling and accent marks
  • + Sophisticated use of texture and light background for a vintage feel
  • + Well-balanced composition with an intricate border and banner
  • The steam effect looks a bit like a single hair or a flame rather than vapor

Recraft V4 Pro

  • + Successfully incorporates a more modern 'minimalist' vector style as requested
  • + Creative use of steam rising between the text and the cloche
  • + Accurate text and date rendering
  • The cloche rendering is slightly messy with hatching lines that don't look clean
  • Missing the accent on the 'e' in 'Caffè' (replaced with a generic serif shape)
  • Background lacks the 'subtle texture' requested, appearing flat white

Verdict: GPT Image 2 captured the 'vintage' and 'warm' aesthetic more effectively, delivering a high-quality emblem with great attention to detail in the typography and texture. While Recraft V4 Pro followed the 'minimalist' part of the prompt well, its execution felt less finished and missed the specific textural quality requested.

GPT Image 2

OpenAI's state-of-the-art image generation model with arbitrary resolution up to 4K and strong instruction following

Recraft V4 Pro

Recraft's latest image generation model at ~2048px resolution with stronger composition, refined lighting, and realistic materials for print-ready and large-scale work