Recraft's latest text-to-image generation model with high-quality output, supporting various aspect ratios and custom color palettes
Settled by community votes across 8 shared challenges, with an AI judge weighing in on each.
Recraft V4
#8 of 44 in Text-to-Image
Not enough comparable category data
The chart appears once both models have ratings across at least three shared arena categories.
Stable Diffusion 3.5 Large
#25 of 44 in Text-to-Image
Where the votes landed
Recraft V4
85.7%
win rate
Ties
0.0%
Stable Diffusion 3.5 Large
14.3%
win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Geometric Composition
Text-to-Image“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”
AI Judge Analysis
Recraft V4
- + Perfect adherence to spatial instructions including the book on top and sphere inside.
- + Highly realistic textures for the wood, paper, and glass.
- + Sophisticated lighting and reflections consistent with a window source.
- − The sphere appears to be floating without a visible support structure, which might look slightly unnatural.
Stable Diffusion 3.5 Large
- + Clean, modern aesthetic with bright lighting.
- + Accurately places a green plant behind the glass structure.
- − Fails the spatial prompt by putting the cube on top of the book and the sphere on top of the book.
- − The sphere is technically outside the glass cube (it is sitting on the book which is under the cube).
- − Harsh lighting creates slightly blown-out highlights on the wood.
Verdict: Recraft V4 followed all spatial instructions perfectly, placing the sphere inside the cube and the book on top. Stable Diffusion 3.5 Large struggled with the complex prepositional logic, reversing the order of the objects. Recraft V4 also produced a much more photorealistic image with better material rendering.
Candid Street Photography
Text-to-Image“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”
AI Judge Analysis
Recraft V4
- + Excellent handling of motion blur on passing cars as requested.
- + Authentic street photography feel with 'imperfect' framing and natural-looking background clutter.
- + Realistic rendering of wet pavement and light reflection.
- − The structural anatomy of the bicycle is broken, especially the front fork and wheel attachment.
- − The man's hand is poorly defined and blends into the bike frame.
Stable Diffusion 3.5 Large
- + Stronger anatomical realism of the subject's face and skin texture.
- + The bicycle structure is more coherent and recognizable.
- + Vibrant color palette and good use of shallow depth of field.
- − Failed to include 'motion blur' on the car in the background, which is sharp.
- − The lighting on the man's hair and clothes feels slightly like a studio setup rather than natural street lighting.
Verdict: Recraft V4 captures the 'candid street photo' aesthetic and complex prompt requirements like motion blur much better than the competitor. However, Stable Diffusion 3.5 Large produces a much cleaner and more anatomically correct primary subject, even if it ignores the request for motion blur on passing vehicles.
Fantasy Warrior
Text-to-Image“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”
AI Judge Analysis
Recraft V4
- + Excellent depiction of multiple braids with beads as requested.
- + Realistic weathering with dirt and grit that feels integrated into the skin.
- + Outstanding detail on the leather straps and garment underlayers.
- − The sparks look like simple white dots rather than glowing embers.
- − The lighting is slightly flat despite the presence of the torch in the background.
Stable Diffusion 3.5 Large
- + Beautifully intricate engraving on the plate armor.
- + Complex and realistic hair texture and braiding.
- + Stronger contrast and more cinematic lighting composition.
- − Missed the request for beads in the hair braids.
- − The chainmail/underlayer texture is slightly mushy in certain areas compared to the armor.
Verdict: Recraft V4 followed the specific technical details of the prompt more closely, particularly regarding the beads in the hair and the texture of the leather straps. Stable Diffusion 3.5 Large produced a more aesthetically striking image with superior armor engraving, but it missed the small beads and the skin weathering was slightly less pronounced than in Recraft V4.
Modern Clean Menu
Text-to-Image“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”
AI Judge Analysis
Recraft V4
- + Exceptional text rendering with perfect spelling of all menu items and headers.
- + Clean, professional grid layout that mimics a real-world digital or print menu.
- + Consistent lighting and high-quality product photography for all food items.
- − Minimalist aesthetic might feel a bit sparse to some users.
- − Some items are missing descriptions compared to others in the grid.
Stable Diffusion 3.5 Large
- + Vibrant color palette with a dynamic use of food photography borders.
- + Creative layout that feels more like a sophisticated brand identity.
- − Poor text rendering with numerous spelling errors like 'APPETIZRS' and 'MAIMAES'.
- − The grid layout feels cluttered and cuts off many of the food photos.
- − Fails to clearly separate the sections requested in an legible way.
Verdict: Recraft V4 is the clear winner as it produces a functional, professional menu with flawless text and a highly organized layout that perfectly matches the 'modern minimalist' prompt. Stable Diffusion 3.5 Large creates a more artistic composition but fails significantly on text legibility and practical usability, making it look like an AI-generated approximation rather than a usable design.
Isometric Miniature Diorama Scenes
Text-to-Image“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”
AI Judge Analysis
Recraft V4
- + Excellent typography and graphic design integration.
- + High-realism materials for the sushi and ice base.
- + Clean, professional composition following all layout instructions.
- − Leaner more towards photorealism than the requested 3D cartoon style.
- − The crushed ice texture is a bit messy compared to a clean diorama base.
Stable Diffusion 3.5 Large
- + Perfectly captures the 3D cartoon miniature/toy aesthetic.
- + Creative use of signage and flags to incorporate text into the scene.
- + Strong rendering of soft, stylized PBR materials.
- − Failed to place the text at the top-center of the image as specified.
- − Composition is a bit cluttered with more garnish than requested.
Verdict: Recraft V4 followed the layout and typography instructions perfectly, placing the text and flag exactly where requested with high clarity, though it opted for a more realistic style. Stable Diffusion 3.5 Large better captured the '3D cartoon miniature' aesthetic but moved the required text into the scene elements and failed the specific layout placement.
Adorable Baby Animals in Sunny Meadow
Text-to-Image“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”
AI Judge Analysis
Recraft V4
- + Excellent fur texture and realism in all four animals
- + Effectively captures the specific lighting effects like god rays and dew sparkles
- + Distinctly renders the fox, kitten, puppy, and bunny as requested
- − The bunny's paws look slightly amorphous and floating
- − Almost too sharp, pushing the photorealism into a slightly uncanny digital composite feel
Stable Diffusion 3.5 Large
- + Beautiful composition and bokeh effect in the background
- + Great sense of movement and 'tumbling' joy among the animals
- + Soft, warm lighting that perfectly matches the 'wholesome' vibe
- − The kitten and fox look very similar in facial structure
- − Lower overall detail in the fur texture compared to Model A
- − The butterfly on the rabbit's ear is poorly integrated
Verdict: Recraft V4 wins on technical execution and prompt adherence, providing high-detail textures and capturing all four specific animals with clear distinctions. Stable Diffusion 3.5 Large offers more artistic charm and better motion, but the animals look somewhat generic and the image lacks the '8K masterpiece' clarity of its competitor.
Vintage Cafe Logo
Text-to-Image“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”
AI Judge Analysis
Recraft V4
- + Excellent typography with a sophisticated, custom feel
- + Proper spelling of 'Caffè' including the accent
- + Clean vector aesthetic with professional cross-hatching details
Stable Diffusion 3.5 Large
- + Successfully included the 'banner' and 'texture' requested in the prompt
- + Good use of the 'Est. 1720' as a centered element
- + Warm cream and brown tones perfectly match the vintage brief
- − Misspelled the primary name as 'Cafféé'
- − The cloche icon looks disorganized with clashing steam/fire elements
- − The banner construction is slightly asymmetrical
Verdict: Recraft V4 produced a much more professional logo with superior typography and correct spelling, though it missed the specific 'banner' element and texture requested. Stable Diffusion 3.5 Large followed the layout instructions more literally by including the banner and background texture, but it failed on basic spelling and icon clarity. Recraft V4 is the preferred choice for a real-world logo application due to its polish and design coherence.
Apollo 11: Journey to Tranquility
Text-to-Image“Create a clean, modern vector infographic poster about the Apollo 11 mission. NASA-inspired palette (navy, white, muted red, light gray). Flat-vector style, crisp lines, consistent iconography, subtle gradients only. Steps (stop at landing): 1. Launch (Saturn Vicon) 2. Earth Orbit (Earth + orbit ring icon) 3. Translunar (trajectory arc icon) 4. Lunar Orbit (Moon + orbit ring icon) 5. Descent (lunar module descending icon) 6. Landing (lunar module on the surface icon) Small supporting elements (minimal text): • Crew strip: three silhouette icons with only last names: Armstrong, Aldrin, Collins. • Landing site marker: Moon pin labeled "Tranquility" only. Layout constraints: generous margins, large readable labels, clean background with subtle stars. Vector-only, print-poster look, high resolution.”
AI Judge Analysis
Recraft V4
- + Perfectly followed the 6-step sequence with accurate text labels.
- + Excellent text rendering and clean, modern vector iconography.
- + Correctly adhered to the specified NASA-inspired color palette.
- − The trajectory line for the descent step is a bit simplistic compared to the rest of the art.
Stable Diffusion 3.5 Large
- + Detailed illustration style that captures an older NASA technical poster aesthetic.
- + Good use of the navy and muted red color palette.
- − Failed to follow the requested 6-step sequence, providing jumbled elements instead.
- − Unreadable and garbled text throughout the infographic.
- − Incorrectly included a Space Shuttle-style vehicle instead of a Saturn V rocket.
Verdict: Recraft V4 produced a professional-grade infographic that followed every instruction, specifically accurately numbering the steps and rendering all text perfectly. Stable Diffusion 3.5 Large failed on prompt adherence, including nonsensical text and incorrect spacecraft (a shuttle) while ignoring the requested logical flow.
Explore each model
Stability AI's 8.1-billion parameter Multimodal Diffusion Transformer (MMDiT) text-to-image model featuring improved image quality, typography, complex prompt understanding, and resource-efficiency