Stable Diffusion 3.5 Medium vs Wan 2.7

Head-to-head across 2 challenges

Stable Diffusion 3.5 Medium

50.0%

win rate

Ties

50.0%

Wan 2.7

0.0%

win rate

50.0% 50.0% ties 0.0%

Challenge Results

The Reversed Rodeo

Text-to-Image

“Horse riding astronaut in space — horse on top, not vice versa. Surreal, highly detailed, cinematic.”

Stable Diffusion 3.5 Medium
Wan 2.7
50% wins 50% ties 0% wins

AI Judge Analysis

Stable Diffusion 3.5 Medium

  • + Excellent visual realism with cinematic lighting
  • + Complex interaction between the horse's legs and the planetary horizon creates a unique surrealist effect
  • Failed the negative constraint entirely by placing the astronaut on top of the horse
  • The astronaut appears to have three legs or an extremely distorted limb

Wan 2.7

  • + Clean, high-resolution textures on both the spacesuit and the horse
  • + Creative addition of multiple small planets and satellites to enhance the space theme
  • Failed the core specific instruction to place the horse on top of the astronaut
  • The composition feels a bit cluttered with repetitive planet assets

Verdict: Both Stable Diffusion 3.5 Medium and Wan 2.7 failed the challenging semantic constraint of placing the 'horse on top' of the astronaut, both defaulting to the common trope of an astronaut riding a horse. Wan 2.7 is slightly better in terms of technical image quality and anatomy, whereas Stable Diffusion 3.5 Medium has significant anatomical glitches in the astronaut's legs.

The Halloween Invitation

Text-to-Image

“Vintage gothic Halloween party invitation. Dark parchment poster, spooky border with webs and thorns, central glowing jack-o-lantern, bats, twisted trees, moody night sky. Add elegant gothic title text saying "Halloween Party Invitation", a small scroll banner saying "You are invited to a night of frights", and event details at the bottom: Date: 30.10.2026 Time: 7pm Location: The Arches, NYC Spooky but polished, cinematic lighting, square format.”

Stable Diffusion 3.5 Medium
Wan 2.7

AI Judge Analysis

Stable Diffusion 3.5 Medium

  • + Features a classic spooky aesthetic with vibrant jack-o-lanterns.
  • + Good contrast between the parchment and the dark background.
  • Numerous spelling errors including Halloweeen and Inviloween.
  • The layout is cluttered and the text rendering is distorted.
  • Failed to include the specific scrolls and banner design requested.

Wan 2.7

  • + Excellent text rendering with near-perfect spelling for all requested details.
  • + Superior composition with a clear central focal point and elegant gothic framing.
  • + Followed all complex instructions including the small scroll banner and specific date/time.
  • The lighting on the central pumpkin is slightly flatter than Model A's version.
  • The text 'You are invited...' is slightly off-center on its banner.

Verdict: Wan 2.7 significantly outperforms Stable Diffusion 3.5 Medium by correctly rendering all the requested text and event details with a high degree of legibility. While Stable Diffusion 3.5 Medium captures a moody atmosphere, its failure to spell basic words and follow the layout instructions makes it unusable as an invitation. Wan 2.7 produced a polished, professional-looking design that perfectly adheres to the gothic vintage prompt.

Stable Diffusion 3.5 Medium

Stability AI's 2.5-billion parameter Multimodal Diffusion Transformer with improvements (MMDiT-X) text-to-image model optimized for consumer hardware, featuring improved image quality, typography, and complex prompt understanding

Wan 2.7

Alibaba's Wan 2.7 image generation and editing model for text-to-image, reference-guided generation, and instruction-based image edits