GPT Image 2 vs Qwen Image 2.0

Head-to-head across 2 challenges

GPT Image 2

100.0%

win rate

Ties

0.0%

Qwen Image 2.0

0.0%

win rate

100.0% 0.0% ties 0.0%

Challenge Results

The Reversed Rodeo

Text-to-Image

“Horse riding astronaut in space — horse on top, not vice versa. Surreal, highly detailed, cinematic.”

GPT Image 2
Qwen Image 2.0
100% wins 0% ties 0% wins

AI Judge Analysis

GPT Image 2

  • + Excellent adherence to the specific 'horse on top' prompt instruction
  • + Realistic textures on the space suit and horse fur
  • + Coherent surrealist composition with the horse holding the reins
  • The astronaut's hands/gloves are shaped more like feet than human hands

Qwen Image 2.0

  • + High visual quality with vibrant colors and lighting
  • + Dynamic sense of motion with the horse running through space
  • + Good level of detail on the space suit and horse's mane
  • Failed the primary prompt instruction to have the horse on top of the astronaut
  • Included strange scaly skin artifacts on the horse's neck

Verdict: GPT Image 2 successfully followed the difficult logical constraint of the prompt ('horse on top, not vice versa'), creating a surreal and humorous image. Qwen Image 2.0 ignored the specific positioning instructions and produced a standard 'astronaut riding a horse' image. Despite some minor anatomical issues with the astronaut's hands, GPT Image 2 is the winner for its superior prompt adherence.

The Halloween Invitation

Text-to-Image

“Vintage gothic Halloween party invitation. Dark parchment poster, spooky border with webs and thorns, central glowing jack-o-lantern, bats, twisted trees, moody night sky. Add elegant gothic title text saying "Halloween Party Invitation", a small scroll banner saying "You are invited to a night of frights", and event details at the bottom: Date: 30.10.2026 Time: 7pm Location: The Arches, NYC Spooky but polished, cinematic lighting, square format.”

GPT Image 2
Qwen Image 2.0

AI Judge Analysis

GPT Image 2

  • + Excellent typography with a cohesive gothic font for all sections.
  • + Stunning visual complexity with intricate borders, background silhouettes, and atmospheric lighting.
  • + Perfect adherence to all prompt details, including the specific date and location.
  • The parchment texture is quite dark, making parts of the thorns less distinct.

Qwen Image 2.0

  • + Clean, readable layout with a clear focal point on the jack-o-lantern.
  • + Accurately represents all requested text and objects.
  • The background and trees look somewhat generic compared to the 'cinematic' request.
  • Text rendering on the scroll is slightly shaky and less integrated than Model A.
  • Lighting feels flat and lacks the 'moody' atmosphere requested.

Verdict: GPT Image 2 is the superior choice as it fully captures the 'cinematic' and 'vintage gothic' aesthetic with professional-grade typography and rich textures. While Qwen Image 2.0 followed the prompt instructions accurately, its execution is much simpler and looks more like a standard clip-art flyer than a polished invitation.

GPT Image 2

OpenAI's state-of-the-art image generation model with arbitrary resolution up to 4K and strong instruction following

Qwen Image 2.0

Alibaba's Qwen Image 2.0 model with enhanced text rendering, supporting both Chinese and English prompts with up to 6 images per request