Black Forest Labs' 12-billion parameter flow transformer for high-quality text-to-image generation, suitable for personal and commercial use with streaming support
Settled by community votes across 1 shared challenge, with an AI judge weighing in on each.
FLUX.1 [dev]
#42 of 44 in Text-to-Image
Not enough comparable category data
The chart appears once both models have ratings across at least three shared arena categories.
Qwen Image 2.0
#32 of 44 in Text-to-Image
Where the votes landed
FLUX.1 [dev]
0%
win rate
Ties
0%
Qwen Image 2.0
0%
win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
The Halloween Invitation
Text-to-Image“Vintage gothic Halloween party invitation. Dark parchment poster, spooky border with webs and thorns, central glowing jack-o-lantern, bats, twisted trees, moody night sky. Add elegant gothic title text saying "Halloween Party Invitation", a small scroll banner saying "You are invited to a night of frights", and event details at the bottom: Date: 30.10.2026 Time: 7pm Location: The Arches, NYC Spooky but polished, cinematic lighting, square format.”
AI Judge Analysis
FLUX.1 [dev]
- + Features a beautiful thorny border and central illustration.
- + Good contrast and cinematic lighting on the jack-o-lantern.
- − Numerous text errors including 'Falloween Rantcl' and 'You Tre'.
- − Did not include the spider webs requested in the prompt.
- − Poor text layout with redundant information like '7pm, 7pm'.
Qwen Image 2.0
- + Excellent text accuracy, rendering all requested words and details perfectly.
- + Successfully captures the 'vintage gothic parchment' aesthetic with webs and thorns.
- + Cinematic atmosphere with misty trees and a central glowing pumpkin.
- − The transition from the central scene to the parchment border is slightly harsh near the bottom.
Verdict: Qwen Image 2.0 is the clear winner as it followed every instruction, including the specific texture of the parchment, the inclusion of spider webs, and flawless text rendering. FLUX.1 [dev] struggled significantly with the text content and ignored the web requirement, resulting in a less functional invitation.
Explore each model
Alibaba's Qwen Image 2.0 model with enhanced text rendering, supporting both Chinese and English prompts with up to 6 images per request