Qwen Image 2512 vs Wan 2.6
Head-to-head across 4 challenges
Qwen Image 2512
50.0%
win rate
Ties
0.0%
Wan 2.6
50.0%
win rate
Challenge Results
Modern Clean Menu
Text-to-Image“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”
AI Judge Analysis
Qwen Image 2512
- + Excellent photographic quality and consistency in the food grid
- + Very intentional use of bold, condensed sans-serif fonts
- + Clean, highly structured layout with clear vertical divisions
- − Text rendering is mostly gibberish despite looking stylistically correct
- − Lacks the 'vibrant accents' requested, sticking to a strict black and white theme
Wan 2.6
- + Successfully incorporates vibrant color accents as requested
- + Better text legibility overall
- + Clearly defined sections for Appetizers, Pizza, and Mains as per the prompt
- − The grid layout is slightly messy with uneven borders and clipping
- − Food photography is a bit less consistent in lighting and angle compared to Model A
Verdict: Wan 2.6 adhered better to the prompt's specific request for sections (Appetizers/Pizza/Mains) and vibrant accents, whereas Qwen Image 2512 combined sections into a 'Pizza/Means' header and used a purely monochrome UI. While Qwen produced more consistent food photography, Wan 2.6's design feels more like a complete, colorful casual dining menu.
Chalkboard Menu
Text-to-Image“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”
AI Judge Analysis
Qwen Image 2512
- + Excellent text legibility and spelling consistency
- + Accurately captures the elegant cursive request for the title while maintaining a consistent style throughout
- + Superior chalk texture and smudging realism on the board surface
- − Slightly mispelled 'Risotto' as 'Risitto'
- − The handwriting looks somewhat digitally clean compared to the rougher chalk request
Wan 2.6
- + Features a very authentic, crumbly chalk texture with visible dust
- + Successfully rendered all three menu items clearly
- + The 'handwritten' feel is more organic with natural grit
- − Redundant price rendering for the first two items ($24 and $28 are written twice)
- − The 'TODAY's Specials' title uses a mix of cases that wasn't specifically requested
- − The perspective of the board is slightly more distorted than Model A
Verdict: Qwen Image 2512 provides a much cleaner and more professional-looking menu with superior spelling accuracy, despite a minor typo in 'Risotto'. Wan 2.1 captures the gritty, dusty texture of chalk more realistically but suffers from distracting logic errors, such as repeating the prices twice for the same line item.
Isometric Miniature Diorama Scenes
Text-to-Image“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”
AI Judge Analysis
Qwen Image 2512
- + Excellent text styling with a 3D-effect border
- + Rich detail in the sushi textures and vegetable garnishes
- + Better adherence to the 'small diorama' request with complex base layering
- − The flag icon is slightly off-center relative to the text
Wan 2.6
- + Very clean, minimalist aesthetic
- + Perfectly aligned typography
- + High-quality realistic PBR textures on the fish and wood
- − The sushi models are slightly floating or poorly integrated with the rice textures
- − The flag icon is placed to the left rather than 'below' the primary text as implied by the hierarchy
Verdict: Qwen Image 2512 better captures the 'miniature 3D cartoon scene' via more elaborate diorama details and stylized typography. Wan 2.6 is technically very clean with great light behavior, but it feels slightly more like a generic product render than a curated isometric diorama.
Vintage Cafe Logo
Text-to-Image“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”
AI Judge Analysis
Qwen Image 2512
- + Perfect text rendering for both the name and the banner
- + High-quality illustrative detail with a classic emblem feel
- + Excellent use of the requested color palette and textures
- − Less 'minimalist' than requested, leaning more into detailed illustration
- − The steam is a bit oversized compared to the dome
Wan 2.6
- + Strictly follows the 'minimalist' and 'vector' keywords
- + Clean, simple composition suitable for a modern logo
- + Accurate representation of a cloche dome
- − The banner is very small and awkwardly placed
- − The background texture is limited to the edges
- − Lacks the 'vintage' richness requested
Verdict: Qwen Image 2512 produces a much more professional and aesthetically pleasing result that captures the 'vintage' and 'classic' requirements perfectly, even if it is less minimalist than requested. Wan 2.6 captures the minimalist vector style well, but the banner is poorly integrated and the overall design feels a bit generic.
Qwen Image 2512
Improved version of Alibaba's Qwen image model with better text rendering, finer natural textures, and more realistic human generation.
Wan 2.6
Alibaba's multimodal generation model from the Wan AI suite, supporting text-to-video, image-to-video, reference-to-video with audio, and text-to-image, in both Chinese and English