ImagineArt 1.5 (Preview) Vyro AI Stable Diffusion 3.5 Large Stability AI

Settled by community votes across 7 shared challenges, with an AI judge weighing in on each.

ImagineArt 1.5 (Preview)

26.8 arena score

#6 of 44 in Text-to-Image

Skill signature

Not enough comparable category data

The chart appears once both models have ratings across at least three shared arena categories.

Stable Diffusion 3.5 Large

22.9 arena score

#25 of 44 in Text-to-Image

Vote tally

Where the votes landed

ImagineArt 1.5 (Preview)

83.3%

win rate

Ties

0.0%

Stable Diffusion 3.5 Large

16.7%

win rate

83.3% 0.0% ties 16.7%

Shared challenges 7

Challenge by challenge

The strongest take from each model on every shared challenge, with the AI judge's read.

Geometric Composition

Text-to-Image

“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”

ImagineArt 1.5 (Preview)

Stable Diffusion 3.5 Large

100% wins 0% ties 0% wins

AI Judge Analysis

ImagineArt 1.5 (Preview)

+ Excellent photorealism and texture detail on the wooden table and book cover.
+ Accurate glass refraction and reflections.
+ Correct placement of the green plant behind the cube.

− Failed the spatial instruction 'red book sits on top of the cube' by placing the cube on the book.
− The lighting direction is somewhat diffused rather than strongly from the left as requested.

Stable Diffusion 3.5 Large

+ Followed the complex spatial instruction: 'On top of the cube sits a red book'.
+ High clarity and clean geometric lines for the glass display case.
+ Light source clearly enters from the left as requested.

− The green plant is predominantly above/in front rather than 'behind' and visible through the glass.
− The small blue sphere is floating without a clear physical support inside the cube.

Verdict: Stable Diffusion 3.5 Large is the winner primarily because it successfully managed the difficult spatial prompt of placing the book on top of the cube, whereas ImagineArt 1.5 (Preview) defaulted to the more common arrangement of putting the object on the book. While ImagineArt 1.5 (Preview) has slightly better photographic textures and plant placement, Stable Diffusion 3.5 Large followed more of the specific prompt details overall.

Candid Street Photography

Text-to-Image

“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”

ImagineArt 1.5 (Preview)

Stable Diffusion 3.5 Large

80% wins 0% ties 20% wins

AI Judge Analysis

ImagineArt 1.5 (Preview)

+ Excellent close-up detail of skin texture and the bicycle's mechanical components.
+ Convincing rain effects with realistic water droplets on the bike frame and ripples in puddles.
+ Successful 'imperfect framing' requested in the prompt, creating a candid street photography feel.

− The passing car lacks the requested motion blur, appearing frozen in time.
− Anatomical issues with the hands, specifically the fingers merging with the tool.

Stable Diffusion 3.5 Large

+ Stronger cinematic composition and use of atmospheric depth.
+ Good representation of the wet pavement reflections and rainy environment.
+ Better depiction of the background vehicle context.

− The bicycle frame geometry is physically impossible, with several disconnected or overlapping tubes.
− Rain streaks look like a static filter overlay rather than interacting with the 3D space.
− Missing the specific fine skin texture and candid closeness requested for a 50mm lens shot.

Verdict: ImagineArt 1.5 (Preview) captures the requested 'candid street photo' aesthetic and 'natural skin texture' much more effectively than its competitor, with impressive detail on the wet bicycle. While Stable Diffusion 3.5 Large has a nice cinematic atmosphere, the bicycle's broken anatomy and the less realistic rain rendering make it the weaker choice for this specific prompt.

Fantasy Warrior

Text-to-Image

“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”

ImagineArt 1.5 (Preview)

Stable Diffusion 3.5 Large

100% wins 0% ties 0% wins

AI Judge Analysis

ImagineArt 1.5 (Preview)

+ Excellent depiction of warm torchlight with realistic highlights on the skin and armor.
+ Perfect adherence to the 'beads in hair' prompt with clearly visible details.
+ Stronger 'battle-worn' appearance with visible dirt, sweat, and facial character.

− The torch flame is a bit literal and distracting compared to the more cinematic atmosphere of Model B.
− The leather strap shows some minor texture blurring.

Stable Diffusion 3.5 Large

+ Exquisite engraving detail on the plate armor that looks very ornate.
+ High-quality skin texture and very piercing, lifelike eyes.
+ Great bokeh effect with an epic sense of scale in the background.

− Failed to include the requested small beads in the hair braids.
− The lighting feels more like daylight than the requested 'warm torchlight'.

Verdict: ImagineArt 1.5 followed the specific prompt details much better, including the hair beads and the distinct warm torchlight atmosphere. While Stable Diffusion 3.5 Large produced beautiful armor engravings and high-fidelity skin textures, it missed the bead requirement and the lighting looks primarily like cool daylight.

Modern Clean Menu

Text-to-Image

“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”

ImagineArt 1.5 (Preview)

Stable Diffusion 3.5 Large

100% wins 0% ties 0% wins

AI Judge Analysis

ImagineArt 1.5 (Preview)

+ Excellent realistic presentation as a trifold menu mockup.
+ High-quality, vibrant food photography that feels professional.
+ Clear and coherent categorization with 'APPATIZERS', 'PIZZA', and 'MAINS' headers.

− The perspective view makes some text hard to read as a flat design reference.
− Spelling errors in main headers (e.g., 'APPATIZERS').

Stable Diffusion 3.5 Large

+ Perfectly adheres to the 'grid' layout requested in the prompt.
+ Strong typography for the main 'Menu' title and section headers.
+ Flat, front-facing orientation is ideal for graphic design evaluation.

− Significant spelling errors in every section (e.g., 'MAIMAES', 'APPETIZRS', 'PIZETZA').
− The repetitive pizza images lack the variety usually found in a professional menu.

Verdict: ImagineArt 1.5 (Preview) produces a more visually appealing and realistic mockup that captures the 'casual dining' vibe through professional-looking food photography. Stable Diffusion 3.5 Large adheres better to the 'grid' layout requirement but suffers from severe spelling errors and repetitive imagery, making ImagineArt 1.5 the more useful output for a design concept.

Isometric Miniature Diorama Scenes

Text-to-Image

“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”

ImagineArt 1.5 (Preview)

Stable Diffusion 3.5 Large

67% wins 0% ties 33% wins

AI Judge Analysis

ImagineArt 1.5 (Preview)

+ Excellent 3D floating typography with 45° isometric perspective
+ Highly realistic textures on the fish and rice grains
+ Perfectly maintains the minimalist 'diorama on a base' aesthetic requested

− The word 'SUSHIN' has a typo at the end
− The flag icon is very small and integrated into a letter rather than standing alone

Stable Diffusion 3.5 Large

+ Perfect text rendering of 'JAPAN' and 'SUSHI'
+ Includes a clear flag icon as requested
+ Strong 3D toy-like cartoon aesthetic

− Failed the 'top-center' text placement instruction by putting it on a small sign
− The scene is busier than the requested 'minimal garnish'
− Perspective is more of a front-angle than a 45° top-down isometric view

Verdict: ImagineArt 1.5 (Preview) followed the composition, perspective, and minimalist diorama instructions much more closely, resulting in a cleaner professional render, despite a small typo in the word 'SUSHI'. Stable Diffusion 3.5 Large managed the text and flag perfectly but failed to place the text at the top-center and chose a much more cluttered, less isometric layout.

Vintage Cafe Logo

Text-to-Image

“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”

ImagineArt 1.5 (Preview)

Stable Diffusion 3.5 Large

100% wins 0% ties 0% wins

AI Judge Analysis

ImagineArt 1.5 (Preview)

+ Perfect text rendering for both the name and the date.
+ Clean, professional circular emblem composition.
+ Accurate interpretation of 'vector emblem style'.

− The steam is positioned under the cloche lid but above the plate in a slightly floating way.
− The 'Est. 1720' is in a tab rather than a flowing banner.

Stable Diffusion 3.5 Large

+ Excellent 'vintage minimalist' aesthetic with great subtle background texture.
+ Features a classic banner as requested.
+ Includes steam both inside and coming out of the top of the cloche.

− Spelling error in the main name ('Cafféé' instead of 'Caffè').
− The cloche illustration is a bit clunky with an odd handle shape inside.

Verdict: ImagineArt 1.5 (Preview) produced a much more usable logo with perfect typography and a cohesive circular design. While Stable Diffusion 3.5 Large captured the vintage texture and banner request better, it failed on the basic requirement of spelling the name 'Caffè' correctly.

Apollo 11: Journey to Tranquility

Text-to-Image

“Create a clean, modern vector infographic poster about the Apollo 11 mission. NASA-inspired palette (navy, white, muted red, light gray). Flat-vector style, crisp lines, consistent iconography, subtle gradients only. Steps (stop at landing): 1. Launch (Saturn Vicon) 2. Earth Orbit (Earth + orbit ring icon) 3. Translunar (trajectory arc icon) 4. Lunar Orbit (Moon + orbit ring icon) 5. Descent (lunar module descending icon) 6. Landing (lunar module on the surface icon) Small supporting elements (minimal text): • Crew strip: three silhouette icons with only last names: Armstrong, Aldrin, Collins. • Landing site marker: Moon pin labeled "Tranquility" only. Layout constraints: generous margins, large readable labels, clean background with subtle stars. Vector-only, print-poster look, high resolution.”

ImagineArt 1.5 (Preview)

Stable Diffusion 3.5 Large

0% wins 0% ties 100% wins

AI Judge Analysis

ImagineArt 1.5 (Preview)

+ Successfully followed the sequential infographic steps from launch to landing.
+ Relatively legible text for major headers like 'LAUNCH' and 'LANDING'.
+ Pleasing composition with a clear narrative flow across the page.

− Text includes spelling errors like 'TRANCLUTAL' and 'ALERIN'.
− Icons at the bottom showing 5 crew members are inaccurate to the 3-man Apollo 11 mission.
− The Saturn V icon is a bit generic and the landing module is duplicated strangely.

Stable Diffusion 3.5 Large

+ Highly detailed vector aesthetic with a sophisticated NASA-inspired color palette.
+ Includes a visually impressive, large Moon surface and detailed Earth icons.

− Completely failed to follow the logical 6-step infographic sequence requested.
− Text is mostly illegible gibberish or 'lorem ipsum' style characters.
− The spacecraft depicted is a mixture of a Space Shuttle and a rocket, which is historically inaccurate for Apollo 11.

Verdict: ImagineArt 1.5 (Preview) followed the prompt's structural requirements much better, creating a logical 6-step journey from Earth to the Moon with semi-legible labels. Stable Diffusion 3.5 Large produced a more visually complex 'blueprint' style image, but it ignored the specific step-by-step instructions and failed to render readable text or accurate mission icons.

Next steps

Explore each model

ImagineArt 1.5 (Preview)

Vyro AI

Vyro AI's professional-grade text-to-image model delivering photorealistic output with accurate text rendering and typography precision for commercial workflows

Vote this model in the arena

Arena profile Lumenfall catalog

Stable Diffusion 3.5 Large

Stability AI

Stability AI's 8.1-billion parameter Multimodal Diffusion Transformer (MMDiT) text-to-image model featuring improved image quality, typography, complex prompt understanding, and resource-efficiency

Vote this model in the arena

Arena profile Lumenfall catalog