OpenAI's previous generation image model with higher quality than DALL-E 2 and support for larger resolutions
Settled by community votes across 9 shared challenges, with an AI judge weighing in on each.
DALL-E 3
#35 of 44 in Text-to-Image
Not enough comparable category data
The chart appears once both models have ratings across at least three shared arena categories.
FLUX.2 [dev] Flash
#5 of 44 in Text-to-Image
Where the votes landed
DALL-E 3
0.0%
win rate
Ties
0.0%
FLUX.2 [dev] Flash
100.0%
win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Geometric Composition
Text-to-Image“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”
AI Judge Analysis
DALL-E 3
- + High artistic quality and sharp lighting effects
- + Excellent wooden texture and detail
- − Failed spatial logic by putting the book inside and the plant outside the glass structure
- − Ignored the 'red book on top' instruction
- − Added an unnecessary wooden frame around the glass cube
FLUX.2 [dev] Flash
- + Perfect adherence to all spatial instructions and object placements
- + More realistic, photographic style
- + Correctly interpreted 'on top' and 'inside' exactly as prompted
- − The glass cube is slightly skewed in perspective compared to the table
- − The sphere is centered but lacks the complex refraction seen in the other model
Verdict: FLUX.2 [dev] Flash followed the complex spatial instructions perfectly, placing every object exactly where it was requested in the prompt. DALL-E 3 produced a more stylized and visually striking image, but it failed basic prompt adherence by placing the red book inside the cube and the plant behind the scene rather than visible through the glass.
Candid Street Photography
Text-to-Image“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”
AI Judge Analysis
DALL-E 3
- + Excellent composition with an artistic 'through-the-bike' framing.
- + Strong reflections on the wet pavement and moody, cinematic lighting.
- + Includes a clear reflection of the man in the puddle.
- − Anatomical issues with the man's feet and hands.
- − The man appears to have a strange growth or ear deformity.
- − The car in the background is static despite the motion blur request.
FLUX.2 [dev] Flash
- + Highly realistic skin textures and realistic clothing details.
- + Accurately depicts motion blur on the passing cars as requested.
- + The bicycle mechanics and tools are more grounded in reality.
- − The 'imperfect framing' is missing, opting for a standard centered shot.
- − Rain effects are barely visible in the air, feeling more like a post-rain scene.
Verdict: FLUX.2 [dev] Flash produces a much more realistic and anatomically correct image with excellent skin textures and successful adherence to the motion blur request. DALL-E 3 captures a more cinematic and artistic mood with better reflections, but it suffers from significant anatomical distortions in the subject's feet and head.
Fantasy Warrior
Text-to-Image“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”
AI Judge Analysis
DALL-E 3
- + Ornate engraving on the armor is exceptionally sharp and intricate.
- + Strong cinematic lighting with excellent contrast and 'warm torchlight' atmosphere.
- + High level of facial detail with a convincing battle-worn appearance.
- − The helmet design is slightly fantastical and cluttered.
- − The braiding of the hair is less distinct than Image B.
FLUX.2 [dev] Flash
- + Excellent adherence to the 'braided with small beads' prompt with very clear hair styling.
- + Realistic leather strap textures and cloth underlayers as requested.
- + Balanced composition with visible torches providing context.
- − Skin texture feels slightly flatter and more digital than Image A.
- − The scarring looks more like fresh paint/blood rather than 'faint scars' requested.
Verdict: DALL-E 3 (Image A) produces a more punchy, cinematic portrait with superior armor engravings and lighting, though it takes more liberties with the equipment design. FLUX.2 [dev] Flash (Image B) follows the specific textural instructions for hair, beads, and leather more literally, but the overall image feels slightly more muted. DALL-E 3 is the winner for its superior visual quality and the lifelike depth of the character's expression.
Chalkboard Menu
Text-to-Image“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”
AI Judge Analysis
DALL-E 3
- + Ornate and artistic composition
- + Good representation of a cozy cafe lighting and atmosphere
- + Rich chalk texture
- − Numerous spelling and spelling-like errors (e.g., 'OCCTUS', 'GRILILLED')
- − Handwriting styles are inconsistent and include block lettering despite the prompt requesting a single same style
- − Nonsensical prices like '$234' deviate from the prompt
FLUX.2 [dev] Flash
- + Excellent prompt adherence with nearly perfect spelling of all complex menu items
- + Uniform, realistic chalk handwriting style that looks authentically human
- + Clean composition with natural slanting and texture as requested
- − Simple layout compared to the more artistic flourished of Model A
- − Slight repetition in the final line for the price of cookies
Verdict: FLUX.2 [dev] Flash significantly outperformed DALL-E 3 by accurately rendering all requested text and prices with near-perfect spelling. While DALL-E 3 created a more visually intricate design, it failed on the core task of text legibility and adherence, producing several typos and nonsensical numbers.
The Capybara Taxi Driver
Text-to-Image“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”
AI Judge Analysis
DALL-E 3
- + Excellent interior lighting and detailed fur texture
- + High-quality composition with a dynamic angle
- + The cap and jacket are integrated very naturally
- − Failed to include the passenger in the back seat
- − Perspective shows the driver and the side, not the full interior as requested
FLUX.2 [dev] Flash
- + Successfully followed all instructions including the bored passenger
- + Photorealistic rendering of the capybara's face and hands
- + Perfectly captures the 'completely normal ride' atmosphere
- − The passenger's phone and fingers have minor artifacts
- − The yellow cap is slightly vibrant compared to the overall lighting
Verdict: While DALL-E 3 produced a visually striking and high-detail image of the capybara, it completely missed the instruction to include the businesswoman in the back seat. FLUX.2 [dev] Flash followed the entire prompt perfectly, managing to capture the bored expression of the passenger and the surreal nature of the setting in a single cohesive frame.
The Halloween Invitation
Text-to-Image“Vintage gothic Halloween party invitation. Dark parchment poster, spooky border with webs and thorns, central glowing jack-o-lantern, bats, twisted trees, moody night sky. Add elegant gothic title text saying "Halloween Party Invitation", a small scroll banner saying "You are invited to a night of frights", and event details at the bottom: Date: 30.10.2026 Time: 7pm Location: The Arches, NYC Spooky but polished, cinematic lighting, square format.”
AI Judge Analysis
DALL-E 3
- + Excellent atmospheric lighting and textured parchment feel
- + Highly detailed composition with a sophisticated gothic frame
- − Text rendering is very poor with significant gibberish
- − The 'scroll' requirement is missing or poorly integrated
FLUX.2 [dev] Flash
- + Excellent text rendering and legibility for the invitation details
- + Perfect adherence to all prompt elements including thorns, webs, and scroll banner
- + Clearer representation of the twisted trees and night sky
- − The lighting is a bit flatter compared to the other model
- − The thorns and spiderwebs have a slightly more digital/clean feel rather than vintage
Verdict: While DALL-E 3 creates a more artistically textured and 'vintage' atmosphere, FLUX.2 [dev] Flash is the clear winner for an invitation task because the text is perfectly readable and it accurately includes every requested prompt element. DALL-E 3 fails significantly on the textual requirements, rendering the invitation useless for its intended purpose.
Isometric Miniature Diorama Scenes
Text-to-Image“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”
AI Judge Analysis
DALL-E 3
- + Excellent 3D cartoon art style with vibrant, puffy textures.
- + Clean isometric 45° perspective on a high-quality diorama base.
- + Creative placement of the flag and text as part of the physical scene objects.
- − Failed to place the text 'JAPAN' and 'SUSHI' at the top-center of the image as requested.
- − The 'SUSHI' text is missing entirely.
- − Rice grains look more like small spheres or marshmallows rather than realistic rice.
FLUX.2 [dev] Flash
- + Perfect adherence to text instructions, placing 'JAPAN' and 'SUSHI' at the top-center.
- + Realistic PBR materials with high-fidelity textures on the fish and wood grain.
- + Clean, minimalist composition that feels professional and balanced.
- − The sushi roll construction is slightly unusual for a standard nigiri or maki.
- − The wasabi detail is very simple compared to the texture quality of the fish.
Verdict: FLUX.2 [dev] Flash is the clear winner as it followed every instruction, including the specific text placement and wording which DALL-E 3 failed to include correctly. While DALL-E 3 produced a very charming cartoon aesthetic, it missed the 'SUSHI' text and placed the 'JAPAN' text on the base instead of the top-center as requested.
Adorable Baby Animals in Sunny Meadow
Text-to-Image“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”
AI Judge Analysis
DALL-E 3
- + Excellent expressive lighting with strong 'god rays' effects
- + Cohesive artistic style across all characters
- − Anatomical errors such as butterflies with mammal-like furry faces and ears
- − Lacks photorealism, leaning heavily into a 3D animation or illustrative style
FLUX.2 [dev] Flash
- + Stronger adherence to the 'photorealistic' requirement while maintaining cuteness
- + Detailed fur textures and realistic anatomy for the butterflies
- + Includes fine details like dew drops on the grass
- − Duplicated animals (two bunnies, two foxes) instead of one of each as requested
- − Slightly less dramatic lighting compared to the other model
Verdict: DALL-E 3 produced a whimsical, magical scene but failed on the photorealistic and anatomical accuracy requirements by giving the butterflies animal faces. FLUX.2 [dev] Flash followed the 'photorealistic' instruction much better and provided superior texture detail, although it hallucinated extra foxes and bunnies.
Apollo 11: Journey to Tranquility
Text-to-Image“Create a clean, modern vector infographic poster about the Apollo 11 mission. NASA-inspired palette (navy, white, muted red, light gray). Flat-vector style, crisp lines, consistent iconography, subtle gradients only. Steps (stop at landing): 1. Launch (Saturn Vicon) 2. Earth Orbit (Earth + orbit ring icon) 3. Translunar (trajectory arc icon) 4. Lunar Orbit (Moon + orbit ring icon) 5. Descent (lunar module descending icon) 6. Landing (lunar module on the surface icon) Small supporting elements (minimal text): • Crew strip: three silhouette icons with only last names: Armstrong, Aldrin, Collins. • Landing site marker: Moon pin labeled "Tranquility" only. Layout constraints: generous margins, large readable labels, clean background with subtle stars. Vector-only, print-poster look, high resolution.”
AI Judge Analysis
DALL-E 3
- + Features a highly stylistic and artistic poster layout
- + Accurately follows the NASA-inspired color palette in an abstract way
- + Strong aesthetic appeal for a vintage space enthusiast
- − Fails significantly on step-by-step logic and iconography
- − Includes modern space shuttles which were not part of the Apollo 11 mission
- − Text is illegible and layout is cluttered with repetitive moons
FLUX.2 [dev] Flash
- + Excellent adherence to the requested infographic structure and steps
- + Includes legible and accurate text labeling for mission phases
- + Professional flat-vector icons that match the modern clean style requested
- − Minor text artifacts on some labels (e.g., 'MEON')
- − The 'Translunar' trajectory starts from the landing site instead of earth orbit
Verdict: FLUX.2 [dev] Flash significantly outperformed DALL-E 3 by delivering an actual usable infographic that followed the requested steps and logic. DALL-E 3 produced abstract art with historical inaccuracies like the Space Shuttle, while FLUX.2 [dev] Flash correctly identified the Saturn V, Lunar Module, and specific mission phases with clear text.
Explore each model
Fast distilled version of Black Forest Labs' FLUX.2 [dev] optimized for speed and cost efficiency.