DALL-E 2 OpenAI FLUX.2 [dev] Flash fal

Settled by community votes across 9 shared challenges, with an AI judge weighing in on each.

DALL-E 2

17.7 arena score

#37 of 44 in Text-to-Image

Skill signature

Not enough comparable category data

The chart appears once both models have ratings across at least three shared arena categories.

FLUX.2 [dev] Flash

26.8 arena score

#5 of 44 in Text-to-Image

Vote tally

Where the votes landed

DALL-E 2

33.3%

win rate

Ties

0.0%

FLUX.2 [dev] Flash

66.7%

win rate

33.3% 0.0% ties 66.7%

Shared challenges 9

Challenge by challenge

The strongest take from each model on every shared challenge, with the AI judge's read.

Geometric Composition

Text-to-Image

“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”

DALL-E 2

FLUX.2 [dev] Flash

0% wins 0% ties 100% wins

AI Judge Analysis

DALL-E 2

+ Features a small cube on a wooden surface with a reflection.

− Failed almost all spatial and color logic of the prompt.
− The 'red book' became a red fill inside the cube.
− The 'small blue sphere' became a giant blue pot in the background.
− The plant is green but incorrectly placed and hazy.

FLUX.2 [dev] Flash

+ Excellent prompt adherence, following all spatial instructions perfectly.
+ High photographic realism with convincing glass refractions and soft lighting.
+ Accurate color mapping for every object specified in the prompt.

− The plant's leaves appearing inside the glass look a bit sharp, though consistent with prompt requirements.

Verdict: DALL-E 2 completely failed the complex spatial relationships, merging the colors and objects into a confusing composition where the 'blue sphere' is a large background pot and the 'red book' is internal to the cube. In contrast, FLUX.2 [dev] Flash followed every detail of the prompt perfectly, placing the blue sphere inside the cube, the red book on top, and the plant behind, all while maintaining high visual fidelity and lighting consistency.

Candid Street Photography

Text-to-Image

“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”

DALL-E 2

FLUX.2 [dev] Flash

0% wins 0% ties 100% wins

AI Judge Analysis

DALL-E 2

+ Captures a sense of 'imperfect framing' and 'candid' spontaneity
+ Strong emphasis on wet pavement reflections

− Extreme blur obscures the main subject (the man and the face)
− Fails to show a clear Japanese man or natural skin texture
− Low overall image resolution and clarity

FLUX.2 [dev] Flash

+ Excellent adherence to all prompt details including age, ethnicity, and movement
+ Realistic skin textures and clothing details
+ Effective use of motion blur on cars and shallow depth of field

− The bike's mechanical parts (chain and frame) have some slight structural inconsistencies
− Framing is perhaps too 'perfect' despite the prompt request for imperfect framing

Verdict: DALL-E 2 produced an abstract and heavily blurred image that fails to showcase the requested subject clearly. In contrast, FLUX.2 [dev] Flash followed the complex prompt precisely, delivering a high-quality, cinematic image with realistic textures, appropriate rain effects, and a clear narrative.

Fantasy Warrior

Text-to-Image

“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”

DALL-E 2

FLUX.2 [dev] Flash

AI Judge Analysis

DALL-E 2

+ Features a very shallow depth of field as requested
+ Texture on the metal appears weathered and gritty

− Anatomical failure where the face and helmet blend into a chaotic mess
− Low resolution with significant noise and lack of detail
− Fails to show braided hair, beads, or lifelike eyes

FLUX.2 [dev] Flash

+ Excellent adherence to all prompt details including braided hair with beads and ornate engravings
+ Photorealistic rendering of skin textures, scars, and leather straps
+ Effective composition with warm lighting and bokeh sparks

− The scars appear more like fresh bloody cuts rather than 'faint' scars
− Symmetry in the armor is slightly too perfect for a 'battle-worn' look

Verdict: FLUX.2 [dev] Flash is the clear winner, meticulously following every detail of the prompt from the beaded braids to the leather textures. DALL-E 2 fails significantly, producing an abstract and distorted image that does not clearly depict a human face or the requested accessories.

Chalkboard Menu

Text-to-Image

“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”

DALL-E 2

FLUX.2 [dev] Flash

AI Judge Analysis

DALL-E 2

+ Successfully captures a chalky texture on a dark background.

− Text is completely illegible gibberish.
− Fails to include the specific requested dates and menu items.
− Composition is crowded and lacks the requested café setting.

FLUX.2 [dev] Flash

+ Perfect text rendering with 100% accuracy to the prompt, including the specific date and menu prices.
+ Authentic chalk handwriting style with realistic textures and smudges.
+ Excellent composition showing a cozy café background that adds context.

− The pricing for the cookies is slightly repetitive ($9 listed twice), though it remains legible.

Verdict: FLUX.2 [dev] Flash is the clear winner, demonstrating near-perfect prompt adherence by rendering every requested word and number accurately in a realistic handwritten chalk style. In contrast, DALL-E 2 fails completely on the text generation task, producing illegible symbols that do not follow the prompt's content.

The Capybara Taxi Driver

Text-to-Image

“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”

DALL-E 2

FLUX.2 [dev] Flash

AI Judge Analysis

DALL-E 2

− Failed completely to follow the prompt.
− Image shows a black leather handbag instead of a taxi scene.
− Distorted artifacts visible on the surface of the bag.

FLUX.2 [dev] Flash

+ Excellent adherence to all prompt details including the capybara's clothing and the businesswoman's expression.
+ High visual quality with realistic lighting and depth of field.
+ Accurately rendered hands and paws on the steering wheel and phone.

− Minor text artifacts on the taxi cap and roof sign.
− The capybara's scale is slightly large for a driver's seat.

Verdict: DALL-E 2 failed the task entirely, producing a random image of a black handbag. FLUX.2 [dev] Flash followed the complex prompt perfectly, creating a high-quality, humorous, and photorealistic scene that captures the specific characters and atmosphere requested.

The Halloween Invitation

Text-to-Image

“Vintage gothic Halloween party invitation. Dark parchment poster, spooky border with webs and thorns, central glowing jack-o-lantern, bats, twisted trees, moody night sky. Add elegant gothic title text saying "Halloween Party Invitation", a small scroll banner saying "You are invited to a night of frights", and event details at the bottom: Date: 30.10.2026 Time: 7pm Location: The Arches, NYC Spooky but polished, cinematic lighting, square format.”

DALL-E 2

FLUX.2 [dev] Flash

100% wins 0% ties 0% wins

AI Judge Analysis

DALL-E 2

+ Captures a vintage, hand-painted aesthetic
+ Good color palette for a retro poster

− Text is largely illegible gibberish
− Lacks key requested elements like the glowing jack-o-lantern
− Image quality is low with heavy artifacts

FLUX.2 [dev] Flash

+ Excellent text rendering, following all specific copy requirements
+ Perfect adherence to prompt details including thorns, webs, and specific event data
+ Polished and cinematic visual quality

− The 'Huck' text before the date is extraneous
− Slightly less 'vintage' and more modern-digital in its rendering style

Verdict: FLUX.2 [dev] Flash followed the instructions nearly perfectly, rendering almost all the requested text accurately and including all specified visual elements like the thorns and jack-o-lantern. DALL-E 2 failed to produce legible text or the central subject of the prompt, resulting in a low-resolution and incomplete image.

Isometric Miniature Diorama Scenes

Text-to-Image

“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”

DALL-E 2

FLUX.2 [dev] Flash

AI Judge Analysis

DALL-E 2

+ Matches the 45° isometric perspective requested
+ High-contrast lighting matches a cartoonish aesthetic

− Failed to render all requested text, spelling 'Sush' instead of 'SUSHI'
− Failed 'JAPAN' text and flag icon entirely
− Visual quality is low with melting, indistinct shapes

FLUX.2 [dev] Flash

+ Perfect adherence to text and icon requirements
+ High-quality PBR materials with realistic textures for rice and wood
+ Excellent diorama composition and clean miniature aesthetic

− Perspective is slightly lower than the requested 45-degree top-down angle

Verdict: FLUX.2 [dev] Flash significantly outperformed DALL-E 2 by following every element of the complex prompt, including specific text and symbols. While DALL-E 2 struggled with spelling and object coherence, FLUX.2 [dev] Flash delivered a high-fidelity 3D miniature with professional lighting and materials.

Adorable Baby Animals in Sunny Meadow

Text-to-Image

“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”

DALL-E 2

FLUX.2 [dev] Flash

AI Judge Analysis

DALL-E 2

+ Successfully captures a dynamic sense of movement and playfulness in the grass.

− Very low visual quality with severe AI artifacts and distorted anatomy.
− Failed to clearly include all requested animals in a recognizable way.
− Lighting is harsh and flat rather than the requested golden sunrise with god rays.

FLUX.2 [dev] Flash

+ Exceptional level of detail in the fur, eyes, and environment.
+ Perfect adherence to the prompt including all species and atmospheric lighting effects.
+ Clear, high-resolution composition with beautiful bokeh and dew drops.

− The posing is more of a group portrait than the requested 'tumbling' action.

Verdict: FLUX.2 [dev] Flash significantly outperforms DALL-E 2 in every technical category, delivering a high-definition, professional-grade image that perfectly captures the complex prompt. While DALL-E 2 shows more 'tumbling' motion, it suffers from catastrophic failure in rendering the animals' faces and the overall image clarity.

Apollo 11: Journey to Tranquility

Text-to-Image

“Create a clean, modern vector infographic poster about the Apollo 11 mission. NASA-inspired palette (navy, white, muted red, light gray). Flat-vector style, crisp lines, consistent iconography, subtle gradients only. Steps (stop at landing): 1. Launch (Saturn Vicon) 2. Earth Orbit (Earth + orbit ring icon) 3. Translunar (trajectory arc icon) 4. Lunar Orbit (Moon + orbit ring icon) 5. Descent (lunar module descending icon) 6. Landing (lunar module on the surface icon) Small supporting elements (minimal text): • Crew strip: three silhouette icons with only last names: Armstrong, Aldrin, Collins. • Landing site marker: Moon pin labeled "Tranquility" only. Layout constraints: generous margins, large readable labels, clean background with subtle stars. Vector-only, print-poster look, high resolution.”

DALL-E 2

FLUX.2 [dev] Flash

AI Judge Analysis

DALL-E 2

+ Matches the navy and red NASA-inspired color palette.
+ Includes high-contrast vector-like elements.

− Text is completely illegible gibberish.
− Fails to follow the requested 6-step chronological sequence.
− Composition is cluttered and chaotic rather than a clean infographic.

FLUX.2 [dev] Flash

+ Excellent adherence to all six requested steps and specific iconography.
+ Clearly legible text including correct names and technical terms.
+ Clean, modern flat-vector design that feels like a professional infographic.

− Small minor spelling artifacts in some labels like 'MECN'.
− The Saturn V rocket illustration includes extraneous boosters not found on the real Apollo 11 vehicle.

Verdict: FLUX.2 [dev] Flash accurately followed the complex multi-step prompt, producing a logical infographic with legible text and matching icons for each phase of the mission. DALL-E 2 failed significantly, producing garbled text and an abstract layout that does not resemble an infographic or the Apollo mission steps.