“Photorealistic scene inside a yellow New York taxi at night. A capybara is driving, wearing a yellow taxi driver cap and a dark jacket. It has a calm, professional expression and both front paws on the steering wheel. In the back seat sits a human businesswoman in a coat, looking at her phone with a completely normal, bored expression (as if this is just another normal ride). Through the windows you can see the streets of Manhattan at night with blurred lights. Realistic taxi interior, photorealistic, detailed fur and fabric, 35mm lens, night lighting with reflections, shallow depth of field.”
OpenAI's previous-generation image model, offering higher quality than DALL-E 2 and support for larger resolutions.
DALL-E 3 Benchmarks
DALL-E 3 is ranked #35 in Text-to-Image with an Elo of 1178 on the Lumenfall Arena, where real users pick the better image in blind comparisons. This ranking is based on 3 blind-vote competitions.
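For readers unfamiliar with Elo, the sketch below shows how a rating of this kind is typically derived from pairwise votes. It uses the textbook Elo formula; the arena's actual scoring method, K-factor, and any tie handling are not stated on this page, so the function names, `k=32.0`, and the 1300-rated opponent are illustrative assumptions only.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Textbook Elo: probability that A is preferred over B."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update(rating_a: float, rating_b: float, score_a: float, k: float = 32.0):
    """Return new (A, B) ratings after one blind vote.

    score_a is 1.0 if A's image was picked, 0.0 if B's was picked.
    k is an assumed K-factor, not a value taken from the arena.
    """
    exp_a = expected_score(rating_a, rating_b)
    new_a = rating_a + k * (score_a - exp_a)
    new_b = rating_b - k * (score_a - exp_a)  # zero-sum: B loses what A gains
    return new_a, new_b


if __name__ == "__main__":
    # Hypothetical matchup: a 1178-rated model vs. a 1300-rated model.
    print(expected_score(1178, 1300))
    print(update(1178, 1300, score_a=1.0))
```

Under these assumptions, a model rated 1178 would be expected to win about a third of its votes against a model rated 1300, and an upset win moves its rating up by more than a win against an equally rated model would.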
Text-to-Image Landscape
Charts: Elo vs Cost, Elo vs Speed
Competition Results
Uncategorized
“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”
“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”
Top Matchups
See how DALL-E 3 performs head-to-head against other AI models, ranked by community votes in blind comparisons.