Arena / Challenges

Text-to-Image challenges

Every text-to-image challenge in the arena, scored with TrueSkill as the votes come in. Filter by skill to narrow it down.

Text-to-Image Image Editing Text-to-Video Image Upscaling Image-to-Video Text-to-Vector

Geometric Composition

“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, parti...”

Portrait

Fantasy Warrior

“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torch...”

Isometric Miniature Diorama Scenes

“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR mater...”

Text Rendering

Modern Clean Menu

“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif ...”

Text Rendering Product, Branding & Commercial

Vintage Cafe Logo

“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cre...”

Adorable Baby Animals in Sunny Meadow

“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ...”

Photorealism

Candid Street Photography

“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm l...”

Text Rendering

Apollo 11: Journey to Tranquility

“Create a clean, modern vector infographic poster about the Apollo 11 mission. NASA-inspired palette (navy, white, muted red, light gray). Flat-vect...”

Text Rendering Photorealism

Magic Burger Explosion: Fiery Photorealism Challenge

This prompt forces models to simultaneously nail a highly specific, multi-layered commercial scene: dynamic exploded composition with multiple flying food elements, photorealistic textures, dramatic fiery lighting with embers, and precisely integrated glowing text, all while keeping strong visual impact. It is a perfect stress test that quickly separates models with true prompt mastery and creative control from those that miss details, break physics, or produce generic results.

Photorealism

The Capybara Taxi Driver

This challenge seems to be difficult for models because it mixes reality with fiction. Most models struggle to keep the taxi realistic or loose instructions like placing the passenger not in the backseat.

Text Rendering Photorealism

Chalkboard Menu

This challenge forces models to use one consistent handwritten style across an entire dense menu instead of defaulting to clean printed text for the smaller details, a very common failure that reveals how well they actually understand and maintain stylistic coherence.

Art Photorealism

The Reversed Rodeo

This competition tests how well AI image models truly understand language versus how much they rely on visual habits from their training data. The prompt is deliberately simple on the surface but devilishly hard in practice. Most models default to the familiar trope of an astronaut riding a horse. By forcing the reversal, we measure three critical capabilities that separate good models from great ones: Strict instruction following (including negations) Accurate subject-object relationships and spatial hierarchy Resistance to strong dataset biases

Load more challenges