OpenAI's state-of-the-art image generation model with better instruction following and adherence to prompts
Settled by community votes across one shared challenge, with an AI judge weighing in.
GPT Image 1.5
#7 of 44 in Text-to-Image
HiDream I1 Fast
#38 of 44 in Text-to-Image
Where the votes landed
GPT Image 1.5: 100.0% win rate
Ties: 0.0%
HiDream I1 Fast: 0.0% win rate
Challenge by challenge
The strongest take from each model on every shared challenge, with the AI judge's read.
Fantasy Warrior
Text-to-Image: “Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”
AI Judge Analysis
GPT Image 1.5
- + Exceptional photographic realism with lifelike skin texture and eyes.
- + Highly detailed engraving on the plate armor and visible texture on the leather straps.
- + Perfectly executed warm torchlight lighting and atmospheric bokeh sparks.
- − The braided hair with beads is slightly less prominent than the rest of the armor details.
HiDream I1 Fast
- + Strong emphasis on the braided hair with colorful beads as requested.
- + Clear, ornate engraving and good contrast in the lighting.
- + Good adherence to the 'battle-worn' prompt with visible facial scarring.
- − Skin texture looks overly smooth and synthesized compared to a real photograph.
- − The armor looks somewhat cleaner and more 'costume-like' rather than functional battle-worn metal.
- − Lighting on the face feels a bit flat compared to the dramatic lighting on the armor.
Verdict: GPT Image 1.5 is the clear winner due to its superior photorealism and textural detail; the rendering of the skin, the micro-scratches on the armor, and the atmospheric lighting are professional-grade. While HiDream I1 Fast rendered the requested beads in the hair more vividly, its overall image reads more like a high-quality 3D render than the lifelike portrait achieved by GPT Image 1.5.
Explore each model
Distilled version of HiDream AI's 17B parameter text-to-image model