“Give the person a full, thick head of natural hair with realistic texture, density, and a natural hairline. Preserve facial features and lighting.”
An image generation model by xAI designed to generate highly aesthetic images from text descriptions.
Grok Imagine Image Benchmarks
Grok Imagine Image currently holds an Elo rating of 1227 for text-to-image generation and an Elo of 1200 for image editing. It ranks 11th globally for image editing capabilities and 19th for general image generation performance.
Image Editing Landscape
Elo vs Cost
Elo vs Speed
Competition Results
“Make a photo of the man driving the car down the California coastline”
“Add dynamic motion to this photo: make hair blow in the wind, add leaves flying, energetic and lively feel.”
“Use Image 1 as the base person. Dress them in the exact elaborate outfit from Image 2 (including all layers, accessories, jewelry, and shoes). Carefully adapt the clothing to the body shape and pose in Image 1 while maintaining realistic fabric behavior, correct proportions, and perfect lighting/shadow matching. Keep the person’s exact face, hair, and background completely unchanged.”
{
"action": "image_edit",
"reference": "uploaded neutral portrait",
"change": "Warm genuine Duchenne smile: lips curved up, slight natural teeth, soft eye crinkles, subtle cheek raise",
"details": "Realistic smiling skin (dimples if present, soft cheek shadows), slightly brighter eyes; keep exact eye shape/color/iris",
"preserve_exact": "Face identity/structure, eyes/nose/lips/eyebrows, hair, skin texture/pores/freckles, makeup, clothing, head pose, background, lighting, shadows, framing",
"no_changes": "No face shape change, no new features, no gaze shift, no hair/clothing/lighting/background edits",
"style": "Ultra-photorealistic 8K portrait, sharp face focus, natural soft lighting, realistic skin glow"
}
“Change the scene to night: a deep, dark sky with subtle, glistening stars visible behind the mountain.”
“Transform this photo into a Studio Ghibli–inspired illustration. Use soft pastel colors, hand-painted textures, gentle lighting, dreamy backgrounds, and a warm, nostalgic mood”
“Give the person a full, thick head of natural hair with realistic texture, density, and a natural hairline. Preserve facial features and lighting.”
{
"action": "image_edit",
"reference": "uploaded neutral portrait",
"change": "Warm genuine Duchenne smile: lips curved up, slight natural teeth, soft eye crinkles, subtle cheek raise",
"details": "Realistic smiling skin (dimples if present, soft cheek shadows), slightly brighter eyes; keep exact eye shape/color/iris",
"preserve_exact": "Face identity/structure, eyes/nose/lips/eyebrows, hair, skin texture/pores/freckles, makeup, clothing, head pose, background, lighting, shadows, framing",
"no_changes": "No face shape change, no new features, no gaze shift, no hair/clothing/lighting/background edits",
"style": "Ultra-photorealistic 8K portrait, sharp face focus, natural soft lighting, realistic skin glow"
}
Uncategorized
“Create a caricature of me and my job. Make it exaggerated and humorous, incorporating my profession as a tv show anchor and my love for dogs and hockey.”
Top Matchups
See how Grok Imagine Image performs head-to-head against other AI models, ranked by community votes in blind comparisons.
vs Wan 2.6
Challenge: Golden Hour Stroll
0% W · 100% L
vs FLUX.2 [max]
Challenge: Studio Ghibli Anime Style
75% W · 25% L
vs Nano Banana
Challenge: Bald man challenge
0% W · 67% L · 33% T
vs GPT Image 1.5
Challenge: Vintage Cafe Logo
100% W · 0% L
vs GPT Image 1.5
Challenge: Modern Clean Menu
50% W · 50% L