Nano Banana 2 vs Grok Imagine Image

Head-to-head across 17 challenges

Nano Banana 2

58.3%

win rate

Ties

0.0%

Grok Imagine Image

41.7%

win rate

58.3% 0.0% ties 41.7%

Challenge Results

Fantasy Warrior

Text-to-Image

“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”

Nano Banana 2
Grok Imagine Image

AI Judge Analysis

Nano Banana 2

  • + Excellent adherence to 'battle-worn' with realistic dirt and grittiness on the face and hand.
  • + Highly detailed engraving on the plate armor including runes and heraldry.
  • + Superior texture rendering on leather straps, cloth underlayers, and individual hair beads.
  • The torch flame in the background is a bit sharp, slightly reducing the 'shallow depth of field' effect.

Grok Imagine Image

  • + Intricate floral engraving on the armor is visually very appealing.
  • + Strong bokeh effect with nice floating sparks and soft background torches.
  • + Clear application of facial scars and hair beads.
  • The character appears too clean and 'model-like' for a battle-worn warrior.
  • Anatomy error with a braid appearing to emerge directly from the neck/collar area without a clear path from the head.
  • The lighting on the face is a bit flat compared to the dramatic armor highlights.

Verdict: Nano Banana 2 is the clear winner as it perfectly captures the 'battle-worn' aesthetic with realistic dirt, grit, and complex textures in the armor and clothing. Grok Imagine Image produces a beautiful, clean-looking character, but it lacks the weathered intensity requested by the prompt and contains minor anatomical layering issues with the hair.

Magic Burger Explosion: Fiery Photorealism Challenge

Text-to-Image

“Ad for 'Magic Burger'. Dynamic, exploded burger with all components (bun, patty, cheese, lettuce, tomato, sauce) suspended in mid-air. Emphasize photorealistic detail and a sense of motion. Dark, fiery background with glowing embers. Integrate text: 'MAGIC BURGER' as a prominent title, 'LIMITED TIME ONLY' as a secondary message, and '€6.99' in a starburst, all rendered with a fiery, glowing effect.”

Nano Banana 2
Grok Imagine Image

AI Judge Analysis

Nano Banana 2

  • + Excellent photorealistic texture on the meat and buns
  • + Perfect typography integration with glowing fire effects
  • + Strong dynamic composition with realistic sauce splashes and flying seeds
  • The 'starburst' for the price is rendered as a literal five-point star lines

Grok Imagine Image

  • + Accurate 'starburst' shape for the price as is common in retail ads
  • + Clean text rendering and readable font choices
  • + Clear separation of ingredients
  • The burger ingredients look slightly more plastic/artificial compared to Model A
  • The lettuce and tomatoes appear floating in a less organized, more chaotic way

Verdict: Nano Banana 2 produces a much more professional and appetizing image with superior textures on the burger components and more sophisticated lighting. Grok Imagine Image handles the 'starburst' icon better conceptually, but the overall food photography quality and fiery atmosphere in Nano Banana 2 are more convincing for a high-end advertisement.

Candid Street Photography

Text-to-Image

“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”

Nano Banana 2
Grok Imagine Image
0% wins 0% ties 100% wins

AI Judge Analysis

Nano Banana 2

  • + Excellent adherence to the request for 'candid' and 'imperfect framing' with a realistic film-like texture.
  • + Highly detailed and accurate depiction of a Japanese street with readable, authentic signage.
  • + Complex composition with great reflections and atmospheric wet pavement effects.
  • Small anatomical error with the wrench handle appearing to merge or pass through the hand incorrectly.
  • The bicycle's rear wheel and chain area are a bit physically incoherent.

Grok Imagine Image

  • + Strong application of motion blur on passing cars as requested.
  • + Good use of shallow depth of field to separate the subject from the background.
  • + The red bicycle is a central, clear element of the composition.
  • The subject's face is obscured and lacks the 'natural skin texture' requested.
  • Lower overall detail in the environment and textures compared to the competitor.
  • The subject's hands and the specific act of 'repairing' are blurry and indistinct.

Verdict: Nano Banana 2 is the clear winner as it captures the 'no stylization' and 'candid' requirements with incredible photographic realism and environmental detail. While Grok Imagine Image handles the motion blur well, it lacks the textural depth and cultural authenticity seen in the signage and character design of Nano Banana 2.

Pose & Character Mashup

Editing
Edit instruction

“Use Image 1 as the exact pose reference and Image 2 as the character reference. Recreate the person/character from Image 2 in the exact dynamic pose and body position from Image 1. Keep the exact face, hair, clothing style/details, and expression from Image 2. Match the lighting and environment of Image 1. The final image must show the character from Image 2 performing the precise action/pose from Image 1 with perfect anatomy and natural integration.”

Source
Nano Banana 2
Grok Imagine Image

AI Judge Analysis

Nano Banana 2

  • + Successfully transfers the character reference traits including sunglasses, scarf, and facial features.
  • + Follows the complex pose in Image 1 reasonably well, maintaining the yellow background and red ottoman.
  • + Correctly changes the clothing color and style to match Image 2's character.
  • Anatomy is broken, particularly where the foot merges into the torso and the hand placement on the ottoman.
  • The pose is not an exact match to the reference, simplifying the leg cross into a bizarre anatomical fusion.

Grok Imagine Image

  • + Preserves the original Image 1 perfectly without any artifacts.
  • Completely failed the editing task by returning Image 1 without any changes.
  • Zero character transfer from Image 2.

Verdict: Nano Banana 2 attempted the complex task of character and pose merging; while it suffered from significant anatomical issues where limbs merged, it successfully integrated the character's face, sunglasses, and clothing. Grok Imagine Image failed entirely, providing the original pose reference image (Image 1) with no modifications at all.

Man and Car in California

Editing
Edit instruction

“Make a photo of the man driving the car down the California coastline”

Source
Nano Banana 2
Grok Imagine Image
67% wins 0% ties 33% wins

AI Judge Analysis

Nano Banana 2

  • + Successfully preserved the identity and clothing (plaid coat and black scarf) of the man from the second source image.
  • + Accurately maintained the model and details of the Rolls-Royce Phantom Drophead Coupe from the first source image.
  • + Created a high-quality, scenic California coastline background with realistic motion blur and lighting.
  • The man appears to be floating slightly too high in the driver's seat.

Grok Imagine Image

  • + Good photographic quality and dynamic composition with leading lines.
  • + Accurately depicts a California coastal environment.
  • Failed to preserve the man's identity, replacing him with a generic white male in sunglasses.
  • Significantly altered the car design, changing the headlights and grille to a different Rolls-Royce style (closer to a Dawn).
  • Ignored the specific visual cues from the provided second source image.

Verdict: Nano Banana 2 is the clear winner as it followed the multi-image editing instruction perfectly, correctly placing the specific man from the second source image into the specific car from the first source image. Grok Imagine Image failed the core task by replacing the man with a generic person and failing to preserve the exact vehicle model.

Geometric Composition

Text-to-Image

“A glass cube on a wooden table. Inside the cube is a small blue sphere. On top of the cube sits a red book. A green plant is behind the cube, partially visible through the glass. Soft window light from the left.”

Nano Banana 2
Grok Imagine Image

AI Judge Analysis

Nano Banana 2

  • + Excellent photographic realism and lighting
  • + Highly accurate text rendering on the book spine
  • + Perfect adherence to the spatial constraints of the prompt
  • The glass cube looks slightly more like an open-top display case than a sealed cube

Grok Imagine Image

  • + Clean aesthetic with soft lighting
  • + Good interpretation of the plant behind the glass
  • The blue sphere is floating unnaturally in the center of the air
  • The glass cube has physically impossible refractions and a solid center appearance

Verdict: Nano Banana 2 is the clear winner as it produces a highly realistic image with perfect prompt adherence, including legible text on the book and a physically grounded blue sphere. Grok Imagine Image fails on physics, showing the sphere levitating in the middle of the cube and displaying confusing optical distortions in the glass.

Chalkboard Menu

Text-to-Image

“Handwritten-style chalkboard menu in a cozy café, all text rendered in the exact same realistic chalk handwriting style with natural variations in letter size, slight slant, and chalk texture — no printed or digital fonts anywhere on the board. Title at the top in elegant cursive chalk handwriting: ‘TODAY’S SPECIALS – APRIL 30, 2026’. Below it, three menu items also in the same handwritten chalk style: ‘Truffle Mushroom Risotto – $24’, ‘Grilled Octopus with Lemon & Herbs – $28’, ‘Brown Butter Chocolate Chip Cookies – $9’. At the very bottom, smaller text in the identical handwritten chalk style (slightly smaller but still clearly legible with the same handwriting characteristics): ‘All items made fresh daily • Ask about our gluten-free options’. Warm ambient café lighting, visible chalk dust and smudges, realistic handwriting imperfections, no clean printed text anywhere.”

Nano Banana 2
Grok Imagine Image

AI Judge Analysis

Nano Banana 2

  • + Excellent text rendering with no spelling errors.
  • + Beautifully realistic chalk texture and smudges on the board.
  • + Strong composition with a depth-of-field effect in a cozy café setting.
  • The 'Brown Butter' item is truncated compared to the full name requested in the prompt.

Grok Imagine Image

  • + Perfectly renders every word and price from the prompt.
  • + Authentic hand-drawn chalk font style that transitions between cursive and print.
  • Slightly less chalky texture compared to Model A, appearing a bit more like a digital overlay.
  • The framing is very tight on the board, losing the environmental 'cozy café' context.

Verdict: Both models performed exceptionally well on complex text rendering tasks. Nano Banana 2 provides a more immersive and high-quality artistic scene with realistic chalk artifacts, while Grok Imagine Image followed the text instructions more strictly by completing the full item names. Nano Banana 2 is the slight winner for its superior visual quality and more natural-looking handwriting.

Modern Clean Menu

Text-to-Image

“Modern minimalist restaurant menu design, white background with colorful food photos in grid, sections for appetizers/pizza/mains, bold sans-serif fonts, vibrant accents, clean professional layout for casual dining.”

Nano Banana 2
Grok Imagine Image
0% wins 0% ties 100% wins

AI Judge Analysis

Nano Banana 2

  • + Excellent text legibility and mostly coherent spelling for names and prices.
  • + Strict adherence to the 'grid' layout requested for the food photos.
  • + Professional and realistic composition that looks like a genuine high-quality restaurant menu.
  • Some minor spelling errors in small descriptive text (e.g., 'mate', 'romorvmarrese').

Grok Imagine Image

  • + Bright, vibrant food photography that fits the 'casual dining' prompt.
  • + Good use of white space and minimalist margins.
  • Failed to follow the grid layout for photos, opting for a scattered floating arrangement.
  • Text is mostly illegible gibberish, especially in the descriptions and bottom half.
  • Repeats the same menu items (Grilled Salmon, Steak Frites) multiple times in an illogical way.

Verdict: Nano Banana 2 is the clear winner as it produces a professional, functional menu with legible text and a clean grid layout that perfectly matches the prompt. Grok Imagine Image fails to create readable text and repeats items multiple times, resulting in a layout that is aesthetically pleasing at a glance but nonsensical upon closer inspection.

Bald man challenge

Editing
Edit instruction

“Give the person a full, thick head of natural hair with realistic texture, density, and a natural hairline. Preserve facial features and lighting.”

Before After
Nano Banana 2
Before After
Grok Imagine Image
100% wins 0% ties 0% wins

AI Judge Analysis

Nano Banana 2

  • + The hair texture is highly realistic and matches the color and quality of the existing beard perfectly.
  • + Excellent preservation of the original facial features, glasses, and background.
  • + The hairstyle is stylistically coherent with the person's rugged appearance.
  • The hair volume is slightly conservative compared to the 'thick' request, but still matches the prompt well.

Grok Imagine Image

  • + Successfully added a full head of hair that looks natural.
  • + Good preservation of the background and clothing.
  • The hair texture is a bit wispy and doesn't match the density of the beard as well as Model A.
  • There is a slight loss of detail in the face and skin texture compared to the source image.
  • The hairline transition near the temples is slightly less convincing than Model A.

Verdict: Nano Banana 2 was the more successful model, providing hair that integrated seamlessly with the existing beard and facial features while perfectly preserving the image quality and lighting of the source. Grok Imagine Image also performed well but had slightly thinner hair texture and slightly softened facial details, making the edit look less integrated.

Isometric Miniature Diorama Scenes

Text-to-Image

“Create a clear, 45° top-down isometric miniature 3D cartoon scene of Japan's signature dish: sushi, with soft refined textures, realistic PBR materials, gentle lighting, on a small raised diorama base with minimal garnish and plate. Solid light blue background. At top-center: 'JAPAN' in large bold text, 'SUSHI' below it, small flag icon. Perfectly centered, ultra-clean, high-clarity, square format.”

Nano Banana 2
Grok Imagine Image
0% wins 0% ties 100% wins

AI Judge Analysis

Nano Banana 2

  • + Excellent rendering of materials like wood grain, fish texture, and rice grains.
  • + Flawless text rendering and positioning of the flag icon.
  • + Superior composition with a complex yet clean diorama base including moss and varied sushi types.
  • Slightly leans more towards realism than the requested '3D cartoon' style.
  • The perspective is more of a standard 3/4 view than a strict geometric isometric view.

Grok Imagine Image

  • + Perfectly captures the 45-degree isometric perspective and diorama base style.
  • + Matches the '3D cartoon' aesthetic with soft, rounded geometry.
  • + Accurate text and flag iconography.
  • Textures are very basic and lack the 'realistic PBR' quality requested.
  • Individual sushi components (like the maki rolls) look a bit repetitive and simplistic.
  • The lighting is somewhat flat compared to the other model.

Verdict: Nano Banana 2 produces a much higher quality image with impressive material detail and professional Typography, though it misses the specific 'isometric' geometric look. Grok Imagine Image nails the isometric diorama layout and cartoon style perfectly, but the individual assets and textures feel less refined. Nano Banana 2 is the preferred choice for its sheer visual fidelity and better overall execution of the prompt's complex requirements.

Night Sky Transformation

Editing
Edit instruction

“Change the scene to night: a deep, dark sky with subtle, glistening stars visible behind the mountain.”

Before After
Nano Banana 2
Before After
Grok Imagine Image

AI Judge Analysis

Nano Banana 2

  • + Excellent source preservation, maintaining the exact layout of the village and mountains from the original.
  • + Realistic transition to night with a subtle Milky Way effect and natural lighting on the peaks.
  • + High visual clarity and detail in both the foreground and background.
  • None notable; handled the edit instruction very well.

Grok Imagine Image

  • + Successfully applied the requested night scene with stars.
  • + Maintained the structural integrity of the original image well.
  • + Good balance of dark sky and illuminated village.
  • Slightly more compressed appearance compared to Model A.
  • The star distribution is a bit more uniform and less realistic in its cluster patterns.

Verdict: Both models did an exceptional job of preserving the source image while applying the night sky edit. Nano Banana 2 (Model A) is slightly preferred for its more sophisticated rendering of the stars, including a realistic hint of the Milky Way, whereas Grok Imagine Image (Model B) has a more uniform star field.

Over-the-top cartoon caricature

Editing
Edit instruction

“Create a caricature of me and my job. Make it exaggerated and humorous, incorporating my profession as a tv show anchor and my love for dogs and hockey.”

Source
Nano Banana 2
Grok Imagine Image
100% wins 0% ties 0% wins

AI Judge Analysis

Nano Banana 2

  • + Excellent caricature style with hand-drawn colored pencil textures.
  • + Great text rendering in the 'Breaking News' and 'W-K9 NEWS' graphics.
  • + Effectively captures the subject's features while exaggerating them for a humorous effect.
  • Physical logic errors like the microphone floating near the bulldog.
  • The hand holding the hockey stick is oddly rendered with a ring on a distorted finger.

Grok Imagine Image

  • + Strong facial resemblance to the source image while applying caricature proportions.
  • + High-quality rendering with clean digital lines and lighting.
  • + Clever integration of themes, such as a dog on ice skates in the background.
  • The hockey stick held by the dog looks thin and poorly shaped.
  • The anchor's outfit was changed from the source denim to a formal suit (though appropriate for the job).

Verdict: Both models followed the complex editing instructions very well, effectively combining the three required themes (TV anchor, dogs, and hockey). Nano Banana 2 chose a traditional hand-drawn caricature style with excellent thematic text, while Grok Imagine Image opted for a 'big head' digital caricature style that maintained a stronger facial likeness to the source person. Nano Banana 2 is slightly more creative in its presentation as a physical drawing, but Grok Imagine Image feels more polished as a professional graphic.

Adorable Baby Animals in Sunny Meadow

Text-to-Image

“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”

Nano Banana 2
Grok Imagine Image

AI Judge Analysis

Nano Banana 2

  • + Excellent prompt adherence with all four distinct animals clearly visible and correctly identified.
  • + Dynamic and realistic composition showing the animals in motion (running/tumbling).
  • + Superior photographic quality with realistic fur textures and natural lighting integration.
  • A small orange butterfly on the puppy's side is poorly integrated/deformed.
  • The kitten's tail looks slightly elongated and stiff.

Grok Imagine Image

  • + Very cute, stylized interpretation with high-contrast 'god rays'.
  • + Clean, centered composition with large, expressive eyes as requested.
  • Suffers from anatomical merging: the bunny and fox/dog in the bottom right are fused together.
  • The butterfly rendering is very poor, appearing as mere white blobs in the sky.
  • Overall aesthetic is more like a digital painting or 3D render than 'hyper-photorealistic'.

Verdict: Nano Banana 2 is the clear winner as it successfully rendered all four requested animals with distinct, realistic features and a sense of playful movement. Grok Imagine Image failed on a technical level by merging the bunny and fox into a single multi-limbed creature and lacked the photorealistic texture requested by the prompt.

Studio Ghibli Anime Style

Editing
Edit instruction

“Transform this photo into a Studio Ghibli–inspired illustration. Use soft pastel colors, hand-painted textures, gentle lighting, dreamy backgrounds, and a warm, nostalgic mood”

Source
Nano Banana 2
Grok Imagine Image

AI Judge Analysis

Nano Banana 2

  • + Captures the Studio Ghibli aesthetic perfectly with expressive facial features and soft watercolor textures.
  • + Excellent preservation of the original meme's composition and poses while stylizing them.
  • + The background is beautifully reimagined with charming floral details and architectural character typical of Ghibli films.
  • The woman in the red dress is slightly more out of focus than in the original, though it fits the 'dreamy' prompt.

Grok Imagine Image

  • + Good adherence to the Ghibli style, particularly in the cloud rendering and cleaner line art.
  • + Preserves the source image's layout and background elements like the red vehicle.
  • + Soft, warm lighting creates a pleasant nostalgic mood.
  • The character faces are a bit stiff and less expressive compared to Ghibli's usual emotive style.
  • Texture feels a bit flatter and more digital than the requested 'hand-painted' look.

Verdict: Nano Banana 2 is the clear winner as it masterfully blends the 'Distracted Boyfriend' meme with the specific artistic DNA of Studio Ghibli, particularly through its use of expressive eyes, watercolor textures, and floral background details. Grok Imagine Image provides a clean and competent illustration, but it lacks the warmth and 'hand-painted' soul that defines the requested style and makes the other version so much more successful.

Neutral Expression to Genuine Smile

Editing
Edit instruction
{
  "action": "image_edit",
  "reference": "uploaded neutral portrait",
  "change": "Warm genuine Duchenne smile: lips curved up, slight natural teeth, soft eye crinkles, subtle cheek raise",
  "details": "Realistic smiling skin (dimples if present, soft cheek shadows), slightly brighter eyes; keep exact eye shape/color/iris",
  "preserve_exact": "Face identity/structure, eyes/nose/lips/eyebrows, hair, skin texture/pores/freckles, makeup, clothing, head pose, background, lighting, shadows, framing",
  "no_changes": "No face shape change, no new features, no gaze shift, no hair/clothing/lighting/background edits",
  "style": "Ultra-photorealistic 8K portrait, sharp face focus, natural soft lighting, realistic skin glow"
}
Before After
Nano Banana 2
Before After
Grok Imagine Image

AI Judge Analysis

Nano Banana 2

  • + Excellent preservation of original skin texture and freckles
  • + Highly realistic eye crinkles and anatomical smile movements
  • + Matches the lighting and sharp focus of the source image perfectly
  • Slightly more drastic change to the mouth shape than Model B

Grok Imagine Image

  • + Successfully applies the smiling edit while maintaining face structure
  • + Good preservation of hair and background elements
  • Noticeable loss of skin detail and pores on the cheeks (airbrushed look)
  • Smile looks slightly more artificial/digital compared to the source texture
  • Eyes lack the requested 'soft eye crinkles' to make the smile look genuine

Verdict: Nano Banana 2 is the clear winner as it manages to add a genuine Duchenne smile including natural eye crinkles and cheek raises while preserving the raw skin texture and freckles of the source image. Grok Imagine Image applies the smile but smoothens the skin surface, losing the high-frequency detail (pores and freckles) that gave the original image its photographic realism.

Vintage Cafe Logo

Text-to-Image

“Vintage minimalist restaurant logo for "Caffè Florian", retro cloche dome with steam and "Est. 1720" banner, classic typography, warm brown and cream tones, subtle texture on light background, vector emblem style.”

Nano Banana 2
Grok Imagine Image
100% wins 0% ties 0% wins

AI Judge Analysis

Nano Banana 2

  • + Excellent adherence to the 'emblem style' and 'banner' description with a cohesive circular layout.
  • + Precise text rendering for 'Caffè Florian' and 'Est. 1720'.
  • + Superior vintage aesthetic with beautiful woodcut-style hatching on the cloche.
  • The spacing between the letters in 'Florian' is slightly inconsistent.

Grok Imagine Image

  • + Clean, minimalist vector design that is easy to read.
  • + Creative integration of a coffee cup handle and spoon with the cloche dome.
  • + Accurate colors and text rendering.
  • Repeats the 'Est. 1720' text twice, which wasn't requested.
  • The 'banner' for the date is less detailed and lacks the vintage flair of the competition.

Verdict: Nano Banana 2 perfectly captures the requested vintage emblem style with a sophisticated circular composition and traditional banner execution. While Grok Imagine Image provides a clever minimalist design, it fails to match the 'vintage' and 'banner' requirements as effectively as Nano Banana 2 and unnecessarily repeats the date text.

Apollo 11: Journey to Tranquility

Text-to-Image

“Create a clean, modern vector infographic poster about the Apollo 11 mission. NASA-inspired palette (navy, white, muted red, light gray). Flat-vector style, crisp lines, consistent iconography, subtle gradients only. Steps (stop at landing): 1. Launch (Saturn Vicon) 2. Earth Orbit (Earth + orbit ring icon) 3. Translunar (trajectory arc icon) 4. Lunar Orbit (Moon + orbit ring icon) 5. Descent (lunar module descending icon) 6. Landing (lunar module on the surface icon) Small supporting elements (minimal text): • Crew strip: three silhouette icons with only last names: Armstrong, Aldrin, Collins. • Landing site marker: Moon pin labeled "Tranquility" only. Layout constraints: generous margins, large readable labels, clean background with subtle stars. Vector-only, print-poster look, high resolution.”

Nano Banana 2
Grok Imagine Image

AI Judge Analysis

Nano Banana 2

  • + Excellent text rendering with no spelling errors in main labels.
  • + Very professional and consistent flat-vector illustration style.
  • + Higher quality iconography for the Saturn V and Lunar Module.

Grok Imagine Image

  • + Captured the NASA logo and navy background well.
  • + Included numerical labels for the steps as implied by the prompt list.
  • + Used the requested muted red more prominently in the UI elements.
  • Contains multiple text spelling errors (e.g., '3rajaory', 'Moom').
  • The Saturn V rocket design is generic and less accurate than Model A.
  • Cluttered composition with overlapping text and icons.

Verdict: Nano Banana 2 is the superior infographic, featuring professional-grade vector illustrations and perfectly rendered text. While Grok Imagine captures the requested color palette well, it suffers from significant spelling errors and a messy layout that detracts from its utility as an infographic.

Nano Banana 2

Gemini 3.1 Flash with image generation capabilities. High-efficiency image generation model with support for text rendering, reference images, search grounding, and thinking mode. The efficient counterpart to Gemini 3 Pro Image.

Grok Imagine Image

An image generation model by xAI designed to generate highly aesthetic images from text descriptions.