Nano Banana 2 AI generated image

Google

Nano Banana 2

Gemini 3.1 Flash with image generation capabilities. High-efficiency image generation model with support for text rendering, reference images, search grounding, and thinking mode. The efficient counterpart to Gemini 3 Pro Image.

Explore Model

Google

4 AI Image Editing Models, 4 AI Image Generation Models, and 2 AI Video Generation Models

Models

10 models
Nano Banana AI generated image example
Text to Image Image Edit

Gemini 2.5 Flash Image is optimized for image understanding and generation, offering a balance of price and performance with fast and efficient image generation and editing capabilities.

$0.0387 /img
4
Upscale

Google's straightforward image upscaler that supports 2x or 4x magnification with minimal configuration

$0.0050 /img
1
Imagen 3.0 Generate 002 AI generated image example
Text to Image

Google's Imagen 3.0 text-to-image generation model, producing high-quality images with improved detail and lighting

$0.0300 /img
2
Imagen 4.0 Fast Generate 001 AI generated image example
Text to Image

Google's Imagen 4.0 Fast model optimized for speed and efficiency, suitable for high-volume image generation tasks

$0.0200 /img
4
Imagen 4.0 Generate 001 AI generated image example
Text to Image

Google's latest Imagen 4.0 text-to-image generation model with significantly better text rendering and overall image quality

$0.0400 /img
4
Imagen 4.0 Ultra Generate 001 AI generated image example
Text to Image

Google's Imagen 4.0 Ultra model offering the highest fidelity and resolution for professional-grade image generation

$0.0600 /img
4
Veo 3.1 Fast AI generated video example
Text to Video Image to Video

Google's fast video generation model producing 720p/1080p video up to 8 seconds with optional native audio including synchronized sound effects, ambient noise, and dialogue with lip-sync

$0.1000 /video
1
Veo 3.1 Lite AI generated video example
Preview
Text to Video Image to Video

Google's cost-efficient preview video generation model for high-volume use cases, producing 720p or 1080p videos up to 8 seconds with native audio from text or image prompts.

$0.0300 /video
1

Examples

20 images

Arena Rankings

# Model Elo
1 Nano Banana 2 1287
2 Nano Banana Pro 1281
14 more models
17 Nano Banana 1244
9 more models
27 Imagen 4.0 Ultra Generate 001 1226
11 more models
39 Imagen 4.0 Fast Generate 001 1170
40 Imagen 4.0 Generate 001 1161
3 more models

43 models ranked

# Model Elo
1 Nano Banana Pro 1242
2 Nano Banana 2 1238
3 more models
6 Nano Banana 1215
16 more models

22 models ranked

# Model Elo
3 Veo 3.1 Fast 1124
2 more models

5 models ranked

# Model Elo
4 Google Upscaler 1035

4 models ranked

About Google

Google is a global leader in artificial intelligence research, focusing on large-scale multimodal models that integrate frontier reasoning with high-fidelity creative generation. Their work spans foundational research in transformers to the development of the Gemini and Imagen families, which are designed for scalability across enterprise and consumer applications. They are particularly recognized for their advancement in native multimodality, where models handle text, images, and code within a single architectural framework.

  • Nano Banana 2 (Gemini 3.1 Flash Image): This model currently holds the #1 ranking in text-to-image tasks, balancing high-efficiency performance with advanced features like search grounding, thinking mode, and precise text rendering.
  • Nano Banana Pro (Gemini 3 Pro Image): A high-reasoning model that ranks #1 in image editing, allowing users to perform complex modifications and advanced reasoning-based generations through an integrated toolset.
  • Imagen 4.0 Ultra: The highest-fidelity entry in Google’s dedicated image generation line, optimized for professional-grade resolution and photographic detail.
  • Imagen 4.0 Fast: A model built specifically for low-latency requirements, providing a high-speed alternative for high-volume production environments without sacrificing significant quality.
  • Nano Banana (Gemini 2.5 Flash Image): An efficient, cost-effective model designed for rapid image understanding and generation, featuring high performance in image editing relative to its size.

Google excels at building models that are both versatile and technically robust, often leading the industry in capabilities like structured output, code execution, and complex function calling within a creative workflow. Their models are particularly effective for developers who need reliable text rendering and built-in grounding to ensure factual or contextual accuracy. All of these Google models, spanning the high-performance Gemini series and the specialized Imagen line, are available through Lumenfall’s unified API.