Imagen 4.0 Fast Generate 001

AI Image Generation Model

Image $$ · 2¢

Google's Imagen 4.0 Fast model optimized for speed and efficiency, suitable for high-volume image generation tasks

Supported Modes
Text to Image
Active

Details

Model ID
imagen-4.0-fast-generate-001
Creator
Family
imagen-4
Tags
image-generation text-to-image fast
// Get Started

Ready to integrate?

Access imagen-4.0-fast-generate-001 via our unified API.

Create Account
Available at 4 providers

Starting from

$0.020 /image via fal.ai, Gemini API, Replicate, Vertex AI

Prices shown are in USD

See all providers

Provider Performance

Fastest generation through gemini at 4,767ms median latency with 90.9% success rate.

Aggregated from real API requests over the last 30 days.

Generation Time

gemini
4,767ms p95: 5,682ms

Success Rate

gemini
90.9%
10 / 11 requests

Time to First Byte

gemini
4,279ms
p95: 4,623ms

Provider Rankings

# Provider p50 Gen Time p95 Gen Time Success Rate TTFB (p50)
1 gemini 4,767ms 5,682ms 90.9% 4,279ms
Data updated every 15 minutes. Based on all API requests through Lumenfall over the last 30 days.

Providers & Pricing (4)

Imagen 4.0 Fast Generate 001 is available from 4 providers, with per-image pricing starting at $0.02 through fal.ai.

fal.ai
fal/imagen-4.0-fast-generate-001
Provider Model ID: fal-ai/imagen4/preview/fast

Output

Image
$0.020 per image
Pricing Notes (2)
  • Faster version of Imagen 4 for speed-focused use cases
  • Supports aspect ratios: 1:1, 16:9, 9:16, 4:3, 3:4
View official pricing • As of
Gemini API
gemini/imagen-4.0-fast-generate-001
Provider Model ID: imagen-4.0-fast-generate-001
$0.020 /image
Replicate
replicate/imagen-4.0-fast-generate-001
Provider Model ID: google/imagen-4-fast
$0.020 /image
Vertex AI
vertex/imagen-4.0-fast-generate-001
Provider Model ID: imagen-4.0-fast-generate-001
$0.020 /image

Imagen 4.0 Fast Generate 001 API OpenAI-compatible

Integrate Imagen 4.0 Fast Generate 001 via the Lumenfall OpenAI-compatible API to generate high-speed images from text prompts using a single unified endpoint.

Base URL
https://api.lumenfall.ai/openai/v1
Model
imagen-4.0-fast-generate-001

Code Examples

Text to Image

/v1/images/generations
curl -X POST \
  https://api.lumenfall.ai/openai/v1/images/generations \
  -H "Authorization: Bearer $LUMENFALL_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "imagen-4.0-fast-generate-001",
    "prompt": "",
    "size": "1024x1024"
  }'
# Response:
# { "created": 1234567890, "data": [{ "url": "https://...", "revised_prompt": "..." }] }

Parameter Reference

Required Supported Not available

Core Parameters

Parameter Type Description Modes
prompt string Required. Text prompt for image generation
T2I
negative_prompt string Negative prompt to guide generation away from undesired content
T2I
seed integer Random seed for reproducibility
T2I

Size & Layout

Parameter Type Description Modes
size string Image dimensions as WxH pixels (e.g. "1024x1024") or aspect ratio (e.g. "16:9")
1365x768 768x1365 887x1182 1024x1024 1183x887
WxH determines both shape and scale (aspect_ratio and resolution are ignored when size is provided). W:H format is equivalent to aspect_ratio.
T2I
aspect_ratio string Aspect ratio of the output image (e.g. "16:9", "1:1")
9:16 3:4 1:1 4:3 16:9
Controls shape independently of scale. Use with resolution to control both. If size is also provided, size takes precedence. Any ratio is accepted and mapped to the nearest supported value.
T2I
resolution string Output resolution tier (e.g. "1K", "4K")
1K
Controls scale independently of shape. Higher tiers produce larger images and cost more. If size is also provided, size takes precedence for scale. Any tier is accepted and mapped to the nearest supported value.
T2I
1K 5 sizes
Output size aspect_ratio + resolution
1183 × 887 "1183x887" or "4:3" + "1K"
1024 × 1024 "1024x1024" or "1:1" + "1K"
887 × 1182 "887x1182" or "3:4" + "1K"
768 × 1365 "768x1365" or "9:16" + "1K"
1365 × 768 "1365x768" or "16:9" + "1K"

How these parameters work

size

Exact pixel dimensions

"1920x1080"
aspect_ratio

Shape only, default scale

"16:9"
resolution

Scale tier, preserves shape

"1K"

Priority when combined

size aspect_ratio + resolution aspect_ratio resolution

size is most specific and always wins. aspect_ratio and resolution control shape and scale independently.

How matching works

Shape matching – we pick the closest supported ratio. Ask for 7:1 on a model with 4:1 and 8:1, you get 8:1.
Scale matching – providers use different tier formats: K tiers (0.5K 1K 2K 4K) or megapixel tiers (0.25 1). If the exact tier isn't available, you get the nearest one.
Dimension clamping – if a model has pixel limits, we clamp dimensions to fit and keep the aspect ratio intact.

Output & Format

Parameter Type Description Modes
response_format string How to return the image
url b64_json
Default: "url"
T2I
output_format string Output image format
png jpeg gif webp avif
Gateway converts to requested format if provider doesn't support it natively.
T2I
output_compression integer Compression level for lossy formats (JPEG, WebP, AVIF)
T2I
n integer Number of images to generate
Default: 1
Gateway generates multiple images in parallel even if provider only supports 1.
T2I

Additional Parameters

Parameter Type Description Modes
personGeneration gemini string Whether to allow generation of people. 'allow_adult' permits adults only; 'dont_allow' blocks all human generation.
allow_adult dont_allow
T2I
safety_filter_level replicate string block_low_and_above is strictest, block_medium_and_above blocks some prompts, block_only_high is most permissive but some prompts will still be blocked
block_low_and_above block_medium_and_above block_only_high
T2I
safety_tolerance fal string The safety tolerance level for content moderation. 1 is the most strict (blocks most content), 6 is the least strict.
1 2 3 4 5 6
T2I
sync_mode fal boolean If `True`, the media will be returned as a data URI and the output data won't be available in the request history.
T2I

Parameter Normalization

How we handle parameters across different providers

Not every provider speaks the same language. When you send a parameter, we handle it in one of four ways depending on what the model supports:

Behavior What happens Example
passthrough Sent as-is to the provider style, quality
renamed Same value, mapped to the field name the provider expects prompt
converted Transformed to the provider's native format size
emulated Works even if the provider has no concept of it n, response_format

Parameters we don't recognize pass straight through to the upstream API, so provider-specific options still work.

Imagen 4.0 Fast Generate 001 Benchmarks

Imagen 4.0 Fast Generate 001 occupies the #25 position in the Text-to-Image arena with a competitive Elo rating of 1163. The model serves as a balanced performance benchmark for Google's low-latency vision architecture.

Lumenfall Arena
#28
Text-to-Image
1165 Elo

Text-to-Image Landscape

1 model without pricing omitted

Elo vs Speed

8 models waiting for enough speed data

Competition Results

Text-to-Image

Photorealism

View leaderboard
Text-to-Image
Prompt

“A candid street photo of an elderly Japanese man repairing a red bicycle in light rain, reflections on wet pavement, shallow depth of field, 50mm lens, natural skin texture, imperfect framing, motion blur from passing cars, cinematic but realistic, no stylization.”

Text-to-Image

Portrait

View leaderboard
#19
Fantasy Warrior
21 models
Text-to-Image
Prompt

“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”

Uncategorized

Text-to-Image
Prompt

“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”

#14
Fantasy Warrior
14 models
Text-to-Image
Prompt

“Close portrait of a battle-worn paladin in ornate engraved plate armor, hair braided with small beads, faint scars and dirt on the skin, warm torchlight reflecting off metal, shallow depth of field, bokeh sparks, lifelike eyes, highly detailed texture on leather straps and cloth underlayer.”

3 attempts – showing best result
Help rank Imagen 4.0 Fast Generate 001 Pick the better image in blind matchups. Results update rankings in real time.
Start Voting

This model demonstrates significant strength in photorealism, where it ranks #10 out of 22 models with a 46.7% win rate, though it shows limitations in portrait generation with a #18 rank and 17.6% win rate. It is optimized for high-volume workflows where speed and realistic texture are prioritized over stylized character detail.

Imagen 4.0 Fast Generate 001 FAQ

How much does Imagen 4.0 Fast Generate 001 cost?

Imagen 4.0 Fast Generate 001 starts at $0.02 per image through Lumenfall. Pricing varies by provider. Lumenfall does not add any markup to provider pricing.

How do I use Imagen 4.0 Fast Generate 001 via API?

You can use Imagen 4.0 Fast Generate 001 through Lumenfall's OpenAI-compatible API. Send requests to the unified endpoint with model ID "imagen-4.0-fast-generate-001". Code examples are available in Python, JavaScript, and cURL.

Which providers offer Imagen 4.0 Fast Generate 001?

Imagen 4.0 Fast Generate 001 is available through Vertex AI, fal.ai, Replicate, and Gemini API on Lumenfall. Lumenfall automatically routes requests to the best available provider.

Overview

Imagen 4.0 Fast Generate 001 is a high-speed text-to-image model developed by Google, designed specifically for low-latency production environments. Belonging to the Imagen 4 family, this iteration prioritizes throughput and rapid inference without sacrificing the core visual coherence associated with Google’s generative research. It is architected to handle high-volume workloads where generation speed is the primary operational requirement.

Strengths

  • Inference Latency: Optimized for near-instantaneous image creation, making it suitable for real-time applications and interactive user interfaces.
  • Prompt Adherence: Demonstrates high fidelity to descriptive text inputs, maintaining consistent spatial relationships and object placement as defined in the prompt.
  • Text Rendering: Improved accuracy in rendering legible text within generated images compared to earlier iterations of the Imagen family.
  • Photorealistic Textures: Capable of producing sharp details in human skin, fabric, and environmental light, even within a compressed generation window.

Limitations

  • Compositional Complexity: While fast, the model may struggle with extremely intricate scenes involving more than five or six distinct subjects compared to the larger, non-“Fast” variants of Imagen 4.
  • Aspect Ratio Flexibility: Performance is most predictable at standard square resolutions, with some degradation in quality or increased artifacts when pushed to extreme panoramic or vertical dimensions.
  • Fine Detail Consistency: In high-speed batches, small background details may occasionally lack the refined polish found in compute-heavy diffusion models.

Technical Background

The model is built on the Imagen 4 architecture, which utilizes a diffusion-based framework enhanced by Google’s latest advancements in transformer-based text encoders. To achieve the “Fast” designation, the model likely employs distilled sampling techniques or a reduced number of inference steps to accelerate the denoising process. This optimization allows the model to maintain a high aesthetic threshold while significantly reducing the total floating-point operations (FLOPs) required per image.

Best For

This model is ideal for dynamic content creation, such as real-time assets for gaming, rapid prototyping for marketing storyboards, and high-traffic web applications that require on-the-fly visual generation. It is a strong choice for developers who need a cost-effective solution for large-scale image batches.

Imagen 4.0 Fast Generate 001 is available through Lumenfall’s unified API and playground, allowing for seamless integration and side-by-side comparison with other industry-leading image models.

Try Imagen 4.0 Fast Generate 001 in Playground

Generate images with custom prompts — no API key needed.

Open Playground