Imagen 4.0 Generate 001

Name: Imagen 4.0 Generate 001
Brand: Google
Price: 0.04 USD
Availability: InStock

AI Image Generation Model

Google's latest Imagen 4.0 text-to-image generation model with significantly better text rendering and overall image quality


Supported Modes

Text to Image: Active

Details

Model ID: imagen-4.0-generate-001
Creator: Google
Family: imagen-4
Released: August 2025

Ready to integrate?

Access imagen-4.0-generate-001 via our unified API.

Available at 4 providers, starting from $0.040 /image via fal.ai, Gemini API, Replicate, Vertex AI. Prices shown are in USD.

Providers & Pricing (4)

Imagen 4.0 Generate 001 is available from 4 providers, with per-image pricing starting at $0.04 through fal.ai.

Provider	Provider-prefixed ID	Provider Model ID	Price
fal.ai	`fal/imagen-4.0-generate-001`	`fal-ai/imagen4/preview`	$0.040 /image
Gemini API	`gemini/imagen-4.0-generate-001`	`imagen-4.0-generate-001`	$0.040 /image
Replicate	`replicate/imagen-4.0-generate-001`	`google/imagen-4`	$0.040 /image
Vertex AI	`vertex/imagen-4.0-generate-001`	`imagen-4.0-generate-001`	$0.040 /image
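As a rough budgeting aid, the per-image prices listed above can be turned into a small estimator. This is only a sketch: it assumes a flat per-image price at every resolution tier, which this page does not confirm (the parameter reference notes that higher tiers cost more), and `estimate_cost` is a hypothetical helper, not part of any API.

```python
from decimal import Decimal

# Per-image prices in USD, copied from the provider listing above.
PRICE_PER_IMAGE = {
    "fal.ai": Decimal("0.040"),
    "Gemini API": Decimal("0.040"),
    "Replicate": Decimal("0.040"),
    "Vertex AI": Decimal("0.040"),
}


def estimate_cost(provider: str, images: int) -> Decimal:
    """Estimate the USD cost of `images` generations (hypothetical helper).

    Assumes a flat per-image price; tier-dependent pricing is not modelled.
    """
    if images < 0:
        raise ValueError("images must be non-negative")
    return PRICE_PER_IMAGE[provider] * images


if __name__ == "__main__":
    print(estimate_cost("fal.ai", 250))
```

Decimal is used instead of float so that repeated multiplication of small prices does not accumulate rounding error.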

Imagen 4.0 Generate 001 API (OpenAI-compatible)

Integrate Imagen 4.0 Generate 001 into your workflow via Lumenfall's OpenAI-compatible API to generate high-quality images and precise text-in-image renders.

Base URL

https://api.lumenfall.ai/openai/v1

Model

imagen-4.0-generate-001

Code Examples

Text to Image

/v1/images/generations

# cURL
curl -X POST \
  https://api.lumenfall.ai/openai/v1/images/generations \
  -H "Authorization: Bearer $LUMENFALL_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "imagen-4.0-generate-001",
    "prompt": "",
    "size": "1024x1024"
  }'

# Response:
# { "created": 1234567890, "data": [{ "url": "https://...", "revised_prompt": "..." }] }

// JavaScript
import OpenAI from 'openai';

// Point the OpenAI client at the Lumenfall base URL.
const client = new OpenAI({
  apiKey: 'YOUR_API_KEY',
  baseURL: 'https://api.lumenfall.ai/openai/v1'
});

const response = await client.images.generate({
  model: 'imagen-4.0-generate-001',
  prompt: '', // your prompt text
  size: '1024x1024'
});

// { created: 1234567890, data: [{ url: "https://...", revised_prompt: "..." }] }
console.log(response.data[0].url);

# Python
from openai import OpenAI

# Point the OpenAI client at the Lumenfall base URL.
client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://api.lumenfall.ai/openai/v1"
)

response = client.images.generate(
    model="imagen-4.0-generate-001",
    prompt="",  # your prompt text
    size="1024x1024"
)

# { created: 1234567890, data: [{ url: "https://...", revised_prompt: "..." }] }
print(response.data[0].url)

Parameter Reference


Core Parameters

Parameter	Type	Description	Modes
`prompt`	string	Required. Text prompt for image generation	T2I
`negative_prompt`	string	Negative prompt to guide generation away from undesired content	T2I
`seed`	integer	Random seed for reproducibility	T2I

Size & Layout

Parameter	Type	Description	Modes
`size`	string	Image dimensions as WxH pixels (e.g. "1024x1024") or as an aspect ratio (e.g. "16:9"). A WxH value determines both shape and scale, so aspect_ratio and resolution are ignored when it is provided. A W:H value is equivalent to aspect_ratio.	T2I
`aspect_ratio`	string	Aspect ratio of the output image (e.g. "16:9", "1:1"). Supported values: `9:16`, `3:4`, `1:1`, `4:3`, `16:9`. Controls shape independently of scale; use it with resolution to control both. If size is also provided, size takes precedence. Any ratio is accepted and mapped to the nearest supported value.	T2I
`resolution`	string	Output resolution tier (e.g. "1K", "4K"). Supported values: `1K`, `2K`. Controls scale independently of shape. Higher tiers produce larger images and cost more. If size is also provided, size takes precedence for scale. Any tier is accepted and mapped to the nearest supported value.	T2I

1K (5 sizes)

Output	`size`		`aspect_ratio` + `resolution`
1183 × 887	`"1183x887"`	or	`"4:3"` + `"1K"`
1024 × 1024	`"1024x1024"`	or	`"1:1"` + `"1K"`
887 × 1182	`"887x1182"`	or	`"3:4"` + `"1K"`
768 × 1365	`"768x1365"`	or	`"9:16"` + `"1K"`
1365 × 768	`"1365x768"`	or	`"16:9"` + `"1K"`

2K (5 sizes)

Output	`size`		`aspect_ratio` + `resolution`
1774 × 2365	`"1774x2365"`	or	`"3:4"` + `"2K"`
2365 × 1774	`"2365x1774"`	or	`"4:3"` + `"2K"`
1536 × 2731	`"1536x2731"`	or	`"9:16"` + `"2K"`
2731 × 1536	`"2731x1536"`	or	`"16:9"` + `"2K"`
2048 × 2048	`"2048x2048"`	or	`"1:1"` + `"2K"`
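The two size tables above can be expressed as a lookup from `aspect_ratio` plus `resolution` to the equivalent `size` string. The sketch below copies the values from those tables as they are listed; `size_for` is a hypothetical helper name, and defaulting to the `1K` tier is an assumption for illustration.

```python
# (aspect_ratio, resolution) -> "WxH", copied from the 1K and 2K tables above.
SIZES = {
    ("4:3", "1K"): "1183x887",
    ("1:1", "1K"): "1024x1024",
    ("3:4", "1K"): "887x1182",
    ("9:16", "1K"): "768x1365",
    ("16:9", "1K"): "1365x768",
    ("3:4", "2K"): "1774x2365",
    ("4:3", "2K"): "2365x1774",
    ("9:16", "2K"): "1536x2731",
    ("16:9", "2K"): "2731x1536",
    ("1:1", "2K"): "2048x2048",
}


def size_for(aspect_ratio: str, resolution: str = "1K") -> str:
    """Return the `size` string listed for a ratio/tier pair (hypothetical helper)."""
    try:
        return SIZES[(aspect_ratio, resolution)]
    except KeyError:
        raise ValueError(
            f"no listed size for {aspect_ratio!r} at {resolution!r}"
        ) from None


if __name__ == "__main__":
    print(size_for("16:9", "2K"))
```

A lookup like this is only useful for predicting output dimensions client-side; the API accepts either form directly.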

How these parameters work

Parameter	Controls	Example
`size`	Exact pixel dimensions	`"1920x1080"`
`aspect_ratio`	Shape only, default scale	`"16:9"`
`resolution`	Scale tier, preserves shape	`"1K"`

Priority when combined

size › aspect_ratio + resolution › aspect_ratio › resolution

size is most specific and always wins. aspect_ratio and resolution control shape and scale independently.
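The priority order can be sketched as a small function that reports which layout fields actually take effect. This is an illustration of the rules as described, not the gateway's code: `resolve_layout` is a hypothetical name, and keeping `resolution` when `size` is given in W:H form is an assumption, since the page only says that form is equivalent to `aspect_ratio`.

```python
def resolve_layout(params: dict) -> dict:
    """Return the layout fields that win under the stated priority (sketch)."""
    size = params.get("size")
    resolution = params.get("resolution")

    if size and "x" in size:
        # WxH fixes both shape and scale, so the other two fields are ignored.
        return {"size": size}

    effective = {}
    if size:
        # Assumption: a W:H size behaves like aspect_ratio and overrides it.
        effective["aspect_ratio"] = size
    elif params.get("aspect_ratio"):
        effective["aspect_ratio"] = params["aspect_ratio"]

    if resolution:
        effective["resolution"] = resolution
    return effective


if __name__ == "__main__":
    print(resolve_layout(
        {"size": "1024x1024", "aspect_ratio": "16:9", "resolution": "2K"}
    ))
```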

How matching works

Shape matching – we pick the closest supported ratio. For example, if you ask for 7:1 on a model that supports 4:1 and 8:1, you get 8:1.

Scale matching – providers use different tier formats: K tiers (0.5K, 1K, 2K, 4K) or megapixel tiers (0.25, 1). If the exact tier isn't available, you get the nearest one.

Dimension clamping – if a model has pixel limits, we clamp dimensions to fit and keep the aspect ratio intact.
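Shape matching and dimension clamping can be sketched as follows. The page does not define how "closest" is measured or what the pixel limits are, so this example assumes distance on a logarithmic scale (so that 2:1 and 1:2 are treated symmetrically) and a hypothetical `max_side` limit on the longest edge. Both function names are illustrative.

```python
import math

# Supported ratios for this model, from the aspect_ratio parameter above.
SUPPORTED_RATIOS = ["9:16", "3:4", "1:1", "4:3", "16:9"]


def ratio_value(ratio: str) -> float:
    """Convert "W:H" to the float W / H."""
    width, height = ratio.split(":")
    return float(width) / float(height)


def nearest_ratio(requested: str, supported=SUPPORTED_RATIOS) -> str:
    """Pick the supported ratio closest to the request (log-scale distance, assumed)."""
    target = math.log(ratio_value(requested))
    return min(supported, key=lambda r: abs(math.log(ratio_value(r)) - target))


def clamp_dimensions(width: int, height: int, max_side: int) -> tuple:
    """Scale (width, height) down so the longest side fits max_side, keeping shape."""
    longest = max(width, height)
    if longest <= max_side:
        return width, height
    scale = max_side / longest
    return max(1, round(width * scale)), max(1, round(height * scale))


if __name__ == "__main__":
    print(nearest_ratio("7:1", ["4:1", "8:1"]))
    print(clamp_dimensions(4000, 2000, 2048))
```

With the example from the text, a request for 7:1 against 4:1 and 8:1 resolves to 8:1 under this distance measure.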

Output & Format

Parameter	Type	Description	Modes
`response_format`	string	How to return the image. Values: `url`, `b64_json`. Default: `"url"`.	T2I
`output_format`	string	Output image format. Values: `png`, `jpeg`, `gif`, `webp`, `avif`. The gateway converts to the requested format if the provider doesn't support it natively.	T2I
`output_compression`	integer	Compression level for lossy formats (JPEG, WebP, AVIF)	T2I
`n`	integer	Number of images to generate. Default: `1`. The gateway generates multiple images in parallel even if the provider supports only 1.	T2I
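When `response_format` is `b64_json`, the image arrives inline rather than as a URL. The sketch below decodes such a response and writes it to disk. It assumes the response follows the OpenAI Images API shape, with the data under `data[i].b64_json`; this page only shows the `url` form, so treat the field name as an assumption. `save_b64_image` is a hypothetical helper.

```python
import base64
from pathlib import Path


def save_b64_image(response: dict, path: str, index: int = 0) -> int:
    """Decode data[index].b64_json and write it to `path`.

    `response` is the parsed JSON body as a dict. Returns the number of
    bytes written. The b64_json field name is assumed, not confirmed here.
    """
    encoded = response["data"][index]["b64_json"]
    raw = base64.b64decode(encoded)
    Path(path).write_bytes(raw)
    return len(raw)


if __name__ == "__main__":
    # Stand-in response so the example runs without a network call.
    fake = {"data": [{"b64_json": base64.b64encode(b"not a real image").decode()}]}
    print(save_b64_image(fake, "output.png"))
```

Match the file extension to the `output_format` you requested, since the decoded bytes are in that format.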

Additional Parameters

Provider-specific passthrough fields, available only when the request is routed to the listed provider.

Parameter	Type	Description	Modes
Universal
`personGeneration`	string	Whether to allow generation of people. `allow_adult` permits adults only; `dont_allow` blocks all human generation. Values: `allow_adult`, `dont_allow`.	T2I
fal
`safety_tolerance`	string	Safety tolerance level for content moderation. `1` is the most strict (blocks most content); `6` is the least strict. Values: `1`, `2`, `3`, `4`, `5`, `6`.	T2I
`sync_mode`	boolean	If `True`, the media is returned as a data URI and the output data is not available in the request history.	T2I
replicate
`image_size`	string	Resolution of the generated image. Values: `1K`, `2K`.	T2I
`safety_filter_level`	string	Safety filter level. `block_low_and_above` is the strictest, `block_medium_and_above` blocks some prompts, and `block_only_high` is the most permissive, although some prompts will still be blocked. Values: `block_low_and_above`, `block_medium_and_above`, `block_only_high`.	T2I
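Provider-specific fields such as these can be added to the request body alongside the core parameters. The sketch below builds the JSON body only and makes no network call. Placing the extra fields at the top level of the body is an assumption based on the statement that unrecognized parameters pass straight through, and `build_request_body` is a hypothetical helper.

```python
import json

MODEL = "imagen-4.0-generate-001"


def build_request_body(prompt: str, size: str = "1024x1024", **extra) -> str:
    """Serialize an image request with provider-specific fields (sketch).

    Extra fields are merged at the top level of the body, which is assumed
    rather than documented on this page.
    """
    body = {"model": MODEL, "prompt": prompt, "size": size}
    clashes = body.keys() & extra.keys()
    if clashes:
        raise ValueError(f"extra fields overlap core fields: {sorted(clashes)}")
    body.update(extra)
    return json.dumps(body)


if __name__ == "__main__":
    print(build_request_body("a red bicycle", personGeneration="dont_allow"))
```

With the `openai` Python client, the same fields can be supplied through the client's `extra_body` argument, which merges additional keys into the JSON body.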

Parameter Normalization

How we handle parameters across different providers

Not every provider speaks the same language. When you send a parameter, we handle it in one of four ways depending on what the model supports:

Behavior	What happens	Example
`passthrough`	Sent as-is to the provider	style, quality
`renamed`	Same value, mapped to the field name the provider expects	prompt
`converted`	Transformed to the provider's native format	size
`emulated`	Works even if the provider has no concept of it	n, response_format

Parameters we don't recognize pass straight through to the upstream API, so provider-specific options still work.
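To make the `emulated` behavior concrete, here is one way a client or gateway could emulate `n` on top of a provider that returns a single image per call. This is a sketch of the general technique under stated assumptions, not Lumenfall's implementation: `generate_one` is a stand-in for a single-image provider call, and `generate_many` is a hypothetical name.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List


def generate_many(
    generate_one: Callable[[str], str], prompt: str, n: int = 1
) -> List[str]:
    """Emulate `n` by issuing n single-image calls in parallel (sketch).

    `generate_one` takes a prompt and returns one image URL.
    """
    if n < 1:
        raise ValueError("n must be at least 1")
    with ThreadPoolExecutor(max_workers=n) as pool:
        futures = [pool.submit(generate_one, prompt) for _ in range(n)]
        # Collect in submission order; an exception in any call propagates here.
        return [future.result() for future in futures]


if __name__ == "__main__":
    # Stub provider so the example runs without a network call.
    urls = generate_many(lambda p: "https://example.invalid/image.png", "a cat", n=3)
    print(len(urls))
```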


Imagen 4.0 Generate 001 Benchmarks

Google's Imagen 4.0 Generate 001 holds rank #27 in the Text-to-Image arena with a competitive Elo score of 1146. This model demonstrates significant improvements in text rendering accuracy and compositional fidelity over previous Google iterations.

Lumenfall Arena: #40 in Text-to-Image, 1161 Elo
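For readers unfamiliar with Elo, a rating difference maps to an expected head-to-head win rate. This page does not say how the arena computes its ratings, so the sketch below assumes the conventional Elo formula with a 400-point scale. Treat it as a way to read rating gaps, not as the arena's method.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Expected score of A against B under the conventional Elo formula.

    Assumes the standard 400-point logistic scale, which may differ from
    the arena's actual rating system.
    """
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400))


if __name__ == "__main__":
    # Equal ratings give an even match-up.
    print(expected_score(1161, 1161))
```

Under this formula, equal ratings give an expected score of 0.5, and a gap of a few dozen points shifts that only slightly.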

Text-to-Image Landscape

[Chart: Elo vs Cost. 1 model without pricing omitted.]

[Chart: Elo vs Speed. 20 models waiting for enough speed data.]

Competition Results

Adorable Baby Animals in Sunny Meadow

Category: Uncategorized
Mode: Text-to-Image
Rank: #22
Field: 27 models

Prompt

“Hyper-photorealistic scene of fluffy baby animals—a golden retriever puppy, tabby kitten, baby bunny, and red fox kit—with big expressive eyes and ultra-detailed soft fur, playfully chasing butterflies and tumbling together in a lush wildflower meadow, warm golden sunrise light with god rays and dew sparkles, joyful wholesome vibe, 8K masterpiece.”


Imagen 4.0 Generate 001 FAQ

How much does Imagen 4.0 Generate 001 cost?

Imagen 4.0 Generate 001 starts at $0.04 per image through Lumenfall. Pricing varies by provider. Lumenfall does not add any markup to provider pricing.

How do I use Imagen 4.0 Generate 001 via API?

You can use Imagen 4.0 Generate 001 through Lumenfall's OpenAI-compatible API. Send requests to the unified endpoint with model ID "imagen-4.0-generate-001". Code examples are available in Python, JavaScript, and cURL.

Which providers offer Imagen 4.0 Generate 001?

Imagen 4.0 Generate 001 is available through Replicate, fal.ai, Vertex AI, and Gemini API on Lumenfall. Lumenfall automatically routes requests to the best available provider.

Overview

Imagen 4.0 Generate 001 is Google’s fourth-generation text-to-image model, designed to synthesize high-fidelity visuals from natural language descriptions. This iteration focuses on two long-standing hurdles in diffusion models: accurate rendering of complex typography and adherence to detailed, multi-part prompts. It improves on the 3.0 series in spatial reasoning and fine-grained detail.

Strengths

Precise Text Rendering: The model demonstrates a high success rate when embedding specific strings, legible words, and long phrases into images, minimizing the common “gibberish” artifacts found in earlier generation models.
Nuanced Prompt Adherence: It excels at interpreting complex instructions that involve multiple subjects, specific lighting conditions (e.g., “volumetric God rays”), and precise camera angles without merging distinct elements.
Compositional Realism: The model exhibits improved spatial awareness, accurately placing objects in relation to one another according to prepositional commands (e.g., “behind,” “to the left of,” or “resting on”).
High-Fidelity Textures: It produces sharp, realistic textures for challenging subjects such as human skin, woven fabrics, and reflective surfaces, reducing the “plastic” look often associated with AI-generated imagery.

Limitations

Photorealistic Bias: While capable of various styles, the model can lean toward a “stock photo” aesthetic unless specific artistic styles or medium-specific keywords (e.g., “charcoal sketch” or “35mm film grain”) are heavily emphasized.
Anatomical Edge Cases: Like most diffusion models, it may still struggle with extreme anatomical poses or complex overlapping of limbs in crowded scenes.
Generation Latency: Due to the model’s increased parameter count and complexity, inference times may be slightly higher compared to “Turbo” or “Lightning” variants of competing models.

Technical Background

Imagen 4.0 is built on an evolved transformer-based diffusion architecture, likely using a large T5-XXL text encoder to interpret the prompt before image synthesis begins. This version incorporates a more robust training dataset focused on highly descriptive captions and high-resolution aesthetics. The sampling process was also refined so that textural details remain coherent even at the edges of the frame.

Best For

Marketing and Ad Copy: Creating hero images that require integrated legible text, such as signs, storefronts, or branded packaging.
Concept Art: Generating detailed character designs and environments that require strict adherence to specific stylistic and spatial prompts.
UI/UX Prototyping: Visualizing app interfaces and website layouts where text placement and icon clarity are essential.

Imagen 4.0 Generate 001 is available for testing and integration through Lumenfall’s unified API and interactive playground, allowing developers to compare its output alongside other industry-leading image models.
