FLUX.2 [max]
Black Forest Labs' flagship image generation model delivering state-of-the-art quality with exceptional realism, precision, and consistency for both text-to-image and advanced image editing
Explore ModelAI Media Models
Browse our collection of AI image and video generation models. Compare pricing, capabilities, and find the perfect model for your creative workflow.
Filters & Sort
101 models
Recraft AI's premium text-to-vector model for generating detailed SVG vector graphics with refined composition and materials
Recraft AI's text-to-vector model for generating production-ready SVG vector images with clean geometry, structured layers, and editable paths
ByteDance's flagship video model with text-to-video, image-to-video, and reference-to-video (multi-image/video/audio) generation, cinematic output, native synchronized audio, multi-shot editing, and director-level camera control.
Alibaba's Qwen Image 2.0 model with enhanced text rendering, supporting both Chinese and English prompts with up to 6 images per request
Alibaba's Qwen Image 2.0 Pro model offering higher quality image generation with enhanced detail and accuracy
An image generation model by xAI designed to generate highly aesthetic images from text descriptions.
xAI's premium image generation model offering higher fidelity output and stronger performance on single-image editing benchmarks compared to the standard Grok Imagine model
PixVerse's latest video generation model with astonishing physics, audio-visual synchronization, multi-shot camera control, and end-frame support
Black Forest Labs' compact, open-source image generation model with sub-second inference, optimized for production and near real-time applications with multi-reference support
Black Forest Labs' distilled 9 billion parameter image generation model with sub-second inference and multi-reference support
Fast distilled version of Black Forest Labs' FLUX.2 [dev] optimized for speed and cost efficiency.
The Max series of Tongyi Qwen’s image generation model excels across a wide range of generation tasks. Compared with the Plus series, it significantly reduces the “AI-like” feel in generated images, enhancing their realism. It delivers more lifelike material textures for human subjects, finer and more detailed natural textures, and more visually appealing text rendering.