FP8 quantized variant of Black Forest Labs' FLUX.1 [schnell] model, offering ~2x faster inference with reduced precision while maintaining high-quality image generation in 4 steps
Providers & Pricing (1)
FLUX.1 [schnell] FP8 is free to use through Fireworks AI.
Fireworks AI
fireworks/flux.1-schnell-fp8
Provider Model ID:
accounts/fireworks/models/flux-1-schnell-fp8/text_to_image
Output
Image
Free
per image
Pricing Notes (4)
- • Free to try
- • Normally priced at $0.00035 per inference step
- • FLUX.1 [schnell] uses 4 steps by default, making the effective per-image cost $0.0014
- • FP8 variant uses reduced precision for ~2x faster inference
Provider Performance
Fastest generation through fireworks at 1,769ms median latency with 96.0% success rate.
Aggregated from real API requests over the last 30 days.
Generation Time
fireworks
1,769ms
p95: 11,146ms
Success Rate
fireworks
96.0%
3,106 / 3,236 requests
Time to First Byte
fireworks
962ms
p95: 4,736ms
Provider Rankings
| # | Provider | p50 Gen Time | p95 Gen Time | Success Rate | TTFB (p50) |
|---|---|---|---|---|---|
| 1 | fireworks | 1,769ms | 11,146ms | 96.0% | 962ms |
Data updated every 15 minutes. Based on all API requests through Lumenfall over the last 30 days.