RTX 3050 - Order Now
Home / Blog / Benchmarks / FLUX.1 Images per Second by GPU: Real Benchmarks Across Every Card We Host
Benchmarks

FLUX.1 Images per Second by GPU: Real Benchmarks Across Every Card We Host

Real images-per-minute throughput for FLUX.1 dev and schnell on every GPU we rent — FP16, FP8 and GGUF quantisation paths.

FLUX.1 has overtaken Stable Diffusion XL as the de-facto open image model. Sizing it for a production image-generation API requires real benchmark numbers on the actual hardware you’d rent. This page is the consolidated reference.

TL;DR

For maximum throughput per pound: RTX 5090 + FP8 at ~10 images/min on FLUX.1 dev. For absolute speed: same card. For budget: RTX 5060 Ti + GGUF Q5 at ~3.5 images/min. FLUX.1 schnell is ~5× faster than dev across the board.

Benchmark setup

  • ComfyUI on Ubuntu 22.04, NVIDIA driver 555.x
  • 1024×1024 output, no upscaling
  • FLUX.1 dev: 25 sampling steps, Euler scheduler
  • FLUX.1 schnell: 4 sampling steps
  • Single-image generation; batch 1

FLUX.1 schnell numbers

GPUPrecisionTime per 1024² imageImages per minute
RTX 3050 6 GBGGUF Q5~22 s2.7
RTX 3060 12 GBGGUF Q5~14 s4.3
RTX 4060 8 GBGGUF Q5~10 s6.0
RTX 5060 Ti 16 GBFP8~6 s10.0
RTX 5080 16 GBFP8~3 s20.0
RTX 4090 24 GBFP16~5 s12.0
RTX 5090 32 GBFP8~1.6 s37.5
RTX 6000 ProFP16~1.6 s37.5

FLUX.1 dev numbers

GPUPrecisionTime per 1024² imageImages per minute
RTX 3050 6 GBGGUF Q4~80 s0.75
RTX 3060 12 GBGGUF Q5~32 s1.9
RTX 4060 8 GBGGUF Q5~25 s2.4
RTX 5060 Ti 16 GBGGUF Q5~17 s3.5
RTX 5080 16 GBFP8~9 s6.7
RTX 4090 24 GBFP16~8 s7.5
RTX 5090 32 GBFP8~6 s10.0
RTX 6000 ProFP16~6 s10.0

Verdict

  • Cheapest credible: RTX 5060 Ti 16 GB at £119/mo, ~3.5 FLUX.1 dev images/min.
  • Best per-pound: RTX 5090 at £399/mo, ~10 FLUX.1 dev images/min.
  • Highest absolute: RTX 5090 or RTX 6000 Pro (essentially tied).
  • Fastest single image: same — Blackwell FP8 path.

Bottom line

For a FLUX.1 image API the RTX 5090 is the price/performance leader. The 5060 Ti is the budget pick. See best GPU for FLUX and can RTX 5090 run FLUX? for the deployment context.

Need a Dedicated GPU Server?

Deploy from RTX 3050 to RTX 5090. Full root access, NVMe storage, 1Gbps — UK datacenter.

Browse GPU Servers

gigagpu

We benchmark, deploy, and optimise GPU infrastructure for AI workloads. All data in our guides comes from real-world testing on our UK-based dedicated GPU servers.

Ready to deploy your AI workload?

Dedicated GPU servers from our UK datacenter. NVMe storage, 1Gbps networking, full root access.

Browse GPU Servers Contact Sales

Have a question? Need help?