RTX 3050 - Order Now
Home / Blog / Benchmarks / Flux.1 Images/sec by GPU
Benchmarks

Flux.1 Images/sec by GPU

Benchmark results for Flux.1 image generation speed across six GPUs, with images per second data and cost-efficiency analysis for dedicated GPU hosting.

Flux.1 Benchmark Overview

Flux.1 by Black Forest Labs has emerged as one of the leading open image generation models, delivering exceptional prompt adherence and visual quality. Running Flux.1 inference on a dedicated GPU server requires a card with at least 12 GB of VRAM, as the model is significantly larger than Stable Diffusion. We benchmark images per second across six GPUs to help you choose the right hardware.

All tests were run on GigaGPU servers at 1024×1024 resolution with 20 sampling steps using the default Euler scheduler. Flux.1 Dev was used for all measurements. For comparisons with other image models, see our SD 1.5 vs SDXL speed benchmark.

Images/sec Results by GPU

GPUVRAMFlux.1 Dev 1024×1024 (images/sec)Notes
RTX 30506 GBN/AInsufficient VRAM
RTX 40608 GBN/AInsufficient VRAM
RTX 4060 Ti16 GB0.12 img/s~8.3s per image
RTX 309024 GB0.19 img/s~5.3s per image
RTX 508016 GB0.28 img/s~3.6s per image
RTX 509032 GB0.42 img/s~2.4s per image

Flux.1 is considerably more compute-intensive than SDXL. The RTX 5090 at 0.42 images/sec (~2.4 seconds per image) is the only GPU tested that approaches real-time generation speeds. The RTX 3090 at 5.3 seconds per image is still practical for batch generation.

Resolution Impact on Speed

Higher resolutions dramatically reduce throughput. Below we compare 512×512, 1024×1024, and 1536×1536 on the RTX 5090.

ResolutionRTX 3090 (img/s)RTX 5090 (img/s)
512×5120.521.15
1024×10240.190.42
1536×15360.080.18

At 512×512 the RTX 5090 breaks one image per second, while 1536×1536 drops to roughly one every 5.5 seconds. Choose your resolution target carefully based on your application needs.

Cost Efficiency Analysis

GPU1024×1024 img/sApprox. Monthly Costimg/s per Pound
RTX 4060 Ti0.12~£750.0016
RTX 30900.19~£1100.0017
RTX 50800.28~£1600.0018
RTX 50900.42~£2500.0017

The RTX 5080 offers the best cost efficiency for Flux.1. For the best GPU for Flux, it represents an excellent balance of speed and price.

GPU Recommendations

  • Budget: RTX 3090 — 5.3s per image at 1024×1024 is workable for batch generation pipelines.
  • Best value: RTX 5080 — highest images per pound with sub-4-second generation.
  • Best speed: RTX 5090 — 2.4s per image for near-real-time generation APIs.

Compare Flux.1 with other image generation models in our SDXL Turbo benchmark or the SD 1.5 vs SDXL comparison. Browse all results in the Benchmarks category.

Conclusion

Flux.1 delivers superior image quality but at the cost of higher compute requirements compared to Stable Diffusion models. GPUs with 16 GB or more VRAM can run it, with the RTX 5080 and RTX 5090 providing the best experience. For high-volume image generation, consider batching requests or using lower resolutions to maximise throughput.

Generate Images with Flux.1 on Dedicated GPUs

High-VRAM GPU servers optimised for image generation workloads with fast NVMe and full root access.

Browse GPU Servers

Need a Dedicated GPU Server?

Deploy from RTX 3050 to RTX 5090. Full root access, NVMe storage, 1Gbps — UK datacenter.

Browse GPU Servers

admin

We benchmark, deploy, and optimise GPU infrastructure for AI workloads. All data in our guides comes from real-world testing on our UK-based dedicated GPU servers.

Ready to deploy your AI workload?

Dedicated GPU servers from our UK datacenter. NVMe storage, 1Gbps networking, full root access.

Browse GPU Servers Contact Sales

Have a question? Need help?