Benchmarks

Real performance data, not marketing claims. Our benchmarks test every GPU we offer across LLM inference, image generation, OCR, and TTS workloads on dedicated GPU servers. See our tokens/sec benchmark for the latest results.

Benchmarks May 2026

RTX 4090 24GB Concurrent Chat Users: Capacity Benchmark

Deep concurrent-user benchmark on the RTX 4090 24GB across Llama 3 8B FP8, Mistral 7B, Qwen 14B/32B AWQ, Phi-3 mini…

Benchmarks May 2026

RTX 4090 24GB FLUX.1-schnell Benchmark

Full FLUX.1-schnell benchmark on the RTX 4090 24GB - 1.8s per 1024px image at FP8, batch throughput, FP16 vs FP8…

Benchmarks May 2026

RTX 4090 24GB Qwen 2.5 14B Benchmark: Full Quant Sweep with AWQ, FP8 and GGUF

Comprehensive RTX 4090 24GB benchmark for Qwen 2.5 14B - 135 t/s AWQ INT4, 110 t/s FP8 at batch 1,…

Benchmarks May 2026

RTX 4090 24GB Qwen 2.5 32B Benchmark: AWQ INT4 at the VRAM Ceiling

Comprehensive RTX 4090 24GB benchmark for Qwen 2.5 32B - AWQ INT4 fits at 18GB, decodes 65 t/s, sustains 4…

Benchmarks May 2026

RTX 4090 24GB Whisper Benchmarks: large-v3, Turbo, WhisperX

Deep real-time-factor measurements for Whisper large-v3, large-v3-turbo and medium on the RTX 4090 24GB, including batched WhisperX throughput, WhisperX alignment,…

Benchmarks May 2026

RTX 4090 24GB Fine-Tuning Throughput: LoRA, QLoRA, Unsloth

LoRA, QLoRA and Unsloth fine-tuning throughput on the RTX 4090 24GB across Llama 3 8B, Mistral 7B, Qwen 14B, Qwen…

Benchmarks May 2026

RTX 4090 24GB FLUX.1-dev Benchmark: 30-step in 4 Seconds FP8

FLUX.1-dev FP16 just fits a single RTX 4090 24GB at 22GB peak with 30-step renders in 6.2s; FP8 drops to…

Benchmarks May 2026

RTX 4090 24GB SDXL Benchmark: 1024×1024 in 2.0s

SDXL 1.0 at 1024×1024 on the RTX 4090 24GB renders a 30-step image in 2.0 seconds, batch of four in…

Benchmarks May 2026

RTX 4090 24GB Stable Video Diffusion Benchmark: 25-frame in 25s

Deep Stable Video Diffusion benchmark on the RTX 4090 24GB: 25-frame SVD-XT in 25s FP16 / 18s FP8, full VRAM…

