Home / Blog / Benchmarks / RTX 5060 Ti 16GB vs RTX 5080 Benchmark

Benchmarks

RTX 5060 Ti 16GB vs RTX 5080 Benchmark

Blackwell 16GB vs Blackwell 5080 16GB - same VRAM, same FP8, very different bandwidth and power.

Benchmarks April 23, 2026 1 min read gigagpu

The RTX 5080 and RTX 5060 Ti 16GB both have 16 GB VRAM and Blackwell FP8 tensor cores. Both available on our hosting. Comparison:

Specs
LLM decode
Aggregate throughput
Per-watt
Verdict

Specs

Spec	5060 Ti 16GB	5080 16GB
CUDA cores	4,608	10,752
VRAM	16 GB GDDR7	16 GB GDDR7
Bandwidth	448 GB/s	960 GB/s
TDP	180 W	360 W
PCIe	Gen 5 x8	Gen 5 x16
Price (UK hosting)	Lower tier	Higher tier

LLM Decode (Llama 3.1 8B FP8)

Batch	5060 Ti t/s	5080 t/s	Ratio
1	112	185	1.65x
8	510	810	1.59x
32	720	1,150	1.60x

5080 is ~60% faster at equivalent batch, tracking roughly with bandwidth delta.

Aggregate Concurrency Ceiling

5060 Ti: ~720 t/s aggregate at ~35 concurrent Llama 3 8B chats
5080: ~1,150 t/s aggregate at ~55 concurrent Llama 3 8B chats

Per-Watt Efficiency

Metric	5060 Ti	5080
Peak t/s	720	1,150
Draw at peak	155 W	305 W
tokens/Joule	4.6	3.8

5060 Ti wins on tokens/watt. 5080 wins on per-dollar-per-second if your workload needs the concurrency.

Verdict

5060 Ti: best when you run multiple separate workloads, value hosting, efficiency matters
5080: best when one workload needs the higher single-card throughput, more concurrent users, or slightly larger models at tighter quantisation

Both have identical VRAM so they serve the same model catalogue. The upgrade is purely about throughput.

Blackwell Value vs Blackwell Performance

Both available. UK dedicated hosting.

Order the RTX 5060 Ti 16GB

Need a Dedicated GPU Server?

Deploy from RTX 3050 to RTX 5090. Full root access, NVMe storage, 1Gbps — UK datacenter.

Browse GPU Servers

Benchmarks

gigagpu

We benchmark, deploy, and optimise GPU infrastructure for AI workloads. All data in our guides comes from real-world testing on our UK-based dedicated GPU servers.

Ready to deploy your AI workload?

Dedicated GPU servers from our UK datacenter. NVMe storage, 1Gbps networking, full root access.

Browse GPU Servers Contact Sales

RTX 5060 Ti 16GB vs RTX 5080 Benchmark

Contents

Specs

LLM Decode (Llama 3.1 8B FP8)

Aggregate Concurrency Ceiling

Per-Watt Efficiency

Verdict

Blackwell Value vs Blackwell Performance

Need a Dedicated GPU Server?

gigagpu

GPU Hosting

Blog Categories

AI Model Hosting

Benchmarks & Tools

Deploy a GPU Server

Ready to deploy your AI workload?

Have a question? Need help?

RTX 5060 Ti 16GB vs RTX 5080 Benchmark

Contents

Specs

LLM Decode (Llama 3.1 8B FP8)

Aggregate Concurrency Ceiling

Per-Watt Efficiency

Verdict

Blackwell Value vs Blackwell Performance

Need a Dedicated GPU Server?

gigagpu

Related Articles

Phi-3 Mini Tokens/sec by GPU

RTX 5060 Ti 16GB TFLOPS for AI Workloads

RTX 4090 24GB Stable Video Diffusion Benchmark: 25-frame in 25s

Mistral 7B and Mistral Small 22B Benchmarks Across Every GPU We Host

GPU Hosting

Blog Categories

AI Model Hosting

Benchmarks & Tools

Deploy a GPU Server

Ready to deploy your AI workload?

Have a question? Need help? Contact us

Have a question? Need help?