Home / Blog / Benchmarks / RTX 5060 Ti 16GB Coqui TTS Benchmark

Benchmarks

RTX 5060 Ti 16GB Coqui TTS Benchmark

Coqui XTTS v2 and Bark-small on Blackwell 16GB - real-time factor, VRAM, batch throughput for self-hosted TTS.

Benchmarks April 23, 2026 1 min read gigagpu

Coqui XTTS v2 is the leading open TTS model for multilingual voice cloning. Numbers on the RTX 5060 Ti 16GB at our hosting:

Setup
XTTS v2 throughput
Batch
Voice cloning latency

Setup

Coqui TTS 0.22
Model: XTTS v2 (multilingual, 17 languages)
Sample rate: 24 kHz, mel 80-band
FP16 inference, CUDA 12.6

XTTS v2 Throughput (Batch 1)

Length (output audio)	Gen time	RTF
5 sec	0.85 s	0.17
10 sec	1.25 s	0.125
20 sec	2.20 s	0.110
60 sec	6.10 s	0.102

Real-time factor below 0.2 means you generate audio ~5-10x faster than it plays. Solid for interactive voice assistants.

Batch 4

Length	Total time (4 items)	Per-item
5 sec each	2.2 s	0.55 s
10 sec each	3.4 s	0.85 s

Batching 4 cuts per-item time by ~35%. VRAM peak ~6 GB.

Voice Cloning Latency

Provide a 6-second reference clip, generate new speech in cloned voice:

Speaker encoding (one-time): ~300 ms
Generation: same as unclones (RTF ~0.1)

For persistent cloned voices, cache the speaker embedding in memory to skip the 300 ms on subsequent calls.

Coqui TTS on Blackwell 16GB

RTF 0.1, voice cloning ready. UK dedicated hosting.

Order the RTX 5060 Ti 16GB

Need a Dedicated GPU Server?

Deploy from RTX 3050 to RTX 5090. Full root access, NVMe storage, 1Gbps — UK datacenter.

Browse GPU Servers

Benchmarks

gigagpu

We benchmark, deploy, and optimise GPU infrastructure for AI workloads. All data in our guides comes from real-world testing on our UK-based dedicated GPU servers.

Ready to deploy your AI workload?

Dedicated GPU servers from our UK datacenter. NVMe storage, 1Gbps networking, full root access.

Browse GPU Servers Contact Sales

RTX 5060 Ti 16GB Coqui TTS Benchmark

Contents

Setup

XTTS v2 Throughput (Batch 1)

Batch 4

Voice Cloning Latency

Coqui TTS on Blackwell 16GB

Need a Dedicated GPU Server?

gigagpu

GPU Hosting

Blog Categories

AI Model Hosting

Benchmarks & Tools

Deploy a GPU Server

Ready to deploy your AI workload?

Have a question? Need help?

RTX 5060 Ti 16GB Coqui TTS Benchmark

Contents

Setup

XTTS v2 Throughput (Batch 1)

Batch 4

Voice Cloning Latency

Coqui TTS on Blackwell 16GB

Need a Dedicated GPU Server?

gigagpu

Related Articles

RTX 4090 24 GB TFLOPS Benchmark Class: Where It Sits in the AI Hierarchy

Whisper Large-v3 RTF by GPU

DeepSeek 7B on RTX 4060: Performance Benchmark & Cost, Category: Benchmarks, Slug: deepseek-7b-on-rtx-4060-benchmark, Excerpt: DeepSeek 7B benchmarked on RTX 4060: 22.0 tok/s at 4-bit GGUF Q4_K_M, VRAM usage, cost per 1M tokens, and deployment configuration., Internal links: 9 –>

Embedding Speed: GPU vs CPU Benchmark

GPU Hosting

Blog Categories

AI Model Hosting

Benchmarks & Tools

Deploy a GPU Server

Ready to deploy your AI workload?

Have a question? Need help? Contact us

Have a question? Need help?