Home / Blog / AI Hosting & Infrastructure / Version Pinning Strategy for AI Deployments: What to Pin, How Tight

AI Hosting & Infrastructure

Version Pinning Strategy for AI Deployments: What to Pin, How Tight

AI stacks have many moving versions — driver, CUDA, vLLM, model commit. Pinning the wrong layer too tight breaks security; too loose breaks reproducibility.

AI Hosting & Infrastructure May 5, 2026 1 min read gigagpu

Table of Contents

One unattended-upgrades incident at 3 AM is enough to motivate version pinning. The question is what to pin.

TL;DR

Pin tight: NVIDIA driver, CUDA, vLLM, model commit SHA. Pin loose: OS minor versions, language libraries. Update on a maintenance window with eval harness validation.

Versioning layers

OS: Ubuntu 22.04 LTS — pin to LTS, allow security updates
NVIDIA driver: pinned to exact version (e.g., 555.42)
CUDA toolkit: pinned to exact version (e.g., 12.4)
cuDNN / NCCL: pinned
Python: pinned to minor (e.g., 3.10.x)
vLLM: pinned to exact (0.6.3)
Model: pinned to commit SHA, never tag
LiteLLM, TEI, Qdrant: pinned to exact

Pinning strategy

Use apt-mark hold for system packages. Use requirements.txt with exact versions for Python. Pin model with explicit revision: --revision sha256....

Verdict

Version pinning is boring infrastructure that pays back the first time something breaks. Always pin the GPU stack tight.

Bottom line

Pin everything in the GPU stack. Update on maintenance windows. See driver setup.

Need a Dedicated GPU Server?

Deploy from RTX 3050 to RTX 5090. Full root access, NVMe storage, 1Gbps — UK datacenter.

Browse GPU Servers

AI Hosting & Infrastructure

gigagpu

We benchmark, deploy, and optimise GPU infrastructure for AI workloads. All data in our guides comes from real-world testing on our UK-based dedicated GPU servers.

Ready to deploy your AI workload?

Dedicated GPU servers from our UK datacenter. NVMe storage, 1Gbps networking, full root access.

Browse GPU Servers Contact Sales

Version Pinning Strategy for AI Deployments: What to Pin, How Tight

Versioning layers

Pinning strategy

Verdict

Bottom line

Need a Dedicated GPU Server?

gigagpu

GPU Hosting

Blog Categories

AI Model Hosting

Benchmarks & Tools

Deploy a GPU Server

Ready to deploy your AI workload?

Have a question? Need help?

Version Pinning Strategy for AI Deployments: What to Pin, How Tight

Versioning layers

Pinning strategy

Verdict

Bottom line

Need a Dedicated GPU Server?

gigagpu

Related Articles

VPN Setup for Remote AI Inference Access

SSL/TLS for AI APIs: Let’s Encrypt + Nginx

AI Platform Build vs Buy in 2026

Open-Source LLM Licensing in 2026: A Practical Comparison

GPU Hosting

Blog Categories

AI Model Hosting

Benchmarks & Tools

Deploy a GPU Server

Ready to deploy your AI workload?

Have a question? Need help? Contact us

Have a question? Need help?