Table of Contents
Across many AI engineering teams in 2026, certain mistakes recur. Documenting them helps new teams avoid the same mistakes. Most are addressable; awareness is the first step.
Top mistakes: (1) shipping without eval harness, (2) hardcoded prompts in app code, (3) no observability, (4) no caching, (5) over-provisioning hardware, (6) ignoring residency until enterprise sale, (7) missing fallback path, (8) skipping load test, (9) untested rollback, (10) no per-feature cost attribution. Each addressable; awareness prevents repetition.
Common mistakes
- Shipping without eval harness: every change is a quality gamble; can't safely iterate
- Hardcoded prompts in app code: can't version, can't A/B, can't roll back
- No observability: invisible production behaviour; incidents become detective work
- No caching: paying full inference cost on repeat queries
- Over-provisioning hardware: defaulting to 4090 / 5090 when 5060 Ti is enough
- Ignoring residency until enterprise sale: late discovery of UK / EU residency requirement
- Missing fallback path: hosted API down → production down
- Skipping load test: capacity surprises in first week of traffic
- Untested rollback: rollback theatre — works in theory, untested in practice
- No per-feature cost attribution: features that should be premium-tier are free; unsustainable
Avoiding
- Build eval harness before shipping; gate every change
- Version prompts in repo from day one; reference by version ID
- Build observability stack day one of production
- Implement prefix + semantic caching by default
- Right-size hardware via load testing, not gut feel
- Plan for residency in design phase, not after
- Always have a fallback — cached, alternative model, hosted API
- Run soak test before launch
- Test rollback quarterly
- Tag costs by feature in logs; review monthly
Verdict
The recurring AI engineering mistakes are mostly preventable. Build eval, observability, caching, prompt versioning, fallback, residency thinking from day one. The cost is small; the value is avoiding the painful learning experience that comes from skipping these.
Bottom line
Build the boring foundations first. See deployment checklist.