Prompt cache diagnostics are now in Claude Console. When a request misses the cache, you can now see exactly which part of your prompt changed and how many tokens it cost you. https://t.co/z0dV6zzLPm
Anthropic Launches Cache Diagnostics to Debug Silent Claude API Cost Spikes
Anthropic· Updated
Anthropic introduced Cache Diagnostics, a beta feature that identifies exactly why a prompt failed to hit the cache. By comparing consecutive requests, developers can now pinpoint silent cache breakers like reordered tools or dynamic timestamps that inflate API expenses.
- Feature state
- Beta
- Required header
- cache-diagnosis-2026-04-07
- Diagnostic types
- model_changed, system_changed, tools_changed, and messages_changed
- Availability
- Claude API only
- Data retention
- Zero Data Retention eligible
This update addresses a friction point in AI economics where minor, non-deterministic changes cause 10x cost spikes. While the Claude Prompt Caching Dashboard tracks hit rates, diagnostics explain the cause. It follows recent optimizations like Claude's prompt cache pre-warming, completing the toolset needed for reliable, high-context agentic workflows.
To use the feature, include the cache-diagnosis-2026-04-07 beta header and pass the previous response id. The API returns a cache_miss_reason and an estimate of missed tokens, helping you fix root causes like reordered tool arrays. It is currently available on the Claude API but not on Amazon Bedrock or Vertex AI.
Still wondering? A few quick answers below.
Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →


