Caching is critical for customers to lower both costs and TTFT. We’re launching a new dashboard in Claude Developer Console to increase visibility and help customers optimize their usage. Check it out here: https://t.co/zgBJ4dHXyI https://t.co/Uwje2iPbLT
Anthropic Launches Claude Prompt Caching Dashboard to Optimize API Costs
· Updated
- Cost reduction
- Up to 90% discount on cached tokens
- Performance gain
- Reduced Time to First Token
- Primary metric
- Cache hit rate visibility
- Access location
- Claude Developer Console
- Availability
- All Claude API users
This update mirrors a trend seen in Google's AI Studio usage dashboards as teams move from prototypes to production. It follows a pattern seen in Anthropic's framework for scaling agents, which prioritizes context efficiency for cloud-based systems. Managing repeated prompts is now the primary lever for controlling costs, matching Google's cost-optimized inference tiers.
Access the new usage metrics immediately through the console under the usage tab. The dashboard helps identify specific prompts that are failing to hit the cache, which is critical for reducing Time to First Token (the delay before the model starts responding). This visibility is available for all Claude API users.
Still wondering? A few quick answers below.

