How do we automate business analytics with Claude? New blog post covering our best practices for skills, data foundations, and evaluations when building agents to perform data analysis: https://t.co/mfEJMAQFBU
Anthropic automates 95 percent of internal analytics using new Claude agent stack
- Internal Automation Rate
- 95%
- Aggregate Accuracy
- ~95%
- Accuracy Gain (Adversarial Review)
- 6%
- Token Cost Increase (Adversarial Review)
- 32%
- Latency Increase (Adversarial Review)
- 72%
This release builds on Anthropic's evaluation framework, shifting focus to domain-specific reliability. While tools like Claude Code excel at creative software tasks, business data often suffers from staleness. By forcing agents to use a semantic layer (a central source of metric definitions), organizations can reach the 95% accuracy levels Anthropic reports internally.
Teams can implement this stack by adopting the provided Skill File Skeleton to define procedural knowledge. This includes setting up adversarial review sub-agents to challenge query assumptions and using MCP (a universal connector for AI tools) to connect agents to governed warehouses. The framework is available now for organizations deploying Claude in Claude Cowork.
Still wondering? A few quick answers below.
Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →



