New Science Blog: Why has AI advanced faster in coding than in biology? To agents, bio databases are like cities built before cars—maddening to drive in because they're designed for different traffic. How do we build infrastructure agents can use? https://t.co/PQaNQ4GRJZ
Anthropic: Agent-Friendly Infrastructure Crucial for AI in Biology
AnthropicAnthropic published a new Science Blog post detailing why AI agents have advanced faster in coding than in biology. The research highlights that biological data infrastructure is often not designed for agents, leading to unreliable performance in scientific tasks. Building deterministic retrieval layers is crucial for agents to navigate scientific data effectively.
- Benchmark
- VirBench
- Agent performance (without gget virus)
- 16.9% to 91.3% mean accuracy
- Agent performance (with gget virus)
- >90% for all agents, peaking at 99.7% for GPT-5.5
- Deterministic retrieval layer
- gget virus
- Models tested
- Claude Sonnet 4, Claude Opus 4.7, Biomni OSS, Edison Analysis, GPT-5.2-pro, GPT-5.5
Even frontier models like Claude and GPT struggled to retrieve viral sequence data from NCBI Virus, achieving accuracies as low as 16.9% with high variability. Small errors in biological data retrieval can have severe consequences, invalidating downstream analyses. The bottleneck is not just agent reasoning, but the absence of dependable execution layers.
Adding a deterministic retrieval layer, such as gget virus developed with NCBI, dramatically improved agent accuracy to nearly 100% and eliminated variability. This suggests making biological data infrastructure agent-friendly, with reliable access paths, is more critical for scientific agents than relying on model power. This research is part of Anthropic's Science Blog efforts.
Still wondering? A few quick answers below.
Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →


