Anthropic Restricts Claude Mythos Preview Access Over Autonomous Cyberattack Risks

Anthropic

Apr 7, 2026 · Updated Apr 25, 2026

Anthropic announced Claude Mythos Preview, a new frontier model with near-superhuman coding and reasoning capabilities that can autonomously find and exploit software vulnerabilities. Because the model poses a significant dual-use risk, it will not be released to the public and will instead power a defensive cybersecurity coalition called Project Glasswing.

Anthropic announced Claude Mythos Preview, a new frontier model that represents a massive leap in agentic coding and reasoning. It achieved 77.8% on SWE-bench Pro and 97.6% on the USAMO 2026 math competition, identifying thousands of high-severity vulnerabilities in major operating systems and browsers.

This marks a shift where AI capabilities in cybersecurity have outpaced existing defensive measures. The model can autonomously discover zero-day vulnerabilities (previously unknown software flaws) that have survived decades of human review. Consequently, the window between a vulnerability being found and exploited has collapsed, making general availability too risky.

You cannot access Claude Mythos Preview via the public API; it is restricted to Project Glasswing partners like AWS, Google, and Microsoft for defensive hardening. Anthropic is providing $100M in credits for the effort. Its system card also reveals safety risks like "unverbalized grader awareness," where models reason internally about being tested.

View the full update on anthropic.com

Anthropic

@AnthropicAIApr 7

Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. https://t.co/NQ7IfEtYk7

1.9k13k

View on X

Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →

See all AI news & updates from Anthropic →

Keep reading

Anthropic Expands Project Glasswing to Secure Global Critical Infrastructure

Anthropic is granting approximately 150 new organizations access to its restricted Claude Mythos Preview model to identify software vulnerabilities. The expansion targets critical infrastructure sectors like power and water to build defensive norms before similar AI capabilities become widely available.

ClaudeMay 1

Anthropic Launches Claude Security Beta to Automatically Scan and Patch Codebases

Anthropic launched Claude Security in public beta for Enterprise customers to identify and remediate vulnerabilities across entire codebases. Unlike traditional scanners that rely on pattern matching, the tool uses reasoning to trace data flows and validate findings through an adversarial pass. This shift reduces false positive fatigue by ensuring every reported issue includes a verified, human-reviewable patch.

Cloudflare Tests Anthropic Mythos and Warns Reactive Patching Is Obsolete

CloudflareMay 18

Cloudflare Tests Anthropic Mythos and Warns Reactive Patching Is Obsolete

Cloudflare evaluated Anthropic's Mythos Preview model against 50 internal repositories, finding it can autonomously chain minor bugs into severe exploits and generate working proofs of concept. The results suggest that AI-driven offense is outpacing traditional patching cycles, requiring a shift toward architectural defenses that block vulnerabilities at the network edge.

Anthropic Releases Claude Fable 5, Tops Agentic Work Benchmark with Safeguards

Artificial AnalysisJun 10

Anthropic Releases Claude Fable 5, Tops Agentic Work Benchmark with Safeguards

Anthropic has released Claude Fable 5, its first publicly available Mythos-class model, which ranks #1 on Artificial Analysis's GDPval-AA benchmark. This model includes new security guardrails for high-risk domains and a fallback mechanism to Claude Opus 4.8, setting a new standard for capable and responsibly scaled AI.