Today we're announcing the private beta of Agent Memory, a managed service that extracts information from agent conversations and makes it available when it’s needed, without filling up the context window. https://t.co/tcyjIzpiHd
Cloudflare Launches Agent Memory to Give AI Agents Persistent Long Term State
Cloudflare· Updated
Cloudflare introduced Agent Memory, a managed service that extracts and stores key information from agent conversations to prevent context rot. By moving state management to a dedicated pipeline, agents can recall past decisions and facts across sessions without exhausting their context windows.
- Memory types
- Facts, Events, Instructions, Tasks
- Extraction model
- Llama 4 Scout (17B MoE)
- Synthesis model
- Nemotron 3 (120B MoE)
- Availability
- Private beta waitlist
This update addresses context rot—the degradation of model performance as context windows fill up with redundant history. It follows programmable search primitives for external files, but Agent Memory focuses on internal conversation state. It builds on stateful agent harnesses by making knowledge a durable asset.
You can integrate the service via a Cloudflare Worker binding or a REST API to manage memory profiles for individual agents or entire teams. The system uses a hybrid retrieval engine that fuses vector search with full-text search to improve accuracy. Access is limited to a private beta waitlist for developers.
Still wondering? A few quick answers below.
Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →



