Nous Research launches Tool Search to prevent context bloat in Hermes Agent

Nous Research

Jun 1, 2026 · Updated Jun 12, 2026

Nous Research released Tool Search for its Hermes Agent to manage large toolsets more efficiently. By loading tool definitions only when needed, the agent preserves context window space for reasoning and conversation history.

Nous Research has released Tool Search for Hermes Agent to optimize tool handling. It uses a progressive-disclosure layer to replace large MCP (Model Context Protocol) schemas with three bridge tools. This prevents the context window—the data a model processes at once—from being crowded by technical definitions not immediately relevant to the task.

Bridge tools: tool_search, tool_describe, tool_call
Activation threshold: 10% of context window
Search algorithm: BM25 with literal substring fallback
Core tools: terminal, read_file, write_file, and others
Configuration modes: auto, on, off

This addresses a bottleneck where multiple MCP servers consume significant reasoning capacity. By deferring schemas, Hermes Agent mirrors modular patterns seen in Agent Skills and Appwrite. This keeps the agent efficient with large toolsets, following its recent integration of Qwen 3.7 Max.

Users can enable the feature via hermes update. It triggers automatically when tool definitions exceed 10% of the context window. While the first use of a deferred tool adds a round trip to load the schema, the system caches results to maintain speed on subsequent turns.

View the full update on hermes-agent.nousresearch.com

Nous Research

@NousResearchMay 29

Hermes Agent now has Tool Search, so your agent only loads what it needs https://t.co/CaWrKo5mxp

2643.4k

View on X

Still wondering? A few quick answers below.

It is a progressive-disclosure feature that manages how AI agents load external tool definitions. Instead of filling the context window with every available tool schema at once, the system uses three bridge tools to search for and load specific technical definitions only when the model determines they are necessary for a task.

By deferring the loading of complex JSON schemas for MCP and plugin tools, the feature preserves more of the model's context window for reasoning and conversation history. This prevents context bloat, which often degrades an agent's ability to follow instructions or maintain accuracy as the available toolset grows larger.

The feature uses an automatic mode that triggers whenever deferrable tool schemas would consume at least 10% of the active model's context window. This threshold ensures that the system only incurs the minor latency of on-demand loading when the token savings are significant enough to justify the extra round trip.

Only external MCP servers and non-core plugin tools are eligible for deferral. Core Hermes Agent capabilities, such as terminal access, file operations, memory management, and web searching, are always loaded directly into the context window to ensure the agent's fundamental skills remain immediately available without any additional latency.

Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →

See all AI news & updates from Nous Research →

Keep reading

Nous Research releases Hermes Agent v0.15.0 with 4500x faster session search

Nous Research launched Hermes Agent v0.15.0, a major architectural refactor that reduces core code by 76% and eliminates LLM costs for session history searches. The update shifts the open-source platform toward high-speed multi-agent orchestration with native swarm support and hardened security.

What is Hermes Agent Tool Search?

How does Tool Search improve agent performance?

When does Tool Search activate automatically?

Which tools are affected by Tool Search?

Keep reading

Nous Research releases Hermes Agent v0.15.0 with 4500x faster session search

Nous Research releases Hermes Agent v0.15.0 with 4500x faster session search

Keep reading

Nous Research releases Hermes Agent v0.15.0 with 4500x faster session search

Nous Research releases Hermes Agent v0.15.0 with 4500x faster session search