NVIDIA Releases VSS Agent Skills to Automate Industrial Video Analytics

NVIDIA

May 30, 2026 · Updated Jun 12, 2026

NVIDIA released agent skills and a modular architecture for its Metropolis Blueprint for Video Search and Summarization (VSS), allowing coding agents to autonomously deploy video analytics stacks. This shift replaces manual microservice configuration with a chat-based interface for searching hours of footage using natural language.

NVIDIA updated its Metropolis Blueprint for Video Search and Summarization (VSS) with a modular architecture and new agent skills. These skills follow the agentskills.io specification, allowing coding agents to install the stack themselves. The update builds on the Metropolis VSS 3 Blueprint launch and removes the manual microservice configuration that was previously required.

Max concurrent streams (H100): 33
Max concurrent streams (RTX PRO 6000): 51
Ingestion latency (H100): 0.079 seconds
Ingestion latency (RTX PRO 6000): 0.101 seconds
Retrieval latency (H100): 2.24 seconds

A new profile system lets developers layer workflows such as real-time alerts onto a base agent. The search workflow is powered by fusion search, which decomposes complex natural language queries into sub-queries. By searching across multiple embedding types, the system improves precision when finding specific events in massive video archives.

You can deploy the VSS Search profile by sending chat prompts to an agent such as Codex. The system supports H100 and RTX PRO 6000 GPUs, with ingestion latencies of 0.079 seconds and 0.101 seconds respectively. Together with a new feature in the NVIDIA AI-Q Agent Skill, these capabilities expand the set of portable tools available to autonomous coding agents.

View the full update on developer.nvidia.com

NVIDIA AI

@NVIDIAAI · May 29

Hours of video, now searchable by your agent. We just released a new set of agent skills and modular architecture for the Metropolis Blueprint for Video Search and Summarization, eliminating the need for manual configuration of multiple microservices. Load the skills into a compatible coding agent and it deploys the stack, turning hours of footage into searchable, actionable intelligence through a chat interface. Ask in plain language and get back clips, summaries, and answers.

Still wondering? A few quick answers below.

What are NVIDIA VSS agent skills?

VSS agent skills are portable capabilities for the NVIDIA Metropolis Blueprint for Video Search and Summarization that follow the agentskills.io specification. These skills teach autonomous coding agents like Codex or OpenClaw how to deploy, configure, and operate the VSS microservice stack through a natural language chat interface instead of manual setup.
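As a rough illustration of what "following a skill specification" can look like in practice, the sketch below parses a skill file. It assumes the common layout of a SKILL.md file that opens with a frontmatter block carrying `name` and `description` fields; the skill name, description, and steps shown here are invented for illustration and are not taken from NVIDIA's repository. Check the specification itself for the exact field rules.

```python
# Hypothetical skill file. The name, description, and steps are made up
# for this example and do not come from the NVIDIA VSS repository.
EXAMPLE_SKILL_MD = """\
---
name: vss-search-deploy
description: Hypothetical example. Deploys a video search stack and checks that it is healthy.
---

# Deploy steps

1. Check GPU and driver prerequisites.
2. Start the containers.
"""


def parse_skill(text: str) -> tuple[dict[str, str], str]:
    """Split a SKILL.md-style document into frontmatter fields and a body.

    Only flat "key: value" frontmatter lines are handled; a real loader
    would use a full YAML parser.
    """
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        raise ValueError("missing opening '---' frontmatter delimiter")

    # Find the line that closes the frontmatter block.
    end = None
    for i in range(1, len(lines)):
        if lines[i].strip() == "---":
            end = i
            break
    if end is None:
        raise ValueError("missing closing '---' frontmatter delimiter")

    fields: dict[str, str] = {}
    for line in lines[1:end]:
        if not line.strip():
            continue
        key, sep, value = line.partition(":")
        if not sep:
            raise ValueError(f"malformed frontmatter line: {line!r}")
        fields[key.strip()] = value.strip()

    # Assumption: 'name' and 'description' are the required fields.
    for required in ("name", "description"):
        if not fields.get(required):
            raise ValueError(f"missing required field: {required}")

    body = "\n".join(lines[end + 1:]).strip()
    return fields, body


if __name__ == "__main__":
    fields, body = parse_skill(EXAMPLE_SKILL_MD)
    print(fields["name"])
    print(body.splitlines()[0])
```

An agent that loads a skill would typically read the description to decide when the skill applies, then follow the instructions in the body.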

How does NVIDIA VSS search hours of video footage?

The system uses a modular architecture and vision-language models to perform agentic search. It decomposes complex natural language queries into sub-queries and uses a fusion search capability to scan multiple embedding types. This process allows the AI to locate specific objects, actions, or safety events across massive volumes of live or recorded video data.
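The decompose-then-fuse idea can be sketched in a few lines. This is not NVIDIA's implementation: the article does not say how results are combined, so the example swaps in reciprocal rank fusion, a generic ranking-fusion technique. The toy indexes, the clip IDs, and the `decompose` rule are all hypothetical stand-ins for an LLM-driven decomposer and real vector search.

```python
from collections import defaultdict

# Hypothetical toy indexes: for each embedding type, a sub-query maps to a
# ranked list of clip IDs. A real system would run vector similarity search.
TOY_INDEXES = {
    "object": {
        "forklift": ["clip_07", "clip_02", "clip_11"],
        "person without helmet": ["clip_02", "clip_05"],
    },
    "action": {
        "forklift": ["clip_02", "clip_07"],
        "person without helmet": ["clip_05", "clip_02", "clip_09"],
    },
}


def decompose(query: str) -> list[str]:
    """Naive stand-in for query decomposition: split on the word 'near'."""
    return [part.strip() for part in query.split(" near ") if part.strip()]


def reciprocal_rank_fusion(rankings: list[list[str]], k: int = 60) -> list[tuple[str, float]]:
    """Combine ranked lists: each appearance adds 1 / (k + rank) to a clip's score."""
    scores: dict[str, float] = defaultdict(float)
    for ranking in rankings:
        for rank, clip_id in enumerate(ranking, start=1):
            scores[clip_id] += 1.0 / (k + rank)
    # Highest score first; ties are broken by clip ID for a stable order.
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


def fusion_search(query: str) -> list[str]:
    """Run every sub-query against every embedding index, then fuse the rankings."""
    rankings = []
    for sub_query in decompose(query):
        for index in TOY_INDEXES.values():
            rankings.append(index.get(sub_query, []))
    return [clip_id for clip_id, _ in reciprocal_rank_fusion(rankings)]


if __name__ == "__main__":
    print(fusion_search("forklift near person without helmet"))
```

In this toy data, `clip_02` ranks first because it appears near the top of every list, which is the intuition behind fusing across embedding types: a clip that matches several sub-queries in several indexes outranks one that matches a single signal strongly.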

Which GPUs are compatible with NVIDIA VSS 3?

NVIDIA VSS is supported on several hardware configurations to meet different performance needs. Benchmarks provided by NVIDIA show the agentic search and alert verification workflows running on H100 and RTX PRO 6000 GPUs. The system is also compatible with DGX Spark and AGX Thor setups for specific tasks like alert verification and video summarization.

What is the performance of NVIDIA VSS on an H100 GPU?

On a single H100 GPU, the agentic search workflow can handle up to 33 concurrent input streams with an ingestion latency of 0.079 seconds. When a user performs a search, the retrieval latency to receive a result is approximately 2.24 seconds. These metrics vary depending on the specific developer profile and hardware topology used.
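For rough capacity planning, the per-GPU stream limits can be turned into a GPU count. The sketch below uses the two stream figures reported above and assumes that capacity scales linearly across GPUs, which the published benchmarks do not claim; treat the result as a lower bound, not a sizing guarantee. The function name `gpus_needed` is made up for this example.

```python
import math

# Maximum concurrent streams per GPU for the agentic search workflow,
# as reported in the article.
MAX_STREAMS_PER_GPU = {"H100": 33, "RTX PRO 6000": 51}


def gpus_needed(camera_streams: int, gpu: str) -> int:
    """Estimate GPUs required, assuming linear scaling (an assumption)."""
    if camera_streams < 0:
        raise ValueError("camera_streams must be non-negative")
    return math.ceil(camera_streams / MAX_STREAMS_PER_GPU[gpu])


if __name__ == "__main__":
    for gpu in MAX_STREAMS_PER_GPU:
        print(gpu, gpus_needed(200, gpu))
```

For 200 camera streams this gives 7 H100 GPUs (200 / 33 rounds up to 7) or 4 RTX PRO 6000 GPUs (200 / 51 rounds up to 4).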

How do developers get started with NVIDIA VSS skills?

Developers can access VSS skills through the NVIDIA VSS GitHub repository. To use them, you need a system prepared to run VSS, such as an NVIDIA Brev Launchable instance, and a compatible coding agent like Codex or Claude Code. Once the skills are loaded, the agent can autonomously manage the deployment of containers and environment variables.
