From unboxing to AI agent in minutes. Getting an agent running used to mean sourcing a model, configuring an inference backend, installing a runtime, and wiring it all together. The new NemoClaw install path on DGX Spark replaces that with a single command. DGX Spark also simplifies the path to local, long-running AI agents, cutting out external cloud dependencies and providing predictable on-premise compute.
NVIDIA simplifies local AI agent setup on DGX Spark with NemoClaw
nemoclaw.sh command now automates the setup of the OpenShell secure runtime, the OpenClaw agent harness, and local inference backends. This update targets the growing demand for autonomous agents that operate without external cloud dependencies.- Installer
- nemoclaw.sh
- Performance
- 2.6x throughput for Qwen3.6-35B
- Max Cluster Memory
- 512GB across 4 nodes
- Networking
- 200 Gbps RoCE via ConnectX-7
- Default Model
- Qwen3.6-35B
Performance optimizations for Qwen3.6-35B using NVFP4 quantization deliver a 2.6x throughput increase on DGX Spark. This allows agents to manage large context windows (the amount of data a model can process at once) more efficiently on local hardware. By shifting workloads to on-premise compute, teams can eliminate per-token API costs and keep sensitive data within their own infrastructure.
For scaling beyond a single device, the NVIDIA Sync cluster assistant now automates multi-node networking via ConnectX-7. This tool enables users to link up to four DGX Spark units into a high-bandwidth cluster with 512GB of unified memory. This configuration supports massive models and multi-agent pipelines that require significant distributed memory for local execution.
Still wondering? A few quick answers below.
Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →





