Benchmarks should reflect real-world performance. That’s why we’re excited to share that Nemotron 3 Super has topped the open source category on the EnterpriseOps-Gym leaderboard. This agentic gauntlet evaluates performance across 1,150 tasks in fully interactive environments with 512 functional tools, requiring agents to coordinate across multiple enterprise systems and tools to complete a single workflow. 📊 https://t.co/wt54NRNgeK
NVIDIA Nemotron 3 Super Tops Open Source Leaderboard for Enterprise Agents
NVIDIA· Updated
NVIDIA's Nemotron 3 Super model claimed the #1 spot in the open-source category on the EnterpriseOps-Gym leaderboard, a benchmark for autonomous agents. The result validates the model's ability to coordinate across hundreds of tools and enterprise systems to complete complex, multi-step workflows.
- Tasks evaluated
- 1,150
- Functional tools
- 512
- Total parameters
- 120 billion
- Active parameters
- 12 billion
- Availability
- Open weights
The shift to autonomous agents requires models that handle long-horizon reasoning and tool-use. By outperforming other open-weight models, Nemotron 3 Super proves it can manage the "plumbing" of enterprise work. It extends the multimodal capabilities of Nemotron 3 Nano Omni and adds to the OpenShell security runtime.
Use Nemotron 3 Super as a high-capacity engine for production-grade enterprise automation, following the release of NVIDIA's supply chain agentic workflow. The model is a 120-billion-parameter Mixture-of-Experts designed for high-throughput inference. It is available as an open-weight model for local deployment or cloud-based agentic sessions.
Still wondering? A few quick answers below.
Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →





