NVIDIA Nemotron 3 Nano Omni is now available on Amazon SageMaker JumpStart. This multimodal model supports video, audio, image, and text, enabling enterprise Q&A, summarization, transcription, OCR, and document intelligence. With @nvidia Nemotron 3 Nano Omni, organizations can streamline end-to-end processing of meetings, training videos, and documents. https://t.co/XgVkOg6B8x
AWS Launches NVIDIA Nemotron 3 Nano Omni for Unified Multimodal Agents
· Updated
- Total parameters
- 30B
- Active parameters
- 3B (MoE)
- Context window
- 131K tokens
- Input types
- Video, Audio, Image, Text
- Precision
- FP8
- License
- NVIDIA Open Model Agreement
This release addresses the "perception bottleneck" in agentic workflows. Most systems rely on separate models for transcription and vision, increasing latency. This update builds on AWS agent orchestration workflows by converging these modalities into one reasoning loop, maintaining a unified context across complex tasks like computer use or real-time video analysis.
You can deploy the model immediately through SageMaker JumpStart, following a pattern seen in OpenRouter's integration. It supports a 131K context window and tool calling, making it a viable backbone for enterprise agentic workflows. The model is licensed under the NVIDIA Open Model Agreement for commercial use.
Still wondering? A few quick answers below.





