NVIDIA Launches Physical AI Data Factory Blueprint for Robotics Training Data

NVIDIANVIDIA

· Updated

NVIDIA released the Physical AI Data Factory Blueprint at GTC, an open reference architecture that automates synthetic training data generation for robotics, autonomous vehicles, and vision AI agents. Cosmos foundation models expand limited datasets into diverse, edge-case-rich training sets.

NVIDIA released the Physical AI Data Factory Blueprint, an open reference architecture for generating synthetic training data for robotics, autonomous vehicles, and vision AI. The pipeline runs three Cosmos models in sequence: Cosmos Curator annotates real-world datasets, Cosmos Transfer expands them with synthetic edge-case variations, and Cosmos Evaluator scores output for physical accuracy.

Microsoft Azure and Nebius are integrating the blueprint into their cloud infrastructure, while FieldAI, Hexagon Robotics, Skild AI, Uber, and Teradyne Robotics are applying it across robotics, autonomous driving, and vision AI. OSMO, NVIDIA's open-source orchestration framework, manages these workflows and integrates with coding agents like Claude Code and Cursor.

The blueprint is built for teams where real-world data is too sparse or not diverse enough to train reliable physical AI models. With OSMO providing orchestration and Azure and Nebius integrations already shipping, you can connect it to your existing cloud training stack.

NVIDIA Newsroom
NVIDIA Newsroom
@nvidianewsroom
X

#NVIDIAGTC news: NVIDIA announces the new NVIDIA Physical AI Data Factory Blueprint turns accelerated compute into high-quality training data for robotics, vision AI agents, and autonomous vehicles. ➡️ https://t.co/vyKQijztgU https://t.co/IUUiRWS15g

9retweets
View on X

Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →

Share this update