I promise this will be the best 20 min you spend today! Robotics: Endgame, the sequel to my last year's Sequoia AI Ascent talk, "Physical Turing Test". I laid out the roadmap for solving Physical AGI as a simple parallel to the LLM success story. Be a good scientist, copy homework ;) And stay till the end, more easter eggs and predictions for your polymarket! 00:30 DGX-1 origin story at OpenAI, I was there in 2016 signing with Jensen and Elon. Heading to the Computer History Museum! 01:42 The Great Parallel 03:31 Robotics, the Endgame 03:39 Why VLAs fall short 04:32 Video world models as the 2nd pretraining paradigm 06:09 World Action Models (WAM) 07:46 Strategies for robot data collection and the FSD equivalent to physical data flywheel for robot manipulation 11:06 EgoScale and the Dexterity Scaling Law we discovered recently 14:00 Physical RL: bridging the last mile 15:39 DreamDojo: an end-to-end neural physics engine for scaling RL in silico 17:00 Civilizational Technology Tree and my predictions for the near future. Spoiler: it's closer than you think. Thanks to my friends at Sequoia for inviting me back to AI Ascent this year! I had a blast! Last year's talk is attached in the thread if you missed it.
NVIDIA Outlines Technical Roadmap for Scaling Robot Dexterity and Physical AGI
· Updated
- Talk title
- Robotics: Endgame
- Speaker
- Jim Fan (NVIDIA)
- Venue
- Sequoia AI Ascent
- Length
- ~20 minutes
- Key concepts introduced
- World Action Models, EgoScale, Dexterity Scaling Law, DreamDojo, Physical RL
The roadmap walks through several pieces in order: why current Vision-Language-Action models (VLAs) fall short, video world models as a second pretraining paradigm, World Action Models (WAM), strategies for robot data collection and an FSD-style physical data flywheel for robot manipulation, EgoScale and a newly discovered Dexterity Scaling Law, Physical RL as the last-mile step, and DreamDojo — an end-to-end neural physics engine for scaling reinforcement learning in silico.
You can watch the full talk on YouTube via the link in Fan's announcement tweet. The chapter markers in the tweet map directly to the segments above, so viewers can jump straight to a specific argument — DreamDojo at 15:39, World Action Models at 06:09, or the Civilizational Technology Tree predictions near the end of the talk.
Still wondering? A few quick answers below.


