HeadsUpAI

Google Launches Gemini Robotics-ER 1.6 With Advanced Spatial Reasoning and Instrument Reading

Google launched Gemini Robotics-ER 1.6, a major upgrade to its embodied reasoning model. This version introduces instrument reading, allowing robots to interpret analog pressure gauges, digital readouts, and liquid levels. It also features agentic vision, which enables the model to zoom into images or execute code to calculate precise measurements.
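To make "execute code to calculate precise measurements" concrete, here is a minimal sketch of the kind of arithmetic involved in reading an analog gauge: mapping a needle angle onto the gauge scale by linear interpolation. This illustrates the idea only and is not Google's method; the function name, the angles, and the 0 to 10 bar scale are all hypothetical.

```python
def gauge_reading(needle_deg, min_deg, max_deg, min_value, max_value):
    """Linearly map a needle angle onto a gauge scale.

    Assumes a linear scale whose endpoints sit at min_deg and max_deg.
    All names and values here are illustrative assumptions.
    """
    if max_deg == min_deg:
        raise ValueError("scale endpoints must differ")
    # Fraction of the sweep the needle has covered (0.0 at min, 1.0 at max).
    fraction = (needle_deg - min_deg) / (max_deg - min_deg)
    return min_value + fraction * (max_value - min_value)


# Hypothetical gauge: 0 bar at 45 degrees, 10 bar at 315 degrees.
reading = gauge_reading(
    needle_deg=180.0, min_deg=45.0, max_deg=315.0, min_value=0.0, max_value=10.0
)
print(f"{reading:.2f} bar")  # prints "5.00 bar"
```

In practice the hard part is the vision step that estimates the needle angle and scale endpoints from the image; the interpolation itself is simple once those are known.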

The update is aimed at narrowing the gap between digital reasoning and physical autonomy in dynamic settings. With improved multi-view reasoning, robots can combine feeds from multiple cameras, such as overhead and wrist-mounted views, to detect whether a task succeeded even when objects are partially hidden. The goal is to move robotics from simple instruction-following toward more independent physical problem-solving.

You can access gemini-robotics-er-1.6-preview today through the Gemini API and Google AI Studio. The release includes a developer Colab notebook with examples for configuring spatial prompts and success detection. For industrial users, the model is already being integrated into Boston Dynamics robots for autonomous facility inspections.
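As a rough sketch of what a call through the Gemini API could look like, the snippet below builds (but does not send) a REST `generateContent` request that pairs one camera image with a text prompt. The model ID is the one named above; the helper name `build_request`, the example prompt, and the placeholder image bytes are assumptions for illustration, and the official documentation and Colab notebook remain the authority on the exact request shape.

```python
import base64
import json
import urllib.request

MODEL = "gemini-robotics-er-1.6-preview"  # model ID as given in the article
API_ROOT = "https://generativelanguage.googleapis.com/v1beta"


def build_request(image_bytes, prompt, api_key, mime_type="image/jpeg"):
    """Build a generateContent request with one inline image and one prompt.

    Hypothetical helper: it only constructs the request, it does not send it.
    """
    body = {
        "contents": [
            {
                "parts": [
                    {
                        "inline_data": {
                            "mime_type": mime_type,
                            # Inline images are sent as base64 text.
                            "data": base64.b64encode(image_bytes).decode("ascii"),
                        }
                    },
                    {"text": prompt},
                ]
            }
        ]
    }
    return urllib.request.Request(
        f"{API_ROOT}/models/{MODEL}:generateContent",
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json", "x-goog-api-key": api_key},
        method="POST",
    )


# Placeholder bytes and key; a real call would read a camera frame from disk.
request = build_request(
    b"placeholder-image-bytes",
    "Read the pressure gauge in this image and report the value with units.",
    api_key="YOUR_API_KEY",
)
print(request.full_url)
```

Passing the resulting object to `urllib.request.urlopen` with a valid key would send the request; that step is left out here so the sketch runs without network access.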

Google DeepMind (@GoogleDeepMind) on X:

We’re rolling out an upgrade designed to help robots reason about the physical world. 🤖 Gemini Robotics-ER 1.6 has significantly better visual and spatial understanding in order to plan and complete more useful tasks. Here’s why this is important 🧵
