Google Launches Gemini Robotics-ER 1.6 With Advanced Spatial Reasoning and Instrument Reading

Google DeepMind

Apr 15, 2026 · Updated Jun 5, 2026

Google released Gemini Robotics-ER 1.6, a specialized model that acts as a high-level reasoning brain for physical robots. It introduces agentic vision for multi-step tasks like reading analog gauges and improves multi-view camera understanding for complex industrial environments.

Google launched Gemini Robotics-ER 1.6, a major upgrade to its embodied reasoning model. This version introduces instrument reading, allowing robots to interpret analog pressure gauges, digital readouts, and liquid levels. It also features agentic vision, which enables the model to zoom into images or execute code to calculate precise measurements.

This update bridges the gap between digital intelligence and physical autonomy in dynamic settings. By improving multi-view reasoning, robots can now synthesize data from multiple cameras—like overhead and wrist-mounted feeds—to detect task success even when objects are partially hidden. This shift moves robotics from simple instruction-following to independent physical problem-solving.

You can access gemini-robotics-er-1.6-preview today through the Gemini API and Google AI Studio. The release includes a developer Colab notebook with examples for configuring spatial prompts and success detection. For industrial users, the model is already being integrated into Boston Dynamics robots for autonomous facility inspections.

View the full update on deepmind.google

Google DeepMind

@GoogleDeepMindApr 14

We’re rolling out an upgrade designed to help robots reason about the physical world. 🤖 Gemini Robotics-ER 1.6 has significantly better visual and spatial understanding in order to plan and complete more useful tasks. Here’s why this is important 🧵

3071.9k

View on X

Every HeadsUpAI update is written based on its original source and reviewed before it's published. Read our editorial standards →

See all AI news & updates from Google →

Keep reading

Google DeepMind powers Boston Dynamics Spot with embodied reasoning for autonomous tasks

Google DeepMind integrated its Gemini Robotics ER 1.6 model into Boston Dynamics' Spot robot to enable embodied reasoning. This allows the robot to interpret natural language commands and navigate dynamic environments without manual programming. By bridging high-level AI reasoning with physical hardware, Spot can now autonomously orchestrate basic skills to complete complex goals.

Google Research Benchmarks Gemini's 3D Object Generation Through Code

GoogleJun 7

Google Research Benchmarks Gemini's 3D Object Generation Through Code

Google Research introduced 3DCodeBench, a new benchmark evaluating AI models' ability to generate 3D objects using code. This benchmark, presented at CVPR2026, demonstrates how agentic AI can autonomously create complex 3D assets, highlighting the role of iterative refinement in improving model performance.

Google Brings Gemini 3.5 Flash to Everyone for Free Visual Research

GeminiMay 21

Google Brings Gemini 3.5 Flash to Everyone for Free Visual Research

Google is rolling out Gemini 3.5 Flash globally to all users for free via the web and mobile app. The update shifts the high-speed model from a developer tool to a consumer assistant capable of analyzing complex diagrams and math papers. This move democratizes frontier-level multimodal reasoning for everyday research and document exploration.

NVIDIAApr 1

NVIDIA launches CaP-X to turn frontier language models into autonomous robot controllers

NVIDIA and academic partners released CaP-X, an open-source framework that allows large language models to control robots by writing and executing code. It proves that off-the-shelf models like Gemini 3 Pro can perform complex physical tasks without specific robotics training. This shifts the robotics paradigm from specialized end-to-end models to general-purpose agentic reasoning.