AI co-clinician is our new research initiative to help explore how multimodal agents could better support healthcare workers and patients. 🩺 Here’s a snapshot of our progress 🧵
Google Previews AI co-clinician Agents With Real-Time Multimodal Senses
Google DeepMind · Updated
Google announced the AI co-clinician research initiative, a system of multimodal agents designed to assist doctors and patients through real-time audio and video. By moving beyond text-based chat to eyes, ears, and a voice, the system can guide physical exams and reason about medications.
- Clinical accuracy (primary care): 97 of 98 queries with zero critical errors
- Performance vs. PCPs: matched or exceeded in 68 of 140 areas
- Architecture: dual-agent (Planner and Talker)
- Core benchmarks: RxQA and NOHARM
- Availability: research initiative and trusted tester program
- Primary models: Gemini and Project Astra
The initiative aims to address healthcare worker shortages through a triadic care model in which the AI acts as a supervised teammate alongside the clinician and the patient. In its dual-agent architecture, a Planner monitors a Talker agent to keep interactions within clinical boundaries. The AI matched or exceeded primary care physicians (PCPs) in 68 of 140 assessed clinical areas.
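The Planner and Talker split can be pictured with a small sketch. This is only an illustration under stated assumptions, not Google's implementation: the class names mirror the agent names in the announcement, but the methods `draft` and `review`, the `BLOCKED_PHRASES` rule list, and the `FALLBACK` message are all hypothetical, and a real system would rely on models rather than string matching.

```python
from dataclasses import dataclass

# Hypothetical boundary rules; the announcement does not describe the real ones.
BLOCKED_PHRASES = ("increase your dose", "stop taking")

# Hypothetical deferral message used when a draft is rejected.
FALLBACK = "I'd like your clinician to weigh in on that."


@dataclass
class Review:
    approved: bool
    reason: str


class Talker:
    """Stand-in for the conversational agent; returns a canned draft reply."""

    def draft(self, utterance: str) -> str:
        if "dose" in utterance.lower():
            return "You should increase your dose."
        return "Can you tell me more about your symptoms?"


class Planner:
    """Stand-in for the supervising agent; checks each draft before it is spoken."""

    def review(self, draft: str) -> Review:
        lowered = draft.lower()
        for phrase in BLOCKED_PHRASES:
            if phrase in lowered:
                return Review(False, f"draft contains '{phrase}'")
        return Review(True, "within boundaries")


def respond(talker: Talker, planner: Planner, utterance: str) -> str:
    """Return the Talker's draft only if the Planner approves it."""
    draft = talker.draft(utterance)
    review = planner.review(draft)
    return draft if review.approved else FALLBACK
```

In this sketch, `respond(Talker(), Planner(), "Should I change my dose?")` returns the deferral message because the Planner rejects the draft, while a question about a cough passes through unchanged. The point of the design is that the agent that speaks never has the final say on what is said.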
The system is currently a research project and is not intended to provide medical advice. Google is expanding its trusted tester program to sites in the US, India, and Singapore as the lab refines the system's ability to navigate complex medication reasoning.

