Say hello to Gemini 3.1 Flash Live. 🗣️ Our latest audio model delivers more natural conversations with improved function calling – making it more useful and informed. Here’s what’s new 🧵
Google Launches Gemini 3.1 Flash Live for Natural Real Time Voice Agents
· Updated
Google DeepMind introduced Gemini 3.1 Flash Live, a specialized model for real-time audio reasoning. It features a 90.8% score on the
ComplexFuncBench Audio benchmark, indicating a major leap in multi-step function calling. The model also includes improved tonal understanding to detect pitch, pace, and user frustration during live interactions.Most voice interfaces struggle with interruptions, background noise, and long-term context. This update doubles the conversation thread length, allowing the AI to maintain a train of thought during extended brainstorms. By processing audio natively rather than converting to text first, it reduces latency and captures acoustic nuances that text-only models miss.
You can access the model in preview via the Gemini Live API in Google AI Studio to build low-latency voice agents. It is also integrated into Gemini Enterprise for Customer Experience for automated support. All generated audio is protected by SynthID watermarking to help identify AI-generated content across 200 countries.
Google DeepMind
@GoogleDeepMind
183retweets
View on X



