Introducing GPT-Realtime-2 in the API: our most intelligent voice model yet, bringing GPT-5-class reasoning to voice agents. Voice agents are now real-time collaborators that can listen, reason, and solve complex problems as conversations unfold. Now available in the API alongside streaming models GPT-Realtime-Translate and GPT-Realtime-Whisper — a new set of audio capabilities for the next generation of voice interfaces.
OpenAI Launches GPT-Realtime-2 to Bring GPT-5 Reasoning to Voice Agents
- GPT-Realtime-2 capability: GPT-5-class reasoning
- Translation support: 70+ input and 13 output languages
- Transcription model: GPT-Realtime-Whisper
- API parameter: reasoning.effort
- Availability: Realtime API
This release connects the GPT-5.5 reasoning models with OpenAI's WebRTC infrastructure updates. With reasoning moved into the audio modality, voice agents can now handle natural conversational interruptions and work through multi-step problems as a conversation unfolds. The result is a shift from reactive chatbots to collaborative agents capable of complex real-time logic.
You can access these models through the Realtime API to build voice-first applications like live interpreters or technical support agents. Developers can use the reasoning.effort parameter to balance intelligence with latency requirements. While these capabilities are live in the API, OpenAI noted that they are not yet available within the consumer ChatGPT application.
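To make the intelligence-versus-latency trade-off concrete, here is a minimal Python sketch of how a client might package a reasoning.effort setting before sending it over a Realtime API connection. Several details are assumptions rather than confirmed API behavior: the effort values ("low", "medium", "high"), the placement of a nested reasoning object inside a session.update event, and the helper name build_session_update are all illustrative. The sketch only builds the payload and opens no connection.

```python
import json

# Assumed effort levels; the article names the parameter but not its values.
ASSUMED_EFFORT_LEVELS = ("low", "medium", "high")


def build_session_update(effort: str) -> dict:
    """Build a hypothetical session.update payload carrying reasoning.effort."""
    if effort not in ASSUMED_EFFORT_LEVELS:
        raise ValueError(f"unsupported effort level: {effort!r}")
    return {
        "type": "session.update",
        "session": {
            # Assumed placement: a nested "reasoning" object on the session.
            "reasoning": {"effort": effort},
        },
    }


if __name__ == "__main__":
    # A latency-sensitive agent might pick the lowest assumed level.
    print(json.dumps(build_session_update("low"), indent=2))
```

Under these assumptions, a live interpreter might favor a lower effort level to keep responses fast, while a technical support agent working through a multi-step diagnosis might accept more latency for a higher one.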