Introducing GPT-Realtime-2 in the API: our most intelligent voice model yet, bringing GPT-5-class reasoning to voice agents.
Voice agents are now real-time collaborators that can listen, reason, and solve complex problems as conversations unfold.
Now available in the API alongside streaming models GPT-Realtime-Translate and GPT-Realtime-Whisper β a new set of audio capabilities for the next generation of voice interfaces.

VIDEO
Our new voice models are now available in the Realtime API:
ποΈ GPT-Realtime-2: Build production-ready voice agents that can think harder, take action, handle interruptions, and keep conversations flowing.
ποΈ GPT-Realtime-Translate: Translate while streaming across more than 70 input and 13 output languages, breaking down language barriers and helping people communicate more naturally.
ποΈ GPT-Realtime-Whisper: Transcribe streaming audio as words are spoken to generate captions and notes in real time.
openai.com/index/advancinβ¦
ποΈ GPT-Realtime-2: Build production-ready voice agents that can think harder, take action, handle interruptions, and keep conversations flowing.
ποΈ GPT-Realtime-Translate: Translate while streaming across more than 70 input and 13 output languages, breaking down language barriers and helping people communicate more naturally.
ποΈ GPT-Realtime-Whisper: Transcribe streaming audio as words are spoken to generate captions and notes in real time.
openai.com/index/advancinβ¦
We know youβre eager for voice updates in ChatGPT. Stay tuned, weβre cooking.
Generated by Thread Navigator
Press β + S to quick-export
