Advancing voice intelligence with new models in the API
Summary
OpenAI has released three new audio models for developers: GPT-Realtime-2 (a voice model with advanced reasoning capabilities), GPT-Realtime-Translate (live translation across 70+ languages), and GPT-Realtime-Whisper (streaming speech-to-text). These models enable voice applications that can understand context, reason through requests, use tools, and take action during conversations, moving beyond simple back-and-forth responses to support real-world tasks like booking travel or providing customer support.
Classification
Affected Vendors
Related Issues
Original source: https://openai.com/index/advancing-voice-intelligence-with-new-models-in-the-api
First tracked: May 7, 2026 at 02:00 PM
Classified by LLM (prompt v3) · confidence: 95%