OpenAI introduces new speech-to-speech AI model

FILE PHOTO: OpenAI announced its “most capable” speech-to-speech AI model, gpt-realtime. | Photo Credit: AP

OpenAI on Thursday (August 28, 2025) announced its “most capable” speech-to-speech AI model, gpt-realtime. The company says the model sounds more natural and expressive and is better at following complex instructions.

“It’s better at interpreting system messages and developer prompts—whether that’s reading disclaimer scripts word-for-word on a support call, repeating back alphanumerics, or switching seamlessly between languages mid-sentence,” per the company blog.

Besides switching languages, the model can also change its tone in the middle of a sentence.

The gpt-realtime model can also capture non-verbal cues such as laughs and detect numbers in languages including Spanish, Chinese, Japanese and French.

“We trained the model in close collaboration with customers to excel at real-world tasks like customer support, personal assistance, and education—aligning the model to how developers build and deploy voice agents,” the blog stated.

The model is offered through the Realtime API, which OpenAI has also moved to general availability.

OpenAI has also released two new voices, Cedar and Marin, which can be accessed via the API.

Published – August 29, 2025 02:07 pm IST
