News

AI text-to-speech programs could “unlearn” how to imitate certain people New research shows models can be directly edited to hide selected voices, even when users specifically ask for them.
Podcast recording and editing platform Podcastle is now joining other companies in the AI-powered, text-to-speech race by releasing its own AI model called Asyncflow v1.0. An API for developers ...
Hume claims Octave is the first text-to-speech system powered by a large language model (LLM) trained not only on text but on speech and emotion tokens, enabling it to understand words in context ...
OpenAI’s new voice AI model gpt-4o-transcribe lets you add speech to your existing text apps in seconds ...
OpenAI unveils cutting-edge speech-to-text audio AI models API to help developers build accurate, reliable, and engaging voice-driven apps ...
OpenAI’s latest speech-to-text models, such as GPT-4 Transcribe and GPT-4 Mini Transcribe, deliver significant improvements in transcription accuracy and processing speed.
Text-to-speech with feeling - this new AI model does everything but shed a tear ElevenLabs' 'most expressive' v3 model can speak with a huge range of emotions in more than 70 languages.
This new text-to-speech AI model understands what it's saying - how to try it for free I tested Hume's new Octave model and was impressed with the results. Now you can try it, too.
When you purchase through links on our site, we may earn an affiliate commission. Here’s how it works. PlayAI (formerly PlayHT) is a text-to-speech platform and voice generator that uses ...
Google is enhancing Gemini's text-to-speech (TTS). On Tuesday at Google I/O 2025, the company previewed a new TTS feature, built on native audio output, that can "converse in more expressive ways ...