ニュース

ElevenLabs, the highly-valued AI voice cloning and generation startup from former Palantir alumni, today launched Scribe v1, a new speech-to-text model that reportedly achieves the highest ...
It's not just what is said but how it's articulated that shapes the meaning of human communication, and people use intonation ...
Researchers at Amazon have trained the largest ever text-to-speech model yet, which they claim exhibits “emergent” qualities improving its ability to speak even complex sentences naturally ...
Hume claims Octave is the first text-to-speech system powered by a large language model (LLM) trained not only on text but on speech and emotion tokens, enabling it to understand words in context ...
Podcast recording and editing platform Podcastle is now joining other companies in the AI-powered, text-to-speech race by releasing its own AI model called Asyncflow v1.0. An API for developers ...
CapCut PC combines text-to-speech AI with a drag-and-drop editor to make it easy to create videos for beginners. You no longer need to use microphones and professional editing capabilities to put ...