News

Discover the key differences between Moshi and Whisper speech-to-text models. Speed, accuracy, and use cases explained for your next project.
Key features, accuracy, and usability factors to consider when selecting the right speech-to-text converter for your needs ...
The company took a step in another technological direction by launching its first stand-alone speech-to-text model called Scribe.
ElevenLabs, the highly-valued AI voice cloning and generation startup from former Palantir alumni, today launched Scribe v1, a new speech-to-text model that reportedly achieves the highest ...
Meta says that it's the biggest open-source multimodal dataset, containing 270,000 hours' worth of mined speech and text alignment on which its AI was trained.
Meta’s “massively multilingual” AI model translates up to 100 languages, speech or text Meta aims for a universal translator like "Babel Fish" from Hitchhiker’s Guide.
[saurabhchalke] recently released whisper.unity, a Unity package that implements whisper locally on the Meta Quest 3 VR headset, bringing nearly real-time transcription of natural speech to the dev… ...
With a focus on expressive quality, reproducibility, and open access, Dia adds a distinctive new voice to the landscape of text-to-speech.