Speech to Text Demonstration

News

Kyutai vs Whisper : Streaming Speech-to-Text AI Models Compared

Discover the key differences between Moshi and Whisper speech-to-text models. Speed, accuracy, and use cases explained for your next project.

How to Choose a Speech-to-Text Converter

Key features, accuracy, and usability factors to consider when selecting the right speech-to-text converter for your needs ...

TechCrunch6mon

ElevenLabs is launching its own speech-to-text model

The company took a step in another technological direction by launching its first stand-alone speech-to-text model called Scribe.

VentureBeat6mon

ElevenLabs’ new speech-to-text model Scribe is here with highest ...

ElevenLabs, the highly-valued AI voice cloning and generation startup from former Palantir alumni, today launched Scribe v1, a new speech-to-text model that reportedly achieves the highest ...

CNET2y

Meta’s New AI Can Translate Speech and Text for Nearly 100 Languages

Meta says that it's the biggest open-source multimodal dataset, containing 270,000 hours' worth of mined speech and text alignment on which its AI was trained.

Ars Technica2y

Meta’s “massively multilingual” AI model translates up to 100 ...

Meta’s “massively multilingual” AI model translates up to 100 languages, speech or text Meta aims for a universal translator like "Babel Fish" from Hitchhiker’s Guide.

Hackaday1y

Robust Speech-to-Text, Running Locally On Quest VR Headset

[saurabhchalke] recently released whisper.unity, a Unity package that implements whisper locally on the Meta Quest 3 VR headset, bringing nearly real-time transcription of natural speech to the dev… ...

VentureBeat4mon

A new, open source text-to-speech model called Dia has arrived to ...

With a focus on expressive quality, reproducibility, and open access, Dia adds a distinctive new voice to the landscape of text-to-speech.

Results that may be inaccessible to you are currently showing.

Hide inaccessible results