# speech-recognition

2 tools tagged


Vosk

Offline speech recognition for 20+ languages

Vosk is an offline speech recognition toolkit supporting 20+ languages, with compact models of about 50 MB that run on Raspberry Pi, Android, iOS, and servers. It provides a streaming API with zero-latency response, speaker identification, and reconfigurable vocabulary. Vosk offers bindings for Python, Java, Node.js, C#, Go, and Rust. Unlike cloud-based alternatives, all processing happens locally, with no internet connection required. It is licensed under Apache 2.0 and has 14K+ GitHub stars.

Open Source
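To give a feel for the streaming API mentioned in the Vosk description, here is a minimal sketch using the Python binding. It assumes the `vosk` package is installed, that a model has already been downloaded and unpacked, and that the input is a 16-bit mono PCM WAV file. The helper names `iter_pcm_chunks` and `transcribe`, the chunk size, and the paths are illustrative choices, not part of Vosk itself.

```python
import json
import wave


def iter_pcm_chunks(wav_path, frames_per_chunk=4000):
    """Yield raw PCM byte chunks from a WAV file, emulating a live stream."""
    with wave.open(wav_path, "rb") as wf:
        while True:
            data = wf.readframes(frames_per_chunk)
            if not data:
                break
            yield data


def transcribe(wav_path, model_path):
    """Feed audio to Vosk chunk by chunk and join the recognized utterances."""
    # Imported lazily so the chunk helper above works without vosk installed.
    from vosk import KaldiRecognizer, Model

    with wave.open(wav_path, "rb") as wf:
        sample_rate = wf.getframerate()

    # model_path is an assumption: a directory holding an unpacked Vosk model.
    recognizer = KaldiRecognizer(Model(model_path), sample_rate)

    pieces = []
    for chunk in iter_pcm_chunks(wav_path):
        # AcceptWaveform returns a truthy value once an utterance is complete.
        if recognizer.AcceptWaveform(chunk):
            pieces.append(json.loads(recognizer.Result()).get("text", ""))
    # Flush whatever audio is still buffered at the end of the stream.
    pieces.append(json.loads(recognizer.FinalResult()).get("text", ""))
    return " ".join(p for p in pieces if p)
```

Calling `transcribe("speech.wav", "path/to/model")` (both paths hypothetical) would return the recognized text as one string. For live feedback between utterances, the recognizer's `PartialResult()` method can be polled in the same loop.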

Deepgram

Voice AI APIs for speech-to-text and text-to-speech

Deepgram is a voice AI infrastructure platform providing low-latency speech-to-text, text-to-speech, and conversational AI APIs. Its Nova-3 model delivers industry-leading accuracy for real-time transcription with streaming support, interruption handling, and multi-language capabilities. Used by 1,300+ organizations including Twilio and Vapi, Deepgram powers voice features in applications ranging from call centers to AI agent voice interfaces.

api-usage-based