2 tools tagged
NVIDIA's real-time persona-driven voice dialogue model
PersonaPlex is NVIDIA's open-source, full-duplex speech-to-speech conversational AI model that enables persona control through text-based role prompts and audio-based voice conditioning. Built on the Moshi architecture, it produces natural, low-latency spoken interactions and keeps a consistent persona across conversations. The model supports multiple pre-packaged voice embeddings covering both natural and varied speaking styles, making it suitable for building interactive voice agents and assistants.
Voice AI APIs for speech-to-text and text-to-speech
Deepgram is a voice AI infrastructure platform providing low-latency speech-to-text, text-to-speech, and conversational AI APIs. Its Nova-3 model is positioned for high-accuracy real-time transcription, with streaming support, interruption handling, and multi-language capabilities. Used by 1,300+ organizations, including Twilio and Vapi, Deepgram powers voice features in applications ranging from call centers to AI agent voice interfaces.