# voice AI

3 tools tagged

Showing 3 of 3 tools

ElevenLabs

Lifelike AI voice generation, cloning, and voice agents

ElevenLabs is an AI voice platform for text-to-speech, voice cloning, and conversational AI agents, built on models like Multilingual v2 and the low-latency Flash v2.5 and Turbo v2.5. Developers call its API to generate lifelike narration, clone voices from short audio samples, dub content across 30+ languages, add sound effects, and deploy real-time voice agents for customer service, IVR, and interactive apps, with SDKs for Python, JavaScript, and more.

freemium

PersonaPlex

NVIDIA's real-time persona-driven voice dialogue model

PersonaPlex is NVIDIA's open-source, full-duplex speech-to-speech conversational AI model that enables persona control through text-based role prompts and audio-based voice conditioning. Built on the Moshi architecture, it produces natural, low-latency spoken interactions with consistent persona across conversations. The model supports multiple pre-packaged voice embeddings for both natural and varied speaking styles, making it suitable for building interactive voice agents and assistants.

open-sourceOpen Source

Deepgram

Voice AI APIs for speech-to-text and text-to-speech

Deepgram is a voice AI infrastructure platform providing low-latency speech-to-text, text-to-speech, and conversational AI APIs. Its Nova-3 model delivers industry-leading accuracy for real-time transcription with streaming support, interruption handling, and multi-language capabilities. Used by 1,300+ organizations including Twilio and Vapi, Deepgram powers voice features in applications ranging from call centers to AI agent voice interfaces.

api-usage-based