# text-to-speech

4 tools tagged


ElevenLabs

Lifelike AI voice generation, cloning, and voice agents

ElevenLabs is an AI voice platform for text-to-speech, voice cloning, and conversational AI agents, built on models like Multilingual v2 and the low-latency Flash v2.5 and Turbo v2.5. Developers call its API to generate lifelike narration, clone voices from short audio samples, dub content across 30+ languages, add sound effects, and deploy real-time voice agents for customer service, IVR, and interactive apps, with SDKs for Python, JavaScript, and more.

freemium

MiniMax MCP

MCP server for MiniMax speech, video, and image APIs

Official MiniMax Model Context Protocol (MCP) server that lets AI applications and code editors call MiniMax APIs for text-to-speech, voice cloning, image generation, video generation, and music creation. It is designed to integrate with Claude Desktop, Cursor, and Windsurf, supports stdio and SSE transports, provides separate API endpoints for the global and China regions, and offers flexible resource handling.

api-usage-based, open-source

Coqui TTS

Open-source deep learning text-to-speech toolkit

Coqui TTS is an open-source deep learning toolkit for text-to-speech synthesis, originally built by former Mozilla TTS engineers. It supports multi-speaker and multilingual synthesis, voice cloning from just six seconds of audio, and ships pre-trained models for 20+ languages. After Coqui shut down in 2023, the Idiap Research Institute forked the project and now actively maintains it. With 45K+ GitHub stars, it remains one of the most popular open-source TTS frameworks in Python.

open-source

Chatterbox

State-of-the-art open-source text-to-speech with emotion control

Chatterbox is an open-source text-to-speech model by Resemble AI that offers fine-grained control over emotion and style. The model supports zero-shot voice cloning from short audio samples, produces natural-sounding speech across multiple speaking styles, and runs locally without cloud dependencies. With over 24,000 GitHub stars, it has become a leading open-source alternative to commercial TTS services for developers building voice-enabled AI applications.

open-source