# voice-cloning

3 tools tagged

## MiniMax MCP

MCP server for MiniMax speech, video, and image APIs

The official MiniMax Model Context Protocol (MCP) server gives AI applications and code editors access to MiniMax's text-to-speech, voice cloning, image generation, video generation, and music creation APIs. It is designed for integration with Claude Desktop, Cursor, and Windsurf, supports both stdio and SSE transports, and provides separate API endpoints for the global and China regions along with flexible resource handling.

api-usage-based · Open Source

## VoxCPM

Tokenizer-free multilingual TTS with voice cloning

VoxCPM is an open-source text-to-speech system from OpenBMB that generates continuous speech across 30 languages without traditional tokenization. Its 2B-parameter end-to-end diffusion architecture produces 48 kHz studio-quality audio with natural prosody and emotion. Key capabilities include voice design from text descriptions, few-shot voice cloning, and multilingual synthesis without language-specific modules. The project is licensed under Apache 2.0 and has 8,700 GitHub stars.

open-source · Open Source

## Coqui TTS

Open-source deep learning text-to-speech toolkit

Coqui TTS is an open-source deep learning toolkit for text-to-speech synthesis, originally built by former Mozilla TTS engineers. It supports multi-speaker and multilingual synthesis, clones a voice from as little as six seconds of audio, and ships pre-trained models for 20+ languages. After Coqui shut down in 2023, the Idiap Research Institute forked the project and now actively maintains it. With 45K+ GitHub stars, it remains one of the most popular open-source TTS frameworks in Python.

open-source · Open Source