aicoolies logo

Pipecat

Open-source framework for real-time voice and multimodal AI agents

Share
open-sourceOpen Source
Visit Website →

Pipecat is an open-source framework with 11,000+ GitHub stars for building real-time voice and multimodal AI agents. Developed by Daily.co, it manages the STT to LLM to TTS pipeline with sub-second latency, integrating with AWS Bedrock, NVIDIA NIM, and AssemblyAI for production-grade voice agent deployment.

Pipecat addresses the technical challenge of orchestrating real-time voice AI pipelines where speech-to-text, language model processing, and text-to-speech must flow seamlessly with minimal latency. The framework handles the complex timing, buffering, and error recovery required for natural conversational experiences, abstracting away the infrastructure complexity that makes voice agent development notoriously difficult. Built by Daily.co, which has operated WebRTC infrastructure since 2016, Pipecat inherits battle-tested real-time communication expertise.

The pipeline architecture supports pluggable components for each stage: multiple STT providers for speech recognition, any LLM for reasoning, and various TTS engines for voice synthesis. Developers define agent behavior through Python code while Pipecat handles the real-time orchestration, including interruption handling, turn-taking, and graceful degradation under network conditions. Official integrations with AWS Bedrock, NVIDIA NIM Blueprint, and AssemblyAI provide production-ready deployment paths.

With 11,000+ GitHub stars and growing adoption, Pipecat fills a category entirely absent from most developer tool directories: voice AI agent frameworks. As conversational AI interfaces expand beyond text chat, the infrastructure for building reliable voice agents becomes critical. Pipecat supports both telephony and WebRTC transports, enabling agents that work over phone calls, web browsers, and mobile applications. The BSD-2-Clause license ensures flexibility for both open-source and commercial use cases.

Pricing

Free and open-source (BSD-2-Clause)

Platforms

Python; Linux, macOS; WebRTC and telephony transports

Categories

Tags

Use Cases

Alternatives

Related Tools

Hermes Agent logo

Hermes Agent

Top Pick

Open-source AI agent framework with persistent memory, reusable skills, tools, and messaging gateways

Hermes Agent is an open-source AI agent framework with persistent memory, reusable skills, 40+ tools, cron jobs, and messaging gateways.

open-sourceOpen Source

Accomplish Coworker

Open-source desktop AI coworker for browsing and code execution.

Accomplish Coworker is an MIT-licensed open-source AI coworker that runs on the desktop, combining computer-use style browsing with code execution so agents can research, implement, run, and debug workflows in one local environment.

open-sourceOpen SourceTelemetry

Headroom

Context compression for LLM apps and coding agents

Headroom is an Apache-2.0 context compression layer for LLM apps and coding agents. It compresses tool output, logs, files, RAG chunks, and agent history through a local library, proxy, wrapper, or MCP server, with retrieval hooks for bringing originals back when needed. Treat its savings numbers as Headroom-reported benchmarks, not independent aicoolies measurements.

open-sourceOpen SourceTelemetry

Codebase Memory MCP

Codebase knowledge graph MCP server for AI coding agents

Codebase Memory MCP is an MIT-licensed MCP server that turns a repository into a persistent code knowledge graph for AI coding agents. It gives Claude Code, Cursor, Codex-style agents, and other MCP clients structural queries for functions, classes, call chains, routes, and architecture, helping them explore large projects without repeatedly rereading files or relying only on broad search.

open-sourceOpen SourceTelemetry
BeeAI Framework logo

BeeAI Framework

Python and TypeScript framework for production multi-agent systems

BeeAI Framework is an Apache-2.0 toolkit for building production-ready AI agents and multi-agent systems in Python and TypeScript. Its docs cover agents, tools, RAG, memory, workflows, backend providers, serving, and A2A/MCP integration surfaces, making it a vendor-neutral option for teams comparing LangGraph, CrewAI, Mastra, and related agent runtimes.

open-sourceOpen SourceTelemetry
Klavis AI logo

Klavis AI

MCP integration platform for agent tool use at scale

Klavis AI is an Apache-2.0 MCP integration platform for teams connecting AI agents to external SaaS tools and APIs. The public repo and official docs position it as infrastructure for reliable tool access at scale, so it fits teams that want reusable MCP connectors without treating every integration as a one-off script or custom OAuth maintenance project.

open-sourceOpen SourceTelemetry