MCP servers, system prompts, agent frameworks, and orchestration tools
Showing 24 of 205 tools
Open-source personal AI agent for messaging apps
OpenClaw is a free, open-source AI agent framework that turns any LLM into an autonomous personal assistant accessible through messaging apps like WhatsApp, Telegram, Discord, and Signal. Running entirely on your local machine via a Node.js gateway, it connects AI models to system tools, browsers, files, and APIs for multi-step task execution with persistent memory across sessions.
Enhanced capabilities for Claude Code sessions
A community plugin collection for Windsurf IDE that extends its AI coding capabilities with specialized tools for testing, documentation, deployment, and code analysis. Each plugin adds targeted functionality that complements Windsurf's built-in features, letting developers customize their AI-assisted workflow for specific languages, frameworks, and development patterns.
Spec-driven development for AI coding assistants
An open-source tool for generating detailed API specifications from natural language descriptions and existing codebases. Produces OpenAPI-compatible specs automatically, streamlines API documentation workflows, and helps teams maintain consistent, up-to-date API contracts across services. Reduces the manual effort of writing and maintaining API specs by inferring endpoints, schemas, and parameters from your actual code.
Autonomous scientific discovery via agentic tree search
AI Scientist v2 is Sakana AI's open-source system for fully autonomous scientific research using LLM-powered agentic tree search. It generates hypotheses, designs experiments, writes and executes code, analyzes results, and produces publishable manuscripts without human intervention. The system uses progressive exploration with backtracking to navigate the research space efficiently.
Unified Python/.NET framework for multi-agent AI
Microsoft Agent Framework is Microsoft's official unified SDK for building multi-agent AI workflows in Python and .NET. It consolidates Semantic Kernel and AutoGen into a single framework with MCP tool integration, graph-based workflows, human-in-the-loop patterns, and multi-agent orchestration. The framework reached Release Candidate status in February 2026 and is Microsoft's recommended path for production agent development.
Context retrieval layer for AI agents and RAG
Airweave is an open-source context retrieval platform that connects AI agents and RAG systems to 50+ apps and databases through a unified search interface. It continuously syncs data from sources like Notion, Slack, GitHub, and databases, making it searchable through LLM-friendly APIs. Airweave includes Python and TypeScript SDKs, MCP support, and a CLI for managing data connections.
Open-source platform for building API integrations
Nango is an open-source API integration platform that handles OAuth authentication, data synchronization, and proxying for 700+ APIs in a single self-hostable package. It manages token refresh, rate limiting, field mapping, and webhook syncing so developer teams can connect AI agents and products to external services in hours instead of weeks. Backed by Y Combinator W23 with $7.5M in funding.
Voice AI APIs for speech-to-text and text-to-speech
Deepgram is a voice AI infrastructure platform providing low-latency speech-to-text, text-to-speech, and conversational AI APIs. Its Nova-3 model delivers industry-leading accuracy for real-time transcription with streaming support, interruption handling, and multi-language capabilities. Used by 1,300+ organizations including Twilio and Vapi, Deepgram powers voice features in applications ranging from call centers to AI agent voice interfaces.
In-page AI browser agent via a single script tag
Page Agent is Alibaba's open-source JavaScript library that embeds an AI GUI agent directly into any web page through a single script tag injection. Unlike headless browser tools that operate externally, Page Agent works inside the DOM using text-based manipulation for natural language QA testing, enterprise copilots, and making legacy web apps AI-native. It supports BYOLLM with any model provider and requires no backend changes.
Headless browsers in Docker for automation at scale
Browserless is a headless browser-as-a-service platform that deploys Chrome, Firefox, and WebKit in Docker containers for web scraping, testing, and AI agent automation. It provides Puppeteer and Playwright-compatible APIs, a built-in MCP server for connecting AI assistants to browser automation, screenshot and PDF generation, and connection pooling for high-concurrency workloads. Available as self-hosted open source or managed cloud.
On-device ML solutions for mobile and edge AI
MediaPipe is Google's open-source framework for building on-device machine learning pipelines across mobile, web, desktop, and edge platforms. It provides pre-built solutions for face detection, hand tracking, pose estimation, object detection, image classification, text classification, and on-device LLM inference. MediaPipe runs entirely locally without cloud dependencies, supporting Android, iOS, Python, and web browsers.
Reusable computer vision tools for developers
Supervision is an open-source Python toolkit by Roboflow providing reusable CV utilities for detection, tracking, annotation, and dataset management. It works with any model including YOLO and Hugging Face via a standardized Detections class. Features include 20+ annotators, ByteTrack object tracking, zone counting, speed estimation, and dataset conversion between COCO, YOLO, and Pascal VOC formats.
Deep learning optimization for distributed training
DeepSpeed is Microsoft's open-source deep learning optimization library that makes distributed training and inference easy, efficient, and effective. Its ZeRO optimizer eliminates memory redundancies across data-parallel processes, enabling training of models with trillions of parameters. DeepSpeed supports 3D parallelism combining data, pipeline, and tensor parallelism, along with mixed precision training, gradient checkpointing, and CPU/NVMe offloading for memory-constrained environments.
YC-backed multimodal RAG platform for documents, images, and video
Morphik is a YC-backed multimodal RAG platform that ingests and retrieves information from documents, images, tables, and video content. It processes complex document layouts including charts, diagrams, and multi-column formats that traditional text-only RAG systems handle poorly. Provides API-first integration for building knowledge bases that understand visual as well as textual information.
AI chatbot framework for WeChat with multi-model and plugin support
chatgpt-on-wechat is an open-source framework for deploying AI chatbots on WeChat, the dominant messaging platform in China. It supports OpenAI, Claude, Gemini, Qwen, and local models through a plugin architecture. Features group chat management, image generation, voice messages, and knowledge base integration. Over 42,700 GitHub stars reflecting massive adoption in the Chinese developer community.
Alibaba's agent framework built for the Qwen model family
Qwen-Agent is Alibaba's open-source framework for building AI agents powered by the Qwen model family. It provides tool use, planning, memory, and multi-agent orchestration with native optimization for Qwen models including function calling and code interpretation. Supports RAG, browser automation, and custom tool development with over 15,900 GitHub stars.
All-in-one embeddings database with RAG, search, and agent capabilities
txtai is a self-contained AI search and RAG platform that combines vector embeddings, semantic search, LLM pipelines, and agent workflows in a single Python library. It handles embedding generation, similarity search, extractive QA, summarization, translation, and custom pipelines without external dependencies. Runs locally with over 12,400 GitHub stars and Apache 2.0 license.
Framework for converting MCP servers into autonomous AI agents with UI
Nanobot transforms MCP servers into full autonomous agents by adding a planning layer, conversation memory, and web-based UI on top of MCP tool capabilities. It enables building agents that combine multiple MCP servers with LLM reasoning to complete multi-step tasks. Features MCP-UI for browser-based interaction and supports any MCP-compatible tools as agent capabilities.
Self-hosted MCP gateway for managing multiple servers behind a single endpoint
MCPJungle is a self-hosted gateway that aggregates multiple MCP servers behind a single endpoint. It provides server discovery, health checking, access control, and request routing so AI clients connect to one gateway rather than managing individual server connections. Supports server grouping, authentication, and monitoring dashboards for production MCP deployments.
Protocol bridge converting MCP stdio transport to HTTP/SSE endpoints
mcp-proxy is a protocol bridge that converts MCP servers using stdio transport into HTTP and Server-Sent Events endpoints accessible over the network. This enables remote MCP server hosting, load balancing, and sharing servers across multiple AI clients. Supports both directions: wrapping stdio servers with HTTP and connecting to remote SSE servers through local stdio. Over 2,400 GitHub stars.
CLI package manager for MCP servers with profile-based configuration
MCPM is a command-line package manager for MCP servers that handles installation, configuration, and profile management. It supports profile-based server grouping where different AI workflows use different sets of MCP servers. Features a server registry, automatic dependency resolution, and configuration file management for Claude Desktop and other MCP clients.
MCP server manager for installing and running Model Context Protocol servers
MCPorter is a management tool for discovering, installing, and running MCP servers. It provides a registry of available servers, handles dependency installation, manages configuration, and starts servers with proper environment setup. Simplifies the process of connecting AI agents to external tools through MCP by abstracting server lifecycle management. Over 5,700 GitHub stars.
Microsoft's MCP server for structured browser automation by AI agents
Playwright MCP is Microsoft's Model Context Protocol server that enables AI agents to automate web browsers through structured tool calls. It exposes Playwright's browser automation capabilities as MCP tools that LLMs can invoke for navigating pages, clicking elements, filling forms, extracting content, and taking screenshots. Provides structured, reliable browser interaction for AI agent workflows.
AWS open-source SDK for building model-driven AI agents
Strands Agents is an open-source SDK from AWS that takes a model-driven approach to building AI agents. Developers define a prompt, model, and tools, and the LLM handles planning and orchestration autonomously. Supports Amazon Bedrock, Anthropic, OpenAI, Gemini, Ollama, and more. Powers Amazon Q Developer and AWS Glue in production. Available in Python and TypeScript with native MCP support.