Loading...
Loading...
Using autonomous AI agents that can plan, write, test, and deploy code independently — from terminal agents to background coding assistants that work while you focus on other tasks
Showing 24 of 307 tools
Persistent memory layer for AI coding agents — keeps Claude Code, Codex, Cursor, and any MCP agent in context across sessions
agentmemory is an open-source MCP server that gives AI coding agents persistent, cross-session memory. Built on hybrid vector-graph search, it achieves 95.2% recall on the LongMemEval-S benchmark while using up to 92% fewer context tokens than naive context injection. Works out of the box with Claude Code, Codex, Cursor, Windsurf, Cline, OpenCode, Kilo Code, Hermes, and any MCP client through 51 MCP tools plus 12 hooks and 4 skills.
MCP, ACP and Skills support for building production coding agents — interactive or automated.
fast-agent is an Apache-licensed Python framework for building and running LLM agents with full MCP (Model Context Protocol) and ACP support. It ships with an interactive shell mode, Skills management, and multi-model routing — making it a practical platform for coding agents, workflow automation, and agent evaluation across Claude, Codex, HuggingFace, and local models.
Command center for Claude Code and Codex — monitor, steer, and voice-control your AI agents from any device.
Omnara is a command center for AI coding agents, letting you run, monitor, and steer Claude Code and Codex sessions from your phone, web browser, Apple Watch, or any device while the agent runs on your machine. Sessions migrate to the cloud when your laptop goes offline, and the voice-first interface lets you guide your agent hands-free. Built by a YC S25 team and available with a free tier plus paid plans across desktop, web, and mobile clients.
Vectorless, reasoning-based RAG that reads documents like a human expert — no vector DB, no chunking.
PageIndex is a vectorless, reasoning-based RAG system that builds hierarchical tree indexes from long documents and uses LLMs to navigate them like a human expert would. Instead of chunking text and comparing embeddings, it constructs a table-of-contents-style structure and reasons its way to the right sections — no vector database required. Available as an open-source Python package, cloud API, MCP server, and chat platform.
Production-grade browser automation with AI self-healing and Playwright code ownership
Intuned is a code-first browser automation platform that turns natural language prompts into production-ready Playwright code, deploys it, and self-heals it when target sites change. Supports TypeScript and Python with Anthropic Computer Use, OpenAI CUA, Stagehand, Browser-Use, and Gemini Computer Use integrations. Built-in stealth, captcha solving, auth session management, and scheduled runs with concurrency control. No vendor lock-in—you own the code.
AI agent dev environment with parallel git worktrees, magic git commands, and Linear integration.
Jean is an open-source desktop dev environment for AI agents from coolLabs (the team behind Coolify). It runs multiple coding agents — Claude Code, Codex, and others — in parallel inside isolated git worktrees, each with its own chat session and terminal. Magic git commands handle commits, PR descriptions, code reviews, and merge conflicts with AI assistance, while built-in Linear and GitHub integrations load issue context into every session.
Cut Claude Code token costs by up to 50% with a local plugin that never uploads your code.
WOZCODE is a Claude Code plugin that reduces token consumption by 25–55% using smarter context reads, batched file edits, AST truncation, and Haiku subagents. It installs in seconds with two CLI commands, runs entirely locally with no code upload, and requires no account sign-up. Developers report finishing the same tasks in fewer tokens without changing their existing editor or workflow.
Always-on cloud engineer that lives in Slack and ships verified PRs
Roomote is a Slack-first cloud coding agent from RooCodeInc that takes prompts end-to-end across GitHub, Linear, Notion, Sentry, and your own dev environment, then opens self-verified pull requests for review. It is the team behind 23k-star Roo Code going all-in on cloud agents — plug it into your stack, mention it in Slack, and it answers questions, drafts plans, and ships verified PRs without asking engineers to leave their flow.
Sandboxes for coding agents — Linux VMs, Git, and deploys in one box
Freestyle is YC-backed sandbox infrastructure built for AI coding agents, shipping secure Linux VMs with nested virtualization, Git servers, and one-click web deploys. It lets agents run real workloads, branch repos, and deploy apps under short-lived identities while billing only for active compute. Used in production by vly.ai, Rork, and Vibeflow.
Rust-native multi-agent orchestration for production
GraphBit is a Rust-native, multi-agent orchestration framework built for production. It targets the gap between Python-first frameworks like LangGraph and the operational expectations of enterprise systems — predictable memory, low latency, deterministic concurrency, and the ability to embed an agent runtime in services that already run Rust without dragging in a Python interpreter.
Self-evolution engine for AI agents with auditable updates
Evolver is an open-source self-evolution engine for AI agents that turns run logs into auditable, reviewable updates via its Genome Evolution Protocol. Instead of ad hoc prompt tweaking, teams collect traces and Evolver proposes versioned diffs to prompts, tools and workflows that engineers can approve, reject or roll back like code.
Self-evolving local computer agent with a reusable skill tree
GenericAgent is a minimal, self-evolving autonomous agent in roughly 3K lines of Python that gives LLMs system-level control of a local computer. It writes files, runs shell commands, and browses the web, but its defining feature is skill crystallization: successful task runs are saved as reusable skills inside a growing skill tree that cuts token cost on repeats.
Headless browser cloud built for AI agents
Browserbase is cloud infrastructure that runs headless Chromium browsers on demand for AI agents and automation workflows, exposing Playwright, Puppeteer, and Selenium endpoints with built-in session replay, residential proxies, CAPTCHA solving, and stealth fingerprints. It also hosts Stagehand and a Model Gateway, letting teams build browser-using agents without maintaining their own fleet of Kubernetes-managed Chromium instances.
Open-source async coding agent you can run in your own sandbox
Open-source framework from LangChain AI for building your organization's internal coding agent — the same pattern Stripe's Minions, Ramp's Inspect, and Coinbase's Cloudbot follow. Built on LangGraph and Deep Agents, Open SWE runs each task in an isolated cloud sandbox (Modal, Daytona, Runloop, or LangSmith), invokes from Slack, Linear, or GitHub, orchestrates subagents, and opens pull requests autonomously — customizable end-to-end for your codebase and conventions.
Build modular, scalable LLM applications in Rust
Open-source Rust library for building scalable, modular, and ergonomic LLM-powered applications. Rig unifies 20+ model providers (OpenAI, Anthropic, Mistral, DeepSeek, Ollama, and more) and 10+ vector stores behind one trait-based interface, supports completion and embedding workflows, multi-turn streaming, and transcription/audio/image generation, with full GenAI Semantic Convention compatibility and WASM-ready core library — production agentic infra for Rust teams.
Fork, customize, and ship AI agents on Vercel in minutes
Open Agents is a Vercel Labs open-source template for building and deploying cloud-hosted AI agents. It provides a production-ready Next.js starter with built-in tool use, streaming responses, multi-model support, and deployment on Vercel infrastructure. Developers can fork, customize agent behavior and tools, then ship agent-backed apps in minutes with automatic scaling and edge routing.
Human-in-the-loop web agent you can co-pilot in real time
Magentic-UI is a Microsoft Research web agent with a human-in-the-loop interface for browsing, coding, and file tasks. It plans multi-step actions, asks for approval before executing, and lets users co-pilot by taking over the browser mid-task. Built on AutoGen, it runs a team of specialized agents for web browsing, file handling, and code execution with full action transparency and safety guardrails.
Constrained generation that guarantees valid LLM outputs every time
Guidance is Microsoft's structured generation library that enforces output constraints directly within LLM decoding. It supports JSON schemas, regex patterns, grammars, and interleaved generation-and-control flow to guarantee valid outputs from any compatible model. Works with local models via llama.cpp, Transformers, and remote APIs including OpenAI and Anthropic. Eliminates retry loops and post-processing for structured data extraction.
Vector search extension for SQLite that runs anywhere
sqlite-vec is a lightweight vector search extension for SQLite written in pure C with zero dependencies. It brings nearest-neighbor search capabilities directly into SQLite databases, enabling AI applications to store and query embeddings without running a separate vector database. The extension works everywhere SQLite runs including Linux, macOS, Windows, WebAssembly in browsers, and even Raspberry Pi devices. Sponsored by Mozilla Builders, Fly.io, and Turso.
AI-powered task management for agentic coding workflows
Claude Task Master is an AI-powered task management system designed for agentic development workflows in IDEs like Cursor, Windsurf, Lovable, and Roo. It breaks complex projects into structured task trees with dependencies, priorities, and complexity scores so AI coding agents can execute work methodically. The MCP server integration enables direct task operations from any compatible client, while tagged task lists support multi-context management across branches and environments.
Connect Blender to AI through Model Context Protocol
BlenderMCP bridges Blender and AI assistants like Claude through the Model Context Protocol, enabling natural language control over 3D modeling workflows. Artists and developers can create, modify, and manipulate 3D scenes, objects, materials, and animations by describing what they want instead of navigating complex menus. The tool exposes Blender's Python API through MCP, supporting operations from mesh creation to shader editing and physics simulations.
Context engineering platform for AI agents with temporal knowledge graphs
Zep is a context engineering platform that assembles relationship-aware context for AI agents from conversations, business data, documents, and events. It maintains a temporal knowledge graph that automatically extracts entities and relationships, tracking how context evolves over time. Zep delivers formatted context blocks optimized for LLMs with sub-200ms latency, integrating with LangChain, LlamaIndex, AutoGen, and Google ADK through Python, TypeScript, and Go SDKs.
Agent memory system that learns, not just remembers
Hindsight is an agent memory system that enables AI agents to learn from experience rather than just store conversations. It organizes memories into three biomimetic categories: World knowledge for facts, Experiences for agent events, and Mental Models for learned understanding. The system provides retain, recall, and reflect operations backed by a temporal knowledge graph with parallel retrieval strategies including semantic, keyword, graph traversal, and temporal search.
Agentic IM chatbot platform with multi-platform LLM integration
AstrBot is an open-source agentic chatbot infrastructure that connects multiple instant messaging platforms including Telegram, Discord, Slack, WeChat, QQ, Feishu, and DingTalk to AI language models. It supports multi-provider LLM integration, MCP protocol, knowledge bases, persona management, multimodal input, and a plugin ecosystem with over 1,000 community extensions. Features include a web management UI, sandbox code execution, and auto-context compression for efficient conversations.