# sandbox
15 tools tagged
Showing 15 of 15 tools
Superserve
Open-source Firecracker sandboxes for long-running AI agents
Superserve is an open-source sandbox infrastructure layer for AI agents that need durable computers instead of short-lived shells. It runs isolated Firecracker microVMs, supports pause, resume, snapshot, fork, preview URLs, MCP connectivity, SDK/API control, Docker workloads, and self-hosting, while the hosted service adds pay-as-you-go agent sandboxes for teams.
eve by Vercel
Filesystem-first framework for durable AI agents
Eve is Vercel's filesystem-first TypeScript framework for building durable AI agents as ordinary project files. It combines Markdown instructions and skills, typed tools, channels, connections, subagents, schedules, sandboxes, and evals with Vercel's agent runtime so teams can ship deployable agents without hand-rolling orchestration. The current beta fits Vercel-native backend agent projects.
Freestyle
Sandboxes for coding agents — Linux VMs, Git, and deploys in one box
Freestyle is YC-backed sandbox infrastructure built for AI coding agents, shipping secure Linux VMs with nested virtualization, Git servers, and one-click web deploys. It lets agents run real workloads, branch repos, and deploy apps under short-lived identities while billing only for active compute. Used in production by vly.ai, Rork, and Vibeflow.
Open SWE
Open-source async coding agent you can run in your own sandbox
Open-source framework from LangChain AI for building your organization's internal coding agent — the same pattern Stripe's Minions, Ramp's Inspect, and Coinbase's Cloudbot follow. Built on LangGraph and Deep Agents, Open SWE runs each task in an isolated cloud sandbox (Modal, Daytona, Runloop, or LangSmith), invokes from Slack, Linear, or GitHub, orchestrates subagents, and opens pull requests autonomously — customizable end-to-end for your codebase and conventions.
ralphy
Multi-agent loop runner for autonomous PRD completion
ralphy is an autonomous bash-based orchestrator that runs Claude Code, Codex, OpenCode, Cursor agent, Qwen, and Droid in a continuous loop until a PRD is complete. Supports parallel task execution with isolated git worktrees, automated branching per task, PR creation, and webhook notifications to Discord and Slack. Cross-platform on macOS, Linux, and Windows via WSL2 or Git Bash.
ruflo
Agent orchestration platform for Claude multi-agent swarms
ruflo is an agent orchestration platform built for Claude that enables deploying intelligent multi-agent swarms and coordinating autonomous workflows. Features enterprise-grade architecture with distributed swarm intelligence, self-learning capabilities, 100+ pre-built specialized agents, fault-tolerant consensus mechanisms, and RAG integration. Native Claude Code and Codex integration with MCP extension support.
gptme
Personal AI agent in your terminal with local tool access
gptme is one of the earliest terminal-based AI agent CLIs, launched in spring 2023. It equips a personal AI agent with local tools to run shell commands, write and edit code, browse the web via Playwright, and use vision capabilities. Supports MCP integration and an extensible plugin system for building persistent autonomous agents. The Bob reference agent is documented as having run extensively as an autonomous agent.
Agent Orchestrator
Parallel coding agent orchestrator with autonomous CI handling
Agent Orchestrator manages fleets of parallel AI coding agents, each working in isolated git worktrees. It handles CI failures, code review comments, and pull request coordination while keeping humans involved for judgment calls. The current canonical repository is maintained under AgentWrapper.
Symphony
OpenAI's autonomous coding agent orchestration framework
Symphony is OpenAI's open-source framework that turns project work into isolated, autonomous implementation runs. Instead of supervising coding agents line by line, teams assign tasks from project boards and Symphony dispatches agents to handle them independently. Each agent works in an isolated workspace, provides proof of work documentation including CI status and PR review feedback, and can automatically merge approved pull requests.
OpenFang
Rust-based agent OS with built-in security, WASM sandboxing, and multi-agent runtime
OpenFang is an open-source agent operating system built in Rust that provides a secure multi-agent runtime with WASM sandboxing, auditability layers, and multi-channel communication. It goes beyond typical orchestration SDKs by treating agent security and operational isolation as first-class concerns, making it suitable for teams deploying agents in environments where trust boundaries and audit trails matter.
CUA (Computer-Use Agent)
Open-source sandboxes and SDKs for AI agents that control desktops
Open-source computer-use infrastructure for agents that need to drive desktop environments in the background. CUA includes Cua Driver, Sandbox, Run, Bench, and Verified Data across Linux, Windows, macOS, and Android, with MCP and CLI surfaces for screenshots, accessibility trees, keyboard/mouse actions, shell commands, task evaluation, and fleet execution.
Microsandbox
Local microVM sandboxes for AI agent code execution
Microsandbox provides hardware-level isolated sandboxes for AI agents to execute code safely on local machines. Using libkrun microVMs and a 320ms bare-metal Linux/KVM homepage benchmark, it offers stronger isolation than Docker containers while staying lightweight enough for dev workstations. OCI-compatible with Python and Node.js runtimes. Apache-2.0 licensed with 6.6K+ GitHub stars.
Steel
Open-source browser infrastructure for AI agents at scale
Steel is an open-source browser API purpose-built for AI agents, providing managed headless browser sessions with anti-bot bypass, proxy rotation, CAPTCHA solving, and session persistence. It handles the infrastructure layer that browser automation agents like Browser Use and Stagehand run on top of. Self-hostable or available as a cloud service. Over 6,000 GitHub stars.
Dagger Container Use
Containerized sandboxes for AI coding agents
Dagger Container Use provides isolated container environments for AI coding agents, enabling multiple agents to work in separate sandboxed branches simultaneously. Built by the Dagger team, it ensures reproducibility and safety for autonomous code execution by giving each agent its own containerized workspace with full toolchain access.
Plandex
Open-source AI coding agent for large projects and real-world tasks
Terminal-based coding engine that breaks complex tasks into subtasks with support for 2M token context windows, diff review sandboxes, and both autonomous and manual modes. Multi-model pipelines work with Anthropic, OpenAI, and Google. With 15k+ GitHub stars, Plandex is a mature open-source option for developers tackling large, multi-step development tasks from the command line.