aicoolies logo
Coasty logo

Coasty

Open computer-use platform for browser, terminal, and full desktop automation

Share
open-sourceOpen Source
Visit Website →

Coasty is an open-source computer-use platform for teams that want AI agents to operate across browser, terminal, and full desktop surfaces instead of only clicking DOM nodes. The project combines planner/orchestrator logic, visual and input control, local Electron workflows, remote sandbox options, an MCP server, and logs for debugging long-running automations. It is a strong fit for QA, research-to-action, form workflows, and repetitive desktop tasks where browser-only agents are too narrow.

Coasty is positioned as a broader computer-use layer rather than another browser-only agent. The project targets workflows where an agent may need to use a browser, a terminal and a desktop application in the same task. That makes it relevant for QA teams, operators and developers who are testing whether computer-use agents can handle real work instead of short demos. The key editorial distinction is reach: Coasty is about orchestrating a full computer environment, not simply automating a web page.

The reported workflow combines planner and orchestrator logic, desktop input control, visual context, remote sandbox options, local Electron-style use and an MCP server for connecting the automation layer to agent clients. The main value is the ability to move between surfaces while keeping logs and a deployment story that can support repeatable workflows. In practice, teams should benchmark it against Browser Use, Skyvern and OpenClaw on tasks that include more than one app, because that is where the differentiation becomes clear.

The core project is open source under Apache 2.0, while hosted or sandbox access should be verified before making procurement claims. Teams should evaluate reliability, audit logs, permissions and the operational boundary around remote environments before using it for sensitive production workflows. Coasty is strongest as a candidate for experimentation, QA automation and internal workflows that need browser-plus-desktop reach, especially when a team wants MCP compatibility and a path toward reproducible agent operations.

Pricing

Open-source core under Apache 2.0 with GitHub-hosted code; hosted or sandbox access should be verified from Coasty before making procurement claims.

Platforms

GitHub project and local/remote computer-use workflow spanning browser, terminal, and desktop automation; MCP server support is part of the positioning.

Categories

Tags

Use Cases

Alternatives

Browser Use logo

Browser Use

AI agent framework for web browser automation

Browser Use is an open-source AI agent framework with 85K+ GitHub stars enabling LLMs to control web browsers via natural language. Y Combinator-backed, it lets agents navigate sites, fill forms, extract data, and complete multi-step tasks autonomously. Built on Playwright with vision-based element detection, multi-tab management, cookie persistence, and self-correcting actions. Supports OpenAI, Anthropic, and local models with a simple Python API for building custom browser agents.

open-sourceOpen Source

Skyvern

Browser automation with AI vision — no XPath or DOM parsing needed

Skyvern automates browser-based workflows using LLMs and computer vision instead of brittle XPath or CSS selectors. It understands web pages visually, navigating forms, clicking buttons, and extracting data like a human would. Achieved 85.85% success rate on WebVoyager benchmark and SOTA on WRITE tasks for RPA. 21,000+ GitHub stars, AGPL-3.0 licensed. Skyvern Cloud offers managed usage-based hosting for teams that prefer not to self-host the infrastructure.

open-sourceOpen Source
Grok logo

Grok Build

Top Pick

xAI's terminal coding agent with parallel subagents and worktree-aware automation

Grok Build is xAI's terminal-first coding agent for planning, editing, testing, and reviewing code from a local CLI. The early beta exposes subagent controls, worktree mode, headless JSON output, best-of-N parallel attempts, sandbox profiles, and experimental memory. It fits developers comparing Claude Code, Codex, and Gemini CLI for local agentic workflows with deeper parallel execution.

paid

Related Tools

Claude Code logo

Claude Code

Top Pick

Anthropic's agentic coding CLI

Anthropic's agentic CLI coding tool that delegates complex tasks to Claude directly from the terminal. Understands entire codebases via automatic context gathering, edits multiple files, runs shell commands, and manages Git workflows autonomously. Supports CLAUDE.md for persistent project instructions, integrates with VS Code and JetBrains, and uses Claude Opus/Sonnet with extended thinking for complex architectural decisions. Built for terminal-first developers.

paidOpen Source
OpenCode logo

OpenCode

Top Pick

Open-source AI coding agent for the terminal

Open-source terminal-based AI coding agent built in Go by the SST team, with a rich TUI (Bubble Tea) supporting 75+ model providers including OpenAI, Anthropic, Gemini, Bedrock, Groq, and OpenRouter. Features vim-like editing, persistent SQLite sessions, and LSP integration for 40+ languages. Fully free with no vendor lock-in, it has rapidly grown to 95k+ GitHub stars.

open-source
Codex logo

Codex

Top Pick

OpenAI's agentic coding CLI and cloud sandbox

OpenAI's cloud-based AI coding agent powered by codex-1 (a version of o3 optimized for software engineering). Autonomously writes features, fixes bugs, and proposes pull requests, with each task running in its own sandboxed environment preloaded with your repository. Teams can deploy multiple agents in parallel to work on independent tasks, with MCP integration and AGENTS.md for repo-specific instructions.

freemiumOpen Source
Figma Context MCP logo

Figma Context MCP

MCP server that gives coding agents structured Figma context for design-to-code work

Figma Context MCP is an MCP server for giving coding agents structured access to Figma design context during implementation. Instead of copying screenshots or hand-written design specs into prompts, teams can expose layout, component, and context information to agents such as Cursor, Claude Code, and other MCP-compatible coding workflows. It is a strong design-to-code bridge for teams trying to reduce hallucinated UI details and tighten handoff between designers and AI-assisted developers.

open-sourceOpen SourceTelemetry

Grok CLI

Community Grok terminal agent for xAI-powered coding and command-line workflows

Grok CLI is a community command-line interface for using xAI/Grok models from a terminal workflow. It fits developers who want a lightweight, scriptable Grok surface for coding help, command-line experiments, and local agent-style interactions without waiting for a heavier IDE integration. For aicoolies, it belongs in the fast-growing AI CLI agents lane beside Grok Build, Claude Code, Codex, Gemini CLI, and Qwen Code.

open-sourceOpen SourceTelemetry
Webwright logo

Webwright

Microsoft browser agent that turns long-horizon web tasks into reusable Playwright code

Webwright is a Microsoft browser-agent project that asks coding models to write, debug, and reuse Playwright scripts instead of relying on one-off stochastic click loops. The approach gives automation teams a more inspectable artifact: scripts can be logged, reviewed, rerun, and maintained like normal test or scraping code. It is especially relevant for long-horizon browser tasks where teams care about determinism, auditability, and resilience to UI changes.

open-sourceOpen Source