# computer-use
8 tools tagged
Showing 8 of 8 tools
Accomplish Coworker
Open-source desktop AI coworker for browsing and code execution.
Accomplish Coworker is an MIT-licensed open-source AI coworker that runs on the desktop, combining computer-use style browsing with code execution so agents can research, implement, run, and debug workflows in one local environment.
agent-desktop
Accessibility-tree desktop automation engine for deterministic native-app control
agent-desktop is a Rust-native desktop automation engine for AI agents that need structured control of native applications without relying only on screenshots or pixel loops. It exposes accessibility-tree snapshots, stable element references, progressive traversal, and action primitives that can let coding agents and automation stacks operate on Windows, macOS, Electron, and legacy interfaces with lower token cost and better repeatability than pure vision control.
Coasty
Open computer-use platform for browser, terminal, and full desktop automation
Coasty is an open-source computer-use platform for teams that want AI agents to operate across browser, terminal, and full desktop surfaces instead of only clicking DOM nodes. The project combines planner/orchestrator logic, visual and input control, local Electron workflows, remote sandbox options, an MCP server, and logs for debugging long-running automations. It is a strong fit for QA, research-to-action, form workflows, and repetitive desktop tasks where browser-only agents are too narrow.
GenericAgent
Self-evolving local computer agent with a reusable skill tree
GenericAgent is a minimal, self-evolving autonomous agent from a 3.3K-line seed and ~3K core loop that gives LLMs system-level control of a local computer. It writes files, runs shell commands, browses the web, and uses keyboard/mouse/screen/mobile tools, while skill crystallization saves successful runs into a reusable skill tree that cuts token cost on repeats.
Agent Browser
Browser automation CLI built for AI agents by Vercel Labs
Agent Browser is a Rust-based browser automation CLI designed specifically for AI agent workflows rather than traditional testing. Developed by Vercel Labs, it provides semantic element selection through a refs system, accessibility tree snapshots, session persistence, and authentication vaults. Unlike Playwright or Puppeteer which target test automation, Agent Browser optimizes for token efficiency and deterministic element selection that gives LLMs reliable browser interaction capabilities.
UI-TARS Desktop
ByteDance's open-source multimodal desktop agent with vision-based GUI automation
UI-TARS Desktop is ByteDance's open-source multimodal AI agent that automates desktop and browser interactions using computer vision rather than DOM selectors or accessibility APIs. Powered by the UI-TARS vision model, it can understand and operate any graphical interface by looking at screenshots, making it capable of automating applications that traditional browser automation tools cannot reach, including native desktop apps and complex web UIs.
Teleport Beams
Trusted runtime environments for AI agents in production infrastructure
Teleport Beams provides cryptographically verified, policy-gated access for AI agents to interact with production infrastructure including servers, Kubernetes clusters, and databases. Launched at KubeCon EU 2026, Beams extends Teleport's zero-trust access platform with agent-specific runtime controls, audit trails, and policy enforcement to ensure AI agents operate within defined boundaries when deployed in production environments.
CUA (Computer-Use Agent)
Open-source sandboxes and SDKs for AI agents that control desktops
Open-source computer-use infrastructure for agents that need to drive desktop environments in the background. CUA includes Cua Driver, Sandbox, Run, Bench, and Verified Data across Linux, Windows, macOS, and Android, with MCP and CLI surfaces for screenshots, accessibility trees, keyboard/mouse actions, shell commands, task evaluation, and fleet execution.