# computer-use

5 tools tagged

Showing 5 of 5 tools

GenericAgent

Self-evolving local computer agent with a reusable skill tree

GenericAgent is a minimal, self-evolving autonomous agent in roughly 3K lines of Python that gives LLMs system-level control of a local computer. It writes files, runs shell commands, and browses the web, but its defining feature is skill crystallization: successful task runs are saved as reusable skills inside a growing skill tree that cuts token cost on repeats.

open-sourceOpen Source

Agent Browser

Browser automation CLI built for AI agents by Vercel Labs

Agent Browser is a Rust-based browser automation CLI designed specifically for AI agent workflows rather than traditional testing. Developed by Vercel Labs, it provides semantic element selection through a refs system, accessibility tree snapshots, session persistence, and authentication vaults. Unlike Playwright or Puppeteer which target test automation, Agent Browser optimizes for token efficiency and deterministic element selection that gives LLMs reliable browser interaction capabilities.

open-sourceOpen Source

UI-TARS Desktop

ByteDance's open-source multimodal desktop agent with vision-based GUI automation

UI-TARS Desktop is ByteDance's open-source multimodal AI agent that automates desktop and browser interactions using computer vision rather than DOM selectors or accessibility APIs. Powered by the UI-TARS vision model, it can understand and operate any graphical interface by looking at screenshots, making it capable of automating applications that traditional browser automation tools cannot reach, including native desktop apps and complex web UIs.

open-sourceOpen Source

Teleport Beams

Trusted runtime environments for AI agents in production infrastructure

Teleport Beams provides cryptographically verified, policy-gated access for AI agents to interact with production infrastructure including servers, Kubernetes clusters, and databases. Launched at KubeCon EU 2026, Beams extends Teleport's zero-trust access platform with agent-specific runtime controls, audit trails, and policy enforcement to ensure AI agents operate within defined boundaries when deployed in production environments.

open-sourceOpen Source

CUA (Computer-Use Agent)

Open-source sandboxes and SDKs for AI agents that control desktops

CUA is an open-source infrastructure platform for building, benchmarking, and deploying AI agents that autonomously control full desktop environments. It provides secure sandboxed VMs across macOS, Linux, Windows, and Android with a unified Python SDK for screenshots, mouse/keyboard control, shell commands, and file I/O. Includes CuaBot CLI for running agents in sandboxes, Cua-Bench for standardized evaluation, and Lume for near-native macOS virtualization on Apple Silicon.

freemium