Browser Use

AI agent framework for web browser automation

open-sourceOpen Source

Browser Use is an open-source AI agent framework with 99K+ GitHub stars enabling LLMs to control web browsers via natural language. Y Combinator-backed, it lets agents navigate sites, fill forms, extract data, and complete multi-step tasks autonomously. Built on Playwright with vision-based element detection, multi-tab management, cookie persistence, and self-correcting actions. Supports OpenAI, Anthropic, and local models with a simple Python API for building custom browser agents.

We have a review for this tool

A detailed review by the aicoolies team — click to read

Browser Use is an open-source framework enabling AI agents to interact with web browsers using natural language. With 99K+ GitHub stars gained rapidly, it is one of the fastest-growing projects in AI browser automation, backed by Y Combinator.

Built on Playwright, agents can navigate pages, click elements, fill forms, extract data, handle authentication, and complete complex web workflows. Vision-based element detection lets agents identify UI elements without explicit selectors.

Multi-tab management, cookie persistence, and self-correcting actions make it production-capable. Supports OpenAI, Anthropic, Gemini, and local models with a minimal Python API.

Valuable for web scraping, automated testing, form filling, competitive research, and any task requiring human-like web interaction — especially for sites without APIs.

Pricing

MIT OSS library free; cloud starts $0 with 3 sessions/10 tasks; Dev $29/mo, Business $299/mo, Scaleup $999/mo

Platforms

Python, Playwright, any OS

Use Cases

Agentic Development DevOps Automation

Alternatives

Firecrawl MCP Server

Web scraping and crawling via MCP for AI agents

Firecrawl MCP Server is the official MCP integration for Firecrawl, giving Cursor, Claude, Windsurf, and other MCP clients scrape, crawl, map, search, extract, and agent-style web research tools. It now supports a hosted remote endpoint, keyless rate-limited scrape/search/interact use, API-key/OAuth access for the full tool set, and self-hosted Firecrawl deployments.

freemiumOpen Source

BrowserMCP

Automate local Chrome browser via MCP

BrowserMCP is an MCP server that enables AI agents to automate a local Chrome browser — navigating pages, clicking elements, filling forms, extracting content, and taking screenshots. It gives coding agents the ability to interact with web applications the way a human would, directly from Claude Desktop, Cursor, or any MCP client.

open-sourceOpen Source

Browserbase MCP Server

Cloud browser automation via MCP for scalable testing

Browserbase MCP Server gives MCP clients a hosted or self-hostable browser through Browserbase and Stagehand. It exposes tools for starting sessions, navigating, acting, observing, extracting, and taking screenshots, with a hosted Streamable HTTP endpoint for easiest setup and local STDIO/Docker options for teams that want to run the Apache-licensed server themselves.

freemiumOpen Source

ScrapeGraphAI

LLM-powered web scraping with graph-based extraction pipelines

ScrapeGraphAI is a Python library that uses LLMs and graph-based logic to build automated, self-healing web scraping pipelines. Developers describe desired data in natural language and ScrapeGraphAI constructs a processing graph that extracts structured information from any website. It supports multiple LLM providers, achieves 96%+ accuracy on semantic extraction benchmarks, and adapts to layout changes automatically. Over 20,000 GitHub stars.

open-sourceOpen Source

Related Tools

Hermes Agent

Top Pick

Open-source AI agent framework with persistent memory, reusable skills, tools, and messaging gateways

Hermes Agent is an open-source AI agent framework with persistent memory, reusable skills, 40+ tools, cron jobs, and messaging gateways.

open-sourceOpen Source

Accomplish Coworker

Open-source desktop AI coworker for browsing and code execution.

Accomplish Coworker is an MIT-licensed open-source AI coworker that runs on the desktop, combining computer-use style browsing with code execution so agents can research, implement, run, and debug workflows in one local environment.

open-sourceOpen SourceTelemetry

Headroom

Context compression for LLM apps and coding agents

Headroom is an Apache-2.0 context compression layer for LLM apps and coding agents. It compresses tool output, logs, files, RAG chunks, and agent history through a local library, proxy, wrapper, or MCP server, with retrieval hooks for bringing originals back when needed. Treat its savings numbers as Headroom-reported benchmarks, not independent aicoolies measurements.

open-sourceOpen SourceTelemetry

Codebase Memory MCP

Codebase knowledge graph MCP server for AI coding agents

Codebase Memory MCP is an MIT-licensed MCP server that turns a repository into a persistent code knowledge graph for AI coding agents. It gives Claude Code, Cursor, Codex-style agents, and other MCP clients structural queries for functions, classes, call chains, routes, and architecture, helping them explore large projects without repeatedly rereading files or relying only on broad search.

open-sourceOpen SourceTelemetry

BeeAI Framework

Python and TypeScript framework for production multi-agent systems

BeeAI Framework is an Apache-2.0 toolkit for building production-ready AI agents and multi-agent systems in Python and TypeScript. Its docs cover agents, tools, RAG, memory, workflows, backend providers, serving, and A2A/MCP integration surfaces, making it a vendor-neutral option for teams comparing LangGraph, CrewAI, Mastra, and related agent runtimes.

open-sourceOpen SourceTelemetry

Klavis AI

MCP integration platform for agent tool use at scale

Klavis AI is an Apache-2.0 MCP integration platform for teams connecting AI agents to external SaaS tools and APIs. The public repo and official docs position it as infrastructure for reliable tool access at scale, so it fits teams that want reusable MCP connectors without treating every integration as a one-off script or custom OAuth maintenance project.

open-sourceOpen SourceTelemetry

Used in Stacks

Browser Automation for AI Agents Stack

Give your AI agents full web interaction: Firecrawl for web data extraction, Browser Use for autonomous browsing, Stagehand for structured browser automation, and Hyperbrowser for cloud browser infrastructure.

varies

Comparisons

Browser Use vs UI-TARS Desktop: Browser Agent Framework or Vision-Based Desktop Automation?

Browser Use and UI-TARS Desktop both help AI agents operate graphical interfaces, but they start from different surfaces. Browser Use focuses on web browser automation with an LLM-friendly Python and Playwright stack. UI-TARS Desktop uses multimodal vision to control desktop and browser interfaces like a human operator. Choose Browser Use for most web automation and agent workflows; choose UI-TARS Desktop when the task must cross native desktop apps or visual-only interfaces.

Browser UseUI-TARS Desktop

Midscene.js vs Browser Use — Vision AI Automation SDK vs Python Browser Agent Framework

Midscene.js provides a JavaScript SDK for vision-driven UI automation across web, Android, and iOS platforms. Browser Use offers a Python framework for building browser-controlling AI agents with autonomous navigation and task completion. Browser Use wins for autonomous agent workflows while Midscene.js wins for structured cross-platform test automation.

Midscene.jsBrowser Use

Skyvern vs Browser Use — AI Vision Automation vs LLM-Powered Browser Agent

Skyvern and Browser Use both automate web browsers with AI, but use fundamentally different techniques. Skyvern combines LLMs with computer vision to understand pages visually — no DOM parsing needed. Browser Use leverages LLMs to reason about page structure and generate browser actions. Both eliminate brittle CSS selectors, but the approaches have different strengths for different automation scenarios.

SkyvernBrowser Use

BrowserMCP vs Browser Use — MCP Browser Automation vs AI Browser Agent Compared

BrowserMCP and Browser Use both enable AI agents to interact with web browsers, but through different architectures. BrowserMCP is an MCP server that gives any MCP-compatible AI assistant (Claude Desktop, Cursor) direct Chrome control with 6.1K+ stars. Browser Use is a standalone Python library with 65K+ stars that provides browser automation capabilities for custom AI agent applications built with LangChain or other frameworks.

BrowserMCPBrowser Use

Browser-Use vs Stagehand — AI Browser Automation Comparison

Two open-source frameworks for AI-powered browser automation. Browser-Use enables LLMs to control browsers with natural language through Playwright, growing to 85K+ stars. Stagehand by Browserbase adds AI vision to Playwright with act/extract/observe primitives for more reliable web interactions.

Browser UseStagehand

Browser Use

Pricing

Platforms

Categories

Tags

Use Cases

Alternatives

Firecrawl MCP Server

BrowserMCP

Browserbase MCP Server

ScrapeGraphAI

Related Tools

Hermes Agent

Accomplish Coworker

Headroom

Codebase Memory MCP

BeeAI Framework

Klavis AI

Used in Stacks

Comparisons

Browser Use vs UI-TARS Desktop: Browser Agent Framework or Vision-Based Desktop Automation?

Midscene.js vs Browser Use — Vision AI Automation SDK vs Python Browser Agent Framework

Skyvern vs Browser Use — AI Vision Automation vs LLM-Powered Browser Agent

BrowserMCP vs Browser Use — MCP Browser Automation vs AI Browser Agent Compared

Browser-Use vs Stagehand — AI Browser Automation Comparison