aicoolies logo

# browser

12 tools tagged

Showing 12 of 12 tools

UI-TARS Desktop

ByteDance's open-source multimodal desktop agent with vision-based GUI automation

UI-TARS Desktop is ByteDance's open-source multimodal AI agent that automates desktop and browser interactions using computer vision rather than DOM selectors or accessibility APIs. Powered by the UI-TARS vision model, it can understand and operate any graphical interface by looking at screenshots, making it capable of automating applications that traditional browser automation tools cannot reach, including native desktop apps and complex web UIs.

open-sourceOpen Source
Stagewise logo

Stagewise

Browser-native frontend coding agent for production codebases

Stagewise is an open-source frontend coding agent with 6,500+ GitHub stars that runs directly in the browser on localhost. YC S25 backed, it lets developers and designers point-and-prompt on live web applications with full devtools and console access, bridging the gap between visual editing and production codebase modification.

open-sourceOpen Source

VibeVoice

Microsoft's open-source frontier voice AI for long-form multi-speaker audio

VibeVoice is Microsoft's open-source voice AI family with both TTS and speech recognition models. The TTS model generates up to 90 minutes of expressive multi-speaker audio with 4 distinct voices. VibeVoice-ASR transcribes 60-minute recordings in a single pass with speaker identification and timestamps. Built on continuous speech tokenizers at 7.5 Hz and next-token diffusion, it compresses audio 80x more efficiently than Encodec while preserving fidelity.

open-sourceOpen Source
Crawl4AI logo

Crawl4AI

High-performance open-source web crawler optimized for AI pipelines

Crawl4AI is an open-source Python web crawler built for AI and data-pipeline use cases. It produces LLM-ready Markdown, supports structured extraction, Playwright/browser automation, deep/adaptive crawling, proxy/security controls, anti-bot fallback patterns, and multiple output formats. With 68K+ GitHub stars and Apache-2.0 licensing, it is a strong local/self-hosted option for RAG datasets and agent data collection.

open-sourceOpen Source
Steel logo

Steel

Open-source browser infrastructure for AI agents at scale

Steel is an open-source browser API purpose-built for AI agents, providing managed headless browser sessions with anti-bot bypass, proxy rotation, CAPTCHA solving, and session persistence. It handles the infrastructure layer that browser automation agents like Browser Use and Stagehand run on top of. Self-hostable or available as a cloud service. Over 6,000 GitHub stars.

open-sourceOpen Source
Suna logo

Suna

Open-source generalist AI agent for browser and code tasks

Suna is an open-source generalist AI agent that can autonomously browse the web, write and execute code, manage files, and interact with external services. It features a real-time browser automation engine, an isolated code execution sandbox, and integrations with popular APIs. Designed as an open-source alternative to commercial AI agent platforms. Over 9,000 GitHub stars with rapid community growth.

open-sourceOpen Source

Browserbase MCP Server

Cloud browser automation via MCP for scalable testing

Browserbase MCP Server gives MCP clients a hosted or self-hostable browser through Browserbase and Stagehand. It exposes tools for starting sessions, navigating, acting, observing, extracting, and taking screenshots, with a hosted Streamable HTTP endpoint for easiest setup and local STDIO/Docker options for teams that want to run the Apache-licensed server themselves.

freemiumOpen Source

BrowserMCP

Automate local Chrome browser via MCP

BrowserMCP is an MCP server that enables AI agents to automate a local Chrome browser — navigating pages, clicking elements, filling forms, extracting content, and taking screenshots. It gives coding agents the ability to interact with web applications the way a human would, directly from Claude Desktop, Cursor, or any MCP client.

open-sourceOpen Source
Arc Browser logo

Arc Browser

A better way to use the internet

Chromium-based browser by The Browser Company that reimagines the browser UI with a sidebar for tabs, Spaces for context separation (work/personal), Boosts for custom CSS on any website, split views, easels for collecting web content, and built-in ad blocking. Features Arc Max AI for page summaries, tab renaming, and 5-second previews. Available on macOS, Windows, and iOS. Designed for power users who want to organize their browsing experience. Privacy-focused with no data collection.

free
Wappalyzer logo

Wappalyzer

Identify web technologies

Wappalyzer is a technology detection service that identifies the tech stack behind any website — frameworks, CMS, analytics, CDNs, payment processors, and 2,500+ other technologies. Available as a browser extension (Chrome, Firefox, Edge), API, CLI, and bulk lookup service. Useful for competitive analysis, sales prospecting, and lead enrichment. Founded 2009 by Elbert Alias, remains independent and bootstrapped — free extension plus paid API and enterprise tiers.

freemium
Polypane logo

Polypane

The browser for ambitious developers

Standalone browser built specifically for web developers and designers that shows multiple synchronized viewports side by side for responsive design testing. Features accessibility inspector (WCAG compliance checking), visual regression testing via screenshots, layout debugging overlays, meta tag validator, social media preview cards, and color contrast checker. Reduces the need for constant resizing and device switching during front-end development. Available on macOS, Windows, and Linux.

paid
Responsively logo

Responsively

Responsive web development tool

Open-source browser for responsive web development that shows multiple device viewports simultaneously with synchronized scrolling, clicking, and navigation. Preview your site on phone, tablet, and desktop sizes side by side in real-time. Features screenshot capture for all viewports, element inspector, network monitoring, and hot-reload support. Built on Chromium. Free alternative to Polypane for developers who need quick responsive testing without paid subscriptions. 23K+ GitHub stars.

open-sourceOpen Source