aicoolies logo

Best tools for Testing & QA Automation

Writing tests, visual regression, and automated QA workflows

Showing 24 of 103 tools

Safari MCP Server

Apple's Safari-native MCP server for web debugging agents

Safari MCP Server is Apple's safaridriver-based MCP server in Safari Technology Preview, giving compatible coding agents local access to Safari page content, console logs, network requests, screenshots, JavaScript evaluation, interactions, viewport controls, and accessibility/performance checks.

freeTelemetry

Latitude

Sentry-style observability for AI agent conversations

Latitude is an agent observability platform for teams that need to inspect LLM traces, conversations, issues, and evaluation feedback in one workflow. Its public repo and docs position it as a Sentry-style monitor for AI agents, with semantic search, issue detection, annotations, MCP-assisted fixes, and cloud or self-hosted deployment paths for production debugging.

freemiumOpen SourceTelemetry
Windows-MCP logo

Windows-MCP

MCP server for controlling Windows desktops through UIAutomation

Windows-MCP is an open-source MCP server for giving AI agents structured access to Windows desktop automation. It focuses on UIAutomation, snapshots, input control, and Windows-specific app workflows, making it different from general filesystem or shell MCP servers.

open-sourceOpen Source
Webwright logo

Webwright

Microsoft browser agent that turns long-horizon web tasks into reusable Playwright code

Webwright is a Microsoft browser-agent project that asks coding models to write, debug, and reuse Playwright scripts instead of relying on one-off stochastic click loops. The approach gives automation teams a more inspectable artifact: scripts can be logged, reviewed, rerun, and maintained like normal test or scraping code. It is especially relevant for long-horizon browser tasks where teams care about determinism, auditability, and resilience to UI changes.

open-sourceOpen Source

agent-desktop

Accessibility-tree desktop automation engine for deterministic native-app control

agent-desktop is a Rust-native desktop automation engine for AI agents that need structured control of native applications without relying only on screenshots or pixel loops. It exposes accessibility-tree snapshots, stable element references, progressive traversal, and action primitives that can let coding agents and automation stacks operate on Windows, macOS, Electron, and legacy interfaces with lower token cost and better repeatability than pure vision control.

open-sourceOpen Source
Coasty logo

Coasty

Open computer-use platform for browser, terminal, and full desktop automation

Coasty is an open-source computer-use platform for teams that want AI agents to operate across browser, terminal, and full desktop surfaces instead of only clicking DOM nodes. The project combines planner/orchestrator logic, visual and input control, local Electron workflows, remote sandbox options, an MCP server, and logs for debugging long-running automations. It is a strong fit for QA, research-to-action, form workflows, and repetitive desktop tasks where browser-only agents are too narrow.

open-sourceOpen Source
Intuned Agent logo

Intuned Agent

Production-grade browser automation with AI self-healing and Playwright code ownership

Intuned is a code-first browser automation platform that turns natural language prompts into production-ready Playwright code, deploys it, and self-heals it when target sites change. Supports TypeScript and Python with Anthropic Computer Use, OpenAI CUA, Stagehand, Browser-Use, and Gemini Computer Use integrations. Built-in stealth, captcha solving, auth session management, and scheduled runs with concurrency control. No vendor lock-in—you own the code.

freemiumTelemetry
Requestly logo

Requestly

One tool for intercepting, mocking, and replaying HTTP — acquired by BrowserStack

Requestly is a BrowserStack-backed API client, HTTP interceptor, mock server, and session replay tool for frontend and QA teams. Its current product is commercial/API-client led, while the legacy interceptor/open-source code is AGPLv3. The free plan covers individual workflows, and Pro lists at $12/user/month monthly or $9/user/month annually for collaborative QA and frontend debugging teams.

freemium
Evolver logo

Evolver

Self-evolution engine for AI agents with auditable updates

Evolver is an open-source self-evolution engine for AI agents that turns run logs into auditable, reviewable updates via its Genome Evolution Protocol. Instead of ad hoc prompt tweaking, teams collect traces and Evolver proposes versioned diffs to prompts, tools and workflows that engineers can approve, reject or roll back like code.

open-sourceOpen Source
chrome-devtools-mcp logo

chrome-devtools-mcp

Official Chrome DevTools MCP server for coding agents

chrome-devtools-mcp is the Chrome DevTools team's official MCP server that lets coding agents control and inspect a live Chrome browser with first-party Chrome DevTools Protocol fidelity. It exposes Network inspection, Performance traces, Lighthouse audits, console output, and structured DOM snapshots as typed MCP tools, so agents can debug real pages and ship reliable web performance investigations without resorting to brittle DOM scraping.

open-sourceOpen Source
Browserbase logo

Browserbase

Headless browser cloud built for AI agents

Browserbase is cloud infrastructure that runs headless Chromium browsers on demand for AI agents and automation workflows, exposing Playwright, Puppeteer, and Selenium endpoints with built-in session replay, residential proxies, CAPTCHA solving, and stealth fingerprints. It also hosts Stagehand and a Model Gateway, letting teams build browser-using agents without maintaining their own fleet of Kubernetes-managed Chromium instances.

freemium
RagaAI Catalyst logo

RagaAI Catalyst

AI testing and evaluation for agents and LLM apps

RagaAI Catalyst is a comprehensive Python SDK for observability, monitoring, and evaluation of LLM and agentic applications. Provides agent tracing with execution graph visualization, self-hosted dashboard with analytics, synthetic data generation, multi-metric evaluation framework, and guardrail management. Built for teams running production RAG systems and AI agents who need systematic testing, debugging, and performance optimization workflows.

open-sourceOpen Source
Laminar logo

Laminar

Open-source observability for AI agents

Laminar is an open-source observability platform for AI agents providing tracing, evaluation, and analytics for LLM applications. It integrates with Vercel AI SDK, LangChain, OpenAI, and Anthropic with a single line of code. Features include OpenTelemetry-native SDKs, an extensible evaluation framework with CI/CD support, SQL access to traces and metrics, and a visual debugging timeline for agent reasoning and actions.

freemiumOpen Source
Great Expectations logo

Great Expectations

Data quality validation framework for Python

Great Expectations is an open-source Python framework for validating, documenting, and profiling data quality. Teams define expectations as expressive unit tests for their data using an intuitive API, then validate datasets against those rules in CI/CD pipelines or production workflows. It connects to pandas, Spark, and SQL sources, generates data documentation automatically, and integrates with orchestrators like Airflow and Prefect for continuous data quality monitoring.

freemiumOpen Source
Lychee logo

Lychee

Fast async link checker written in Rust

Lychee is a fast, asynchronous link checker written in Rust that finds broken URLs and email addresses in Markdown, HTML, reStructuredText, and websites. Available as a CLI tool, Rust library, and GitHub Action, it validates links with configurable concurrency, rate limiting, and retry logic. Supports GitHub token authentication for API rate limit avoidance and can check both internal file links and external HTTP endpoints across entire repositories or websites.

open-sourceOpen Source
Lost Pixel logo

Lost Pixel

Open-source visual regression testing tool

Lost Pixel is an open-source visual regression testing tool that serves as an alternative to Percy and Chromatic. It captures and compares screenshots of UI components and application pages across Storybook, Ladle, Histoire, and custom screenshot sources like Cypress or Playwright. Integrated directly into GitHub Actions pipelines, it detects unintended visual changes before they reach production, with a free SaaS tier available for open-source projects.

freemiumOpen Source

Agentic Security

LLM vulnerability scanner and red teaming kit

Agentic Security is an open-source vulnerability scanner for LLM agent workflows that tests AI systems against jailbreaks, fuzzing, and multimodal attacks. It probes weaknesses across text, image, and audio inputs through multi-step jailbreak simulations, randomized stress testing, and reinforcement learning-powered adaptive attacks. The toolkit connects directly to LLM APIs for high-volume real-world attack scenarios, helping developers identify and patch safety gaps before deployment.

open-sourceOpen Source

Playwright MCP

Microsoft's MCP server for structured browser automation by AI agents

Playwright MCP is Microsoft's Model Context Protocol server that enables AI agents to automate web browsers through structured tool calls. It exposes Playwright's browser automation capabilities as MCP tools for navigation, clicks, forms, extraction, and screenshots. The Microsoft-maintained repo has 30K+ GitHub stars and is a durable default for structured browser interaction in agent workflows.

open-sourceOpen Source

qodo-cover

AI-powered test generation agent for automated code coverage improvement

qodo-cover (formerly Cover Agent) is an open-source AI agent that automatically generates meaningful unit tests to improve code coverage. It analyzes existing code and test patterns to produce tests that follow project conventions and target uncovered branches. Uses an iterative approach where generated tests are verified by running them, discarding those that fail. MIT licensed with over 5,300 GitHub stars.

open-sourceOpen Source

Krkn

CNCF Sandbox chaos engineering framework for Kubernetes resilience

Krkn is a CNCF Sandbox chaos engineering tool that tests Kubernetes cluster resilience by injecting controlled failures. It simulates pod kills, node failures, network partitions, CPU/memory pressure, and zone outages. Krkn-AI adds AI-powered scenario generation that suggests chaos experiments based on cluster topology. Supports CI/CD integration for automated resilience testing in deployment pipelines.

open-sourceOpen Source
LambdaTest KaneAI logo

LambdaTest KaneAI

GenAI-powered test agent with natural language test authoring

KaneAI is LambdaTest's GenAI-powered test automation agent that creates, evolves, and debugs tests from natural language descriptions. It generates test scripts in multiple frameworks including Selenium, Playwright, and Cypress from plain English instructions. Features intelligent test maintenance that automatically updates tests when application UI changes and two-way editing between natural language and code.

paid
Ghost Inspector logo

Ghost Inspector

Codeless browser testing with visual test recorder and scheduling

Ghost Inspector provides codeless browser testing through a visual recorder that captures user interactions and converts them into automated test suites. Tests run on managed infrastructure with scheduled execution, CI/CD integration, and Slack notifications. Features visual comparison for UI regression detection, API testing, and test organization with folders and tags for managing large test suites.

freemium
testRigor logo

testRigor

AI-powered E2E testing with plain English test authoring

testRigor enables end-to-end test creation in plain English without coding or element selectors. Tests describe user actions in natural language like 'click on the Submit button' and testRigor's AI interprets and executes them across web, mobile, and API. Self-healing tests automatically adapt to UI changes. Supports cross-browser testing, visual validation, and integration with CI/CD pipelines.

freemium
Schemathesis logo

Schemathesis

Property-based API fuzz testing from OpenAPI and GraphQL schemas

Schemathesis automatically generates test cases from OpenAPI and GraphQL schemas to find crashes, validation errors, and specification violations in APIs. It uses property-based testing and fuzzing techniques to explore edge cases that manual test writing misses. CLI tool and Python library with CI/CD integration. 3.4K+ GitHub stars with support for authentication, custom checks, stateful testing, JUnit XML, and Allure reports.

open-sourceOpen Source