aicoolies logo
Tusk logo

Tusk

AI test generation from production traffic

Share
freemiumOpen Source
Visit Website →

Tusk is a Y Combinator W24-backed AI testing platform that converts real production traffic into unit and API tests, catching regressions in 43% of PRs. Its Drift SDK records live API traces with just 10 lines of code, then AI generates executable test cases covering thousands of edge cases from actual user behavior, auto-maintaining suites as application logic evolves without manual script writing.

We have a review for this tool

A detailed review by the aicoolies team — click to read

Tusk takes a fundamentally different approach to test generation by using real production traffic as the source of truth rather than relying on developers to imagine test scenarios. The Drift SDK records API calls in production, capturing request-response pairs across thousands of real user interactions. AI then filters and transforms these traces into high-quality unit and API test cases that cover edge cases developers would never think to test manually.

The platform runs generated tests in ephemeral sandboxes and self-iterates when tests fail, eliminating the back-and-forth typically required with AI coding assistants. Tests are automatically matched to relevant code changes in pull requests and run in CI, with 69% of Tusk-generated tests being incorporated into PRs. The system also ingests business context from Jira and Linear to generate tests aligned with team practices and requirements.

Tusk helped Promptfoo add over 2,000 tests and catch 56 bugs across their codebase while automatically maintaining the test suite to remove flaky tests. The platform offers a 14-day free trial with paid plans for teams. PII redaction rules and domain filtering ensure sensitive production data never leaks into test fixtures. Node.js and Python are supported with the open-source SDK and CLI.

Pricing

Free plan with 14-day Team trial; Team $50/month per active developer; Business $95/seat/month; Enterprise custom

Platforms

GitHub, Node.js, Python, CI/CD, Jira, Linear

Categories

Tags

Use Cases

Alternatives

Related Tools

Safari MCP Server

Apple's Safari-native MCP server for web debugging agents

Safari MCP Server is Apple's safaridriver-based MCP server in Safari Technology Preview, giving compatible coding agents local access to Safari page content, console logs, network requests, screenshots, JavaScript evaluation, interactions, viewport controls, and accessibility/performance checks.

freeTelemetry
rampart

Rampart

Microsoft’s pytest-native red teaming framework for turning AI agent safety findings into CI tests.

RAMPART is an open-source Microsoft framework for safety and security testing of agentic AI applications. It brings red-team findings into a pytest-native workflow so teams can turn prompt injection, unsafe tool use, and behavioral boundary failures into repeatable regression tests. The strongest aicoolies angle is developer workflow: RAMPART makes agent safety part of CI/CD instead of a one-off security review.

open-sourceOpen Source
Requestly logo

Requestly

One tool for intercepting, mocking, and replaying HTTP — acquired by BrowserStack

Requestly is a BrowserStack-backed API client, HTTP interceptor, mock server, and session replay tool for frontend and QA teams. Its current product is commercial/API-client led, while the legacy interceptor/open-source code is AGPLv3. The free plan covers individual workflows, and Pro lists at $12/user/month monthly or $9/user/month annually for collaborative QA and frontend debugging teams.

freemium
Browserbase logo

Browserbase

Headless browser cloud built for AI agents

Browserbase is cloud infrastructure that runs headless Chromium browsers on demand for AI agents and automation workflows, exposing Playwright, Puppeteer, and Selenium endpoints with built-in session replay, residential proxies, CAPTCHA solving, and stealth fingerprints. It also hosts Stagehand and a Model Gateway, letting teams build browser-using agents without maintaining their own fleet of Kubernetes-managed Chromium instances.

freemium
Anchor Browser logo

Anchor Browser

Cloud browser infrastructure for AI agents

Anchor Browser provides secure cloud-managed browser infrastructure for computer-use agents. Deploy humanized Chromium instances that access any website while maintaining bot-detection evasion and authentication support. Features OmniConnect for authentication lifecycle management, Web Action Cache for deterministic workflows, and built-in VPN infrastructure. Includes free tier and paid plans supporting millions of concurrent browser sessions for scalable agent automation.

freemium
RagaAI Catalyst logo

RagaAI Catalyst

AI testing and evaluation for agents and LLM apps

RagaAI Catalyst is a comprehensive Python SDK for observability, monitoring, and evaluation of LLM and agentic applications. Provides agent tracing with execution graph visualization, self-hosted dashboard with analytics, synthetic data generation, multi-metric evaluation framework, and guardrail management. Built for teams running production RAG systems and AI agents who need systematic testing, debugging, and performance optimization workflows.

open-sourceOpen Source

Used in Stacks

Comparisons