aicoolies logo
Trunk logo

Trunk

AI-powered CI reliability and flaky test management

Share
freemium
Visit Website →

Trunk is a developer tools platform that tackles CI reliability through AI-powered flaky test detection, automatic quarantine, and merge queue management. It uses ML-based statistical analysis to identify flaky tests, isolates them to prevent pipeline blocks, and creates GitHub issues for resolution. Used by Zillow, Brex, and Faire, with $28.5M in funding and support for all major test frameworks.

Trunk addresses one of the most persistent problems in CI/CD: flaky tests that waste engineering time and block deployments. The platform uses machine learning to analyze test results across multiple CI runs, identifying tests whose pass/fail status is non-deterministic. When a flaky test is detected, Trunk automatically quarantines it — preventing it from blocking the deployment pipeline while creating a tracked issue for the responsible engineer to investigate and fix the underlying instability.

Beyond flaky test management, Trunk provides a merge queue that ensures the main branch stays green by testing PRs in sequence before merging, and a code quality suite that includes linting, formatting, and static analysis with support for 100+ tools. The platform integrates with GitHub, GitLab, and all major CI systems including GitHub Actions, CircleCI, Jenkins, and Buildkite. Its language-agnostic approach works with any test framework — Jest, pytest, Playwright, Cypress, JUnit, RSpec, and more.

Trunk has raised $28.5M in funding and serves engineering teams at Zillow, Brex, Faire, Waabi, and Metabase. The free tier supports up to 5 committers with 5 million test spans per month, while the Team plan at $18 per committer per month adds advanced analytics and priority support. For organizations where CI reliability directly impacts deployment velocity, Trunk provides the observability and automation needed to keep pipelines moving without sacrificing test coverage quality.

Pricing

Free up to 5 committers; Team $18/committer/mo

Platforms

Web dashboard + CLI — integrates with all major CI/CD

Categories

Tags

Use Cases

Alternatives

Related Tools

Safari MCP Server

Apple's Safari-native MCP server for web debugging agents

Safari MCP Server is Apple's safaridriver-based MCP server in Safari Technology Preview, giving compatible coding agents local access to Safari page content, console logs, network requests, screenshots, JavaScript evaluation, interactions, viewport controls, and accessibility/performance checks.

freeTelemetry

KubeAI

Kubernetes operator for serving AI inference workloads

KubeAI is an Apache-2.0 Kubernetes operator for deploying and scaling AI inference workloads, including LLMs, embeddings, reranking, and speech-to-text. It gives platform teams OpenAI-compatible endpoints, model proxy/controller primitives, model caching, scale-from-zero behavior, and cluster-native resource management for self-hosted inference on Kubernetes.

open-sourceOpen Source
rampart

Rampart

Microsoft’s pytest-native red teaming framework for turning AI agent safety findings into CI tests.

RAMPART is an open-source Microsoft framework for safety and security testing of agentic AI applications. It brings red-team findings into a pytest-native workflow so teams can turn prompt injection, unsafe tool use, and behavioral boundary failures into repeatable regression tests. The strongest aicoolies angle is developer workflow: RAMPART makes agent safety part of CI/CD instead of a one-off security review.

open-sourceOpen Source
Requestly logo

Requestly

One tool for intercepting, mocking, and replaying HTTP — acquired by BrowserStack

Requestly is a BrowserStack-backed API client, HTTP interceptor, mock server, and session replay tool for frontend and QA teams. Its current product is commercial/API-client led, while the legacy interceptor/open-source code is AGPLv3. The free plan covers individual workflows, and Pro lists at $12/user/month monthly or $9/user/month annually for collaborative QA and frontend debugging teams.

freemium
Freestyle logo

Freestyle

Sandboxes for coding agents — Linux VMs, Git, and deploys in one box

Freestyle is YC-backed sandbox infrastructure built for AI coding agents, shipping secure Linux VMs with nested virtualization, Git servers, and one-click web deploys. It lets agents run real workloads, branch repos, and deploy apps under short-lived identities while billing only for active compute. Used in production by vly.ai, Rork, and Vibeflow.

freemium
OpenSRE logo

OpenSRE

Open-source toolkit for building AI SRE incident response agents

OpenSRE is Tracer Cloud’s open-source public-alpha Python toolkit for building AI SRE agents that investigate and respond to production incidents. It ships 60+ tools across observability, databases, incident management, communications, deployment and protocol integrations, plus simulation/evaluation workflows for benchmarking agent accuracy before live pager use.

open-sourceOpen Source