66 tools tagged
Autonomous coding agents that ship while you sleep
Twill is an autonomous coding agent platform that implements features, fixes bugs, and ships pull requests without manual intervention. It follows a structured workflow: research, planning, human review, implementation in an isolated sandbox, AI code review, then merge. It supports custom agent configurations with multiple LLM providers, isolated dev environments for verification, and integrations with GitHub, Linear, Sentry, Notion, and cloud platforms for end-to-end engineering automation.
Task runner for Python with Poetry and uv
Poethepoet (poe) is a batteries-included task runner for Python projects that integrates with the Poetry and uv package managers. Define tasks in pyproject.toml, compose them into sequential, parallel, or DAG workflows, and execute them with full virtual environment context. It supports shell commands, Python scripts, environment variables, .env file loading, and auto-generated shell completion for bash, zsh, and fish.
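To illustrate what a DAG workflow means for task ordering, here is a minimal sketch in plain Python. It is not poe's implementation or configuration syntax; the task names and the `execution_batches` helper are hypothetical, and the standard-library `graphlib` module stands in for whatever scheduler a task runner actually uses.

```python
from graphlib import TopologicalSorter

# Hypothetical task graph: each task maps to the set of tasks it depends on.
TASKS = {
    "lint": set(),
    "test": {"lint"},
    "build": {"lint"},
    "publish": {"test", "build"},
}


def execution_batches(graph):
    """Group tasks into batches whose dependencies are already satisfied.

    Tasks inside one batch do not depend on each other, so a runner
    could execute them in parallel.
    """
    sorter = TopologicalSorter(graph)
    sorter.prepare()  # raises CycleError if the graph is not a DAG
    batches = []
    while sorter.is_active():
        ready = sorted(sorter.get_ready())  # sorted only for stable output
        batches.append(ready)
        sorter.done(*ready)
    return batches


batches = execution_batches(TASKS)
```

With this graph, `lint` runs first, `build` and `test` can run side by side, and `publish` runs last.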
Container-based CI/CD automation system
Concourse is an open-source CI/CD system built on composable primitives: resources for external artifacts, tasks for containerized work units, and jobs for orchestration. All pipelines are declarative YAML with version control, every task runs in an isolated container, and stateless workers enable horizontal scaling. Deployable via BOSH, Helm, Docker Compose, or standalone binary across any infrastructure.
Open-source feature flag management platform
Unleash is the largest open-source feature flag platform, enabling teams to decouple deployment from release with gradual rollouts, A/B testing, and trunk-based development. It provides 15+ official SDKs for server and client frameworks, a web-based admin dashboard for managing feature toggles, and supports activation strategies like percentage rollout, user targeting, and environment-based rules. Self-hostable via Docker with PostgreSQL storage.
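A percentage rollout can be sketched as a deterministic hash of the user and flag into a bucket from 1 to 100. This is an illustrative assumption, not Unleash's SDK code: the function names are hypothetical, and SHA-256 is used here only because it ships with Python.

```python
import hashlib


def rollout_bucket(user_id: str, flag_name: str) -> int:
    """Map a (flag, user) pair to a stable bucket in the range 1..100."""
    digest = hashlib.sha256(f"{flag_name}:{user_id}".encode("utf-8")).digest()
    # Use the first four bytes as an unsigned integer, then fold into 1..100.
    return int.from_bytes(digest[:4], "big") % 100 + 1


def is_enabled(user_id: str, flag_name: str, percentage: int) -> bool:
    """Return True if the user's bucket falls inside the rollout percentage."""
    return rollout_bucket(user_id, flag_name) <= percentage
```

Because the bucket depends only on the flag name and user ID, a user who sees the feature at 25% keeps seeing it as the rollout grows to 50% and beyond.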
Fast async link checker written in Rust
Lychee is a fast, asynchronous link checker written in Rust that finds broken URLs and email addresses in Markdown, HTML, reStructuredText, and websites. Available as a CLI tool, Rust library, and GitHub Action, it validates links with configurable concurrency, rate limiting, and retry logic. It supports GitHub token authentication to avoid API rate limits and can check both internal file links and external HTTP endpoints across entire repositories or websites.
Open-source visual regression testing tool
Lost Pixel is an open-source visual regression testing tool that serves as an alternative to Percy and Chromatic. It captures and compares screenshots of UI components and application pages across Storybook, Ladle, Histoire, and custom screenshot sources like Cypress or Playwright. Integrated directly into GitHub Actions pipelines, it detects unintended visual changes before they reach production, with a free SaaS tier available for open-source projects.
Code management platform for AI coding agents
Entire is an AI-native code management platform designed for teams managing fleets of AI coding agents. Founded by former GitHub CEO Thomas Dohmke, it records the reasoning and instructions behind AI-generated code alongside the code itself, making agent-written software easier to review, audit, and maintain. It launches with Checkpoints, an open-source CLI tool that logs prompts and token usage for Claude Code and Google Gemini CLI, with more agent integrations planned.
Database CI/CD and DevSecOps platform at scale
Bytebase is an open-source database DevSecOps platform that automates schema migrations, enforces SQL standards with 200+ lint rules, and provides fine-grained access control with dynamic data masking. Teams use it for GitOps-based database change management, SQL review, and compliance across PostgreSQL, MySQL, MongoDB, Snowflake, Oracle, SQL Server, and 20+ other databases. Available as a self-hosted Docker or Kubernetes deployment, or as a managed cloud service.
GitHub's Kubernetes controller for autoscaling GitHub Actions runners
actions-runner-controller (ARC) is GitHub's official Kubernetes controller for managing self-hosted GitHub Actions runners. It automatically scales runner pods up and down based on workflow demand, provisioning runners when jobs queue and terminating them when complete. Supports runner groups, custom runner images, and organization-level runner management. Over 6,100 GitHub stars.
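Demand-based scaling of this kind can be sketched as a clamp between a minimum and maximum runner count. The formula and the `desired_runners` function below are illustrative assumptions, not ARC's actual scaling logic or API.

```python
def desired_runners(queued_jobs: int, busy_runners: int,
                    min_runners: int, max_runners: int) -> int:
    """Return how many runner pods should exist for the current demand.

    Assumes one job per runner: keep every busy runner, add one runner
    per queued job, and stay within the configured bounds.
    """
    wanted = queued_jobs + busy_runners
    return max(min_runners, min(wanted, max_runners))
```

For example, five queued jobs and two busy runners with bounds of 1 and 10 yields seven runners; an idle queue falls back to the minimum.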
Ephemeral preview environments for every pull request
Bunnyshell automates the creation of full-stack ephemeral environments that spin up for every pull request and tear down when merged. Each environment is a complete replica of the application stack including databases, APIs, and frontend services. Integrates with GitHub, GitLab, and Bitbucket to provide shareable preview URLs for testing, QA, and stakeholder review.
Managed Docker build acceleration with up to 40x faster builds
Depot provides managed infrastructure for dramatically faster Docker image builds. It uses persistent build caches, native Intel and ARM builders, and optimized build scheduling to achieve up to 40x faster builds compared to standard Docker build workflows. Drop-in replacement for docker build that requires no Dockerfile changes. Used by major engineering teams to cut CI/CD pipeline times.
Hybrid CI/CD platform with self-hosted agents and cloud orchestration
Buildkite is a hybrid CI/CD platform that separates orchestration from execution. A cloud-hosted control plane manages pipeline coordination and UI while open-source agents run builds on your own infrastructure. Used by Shopify, Airbnb, Uber, and Tinder for internet-scale deployments. Supports 100,000+ parallel jobs with P95 billing that ignores usage spikes.
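To see why a 95th-percentile figure is insensitive to short spikes, consider this sketch using the nearest-rank percentile method. The sample data and function name are hypothetical, and Buildkite's exact billing calculation is not described here, so treat this purely as an illustration of the statistic.

```python
import math


def percentile_nearest_rank(samples, pct):
    """Return the pct-th percentile of samples using the nearest-rank method."""
    ordered = sorted(samples)
    rank = math.ceil(pct * len(ordered) / 100)  # 1-based rank
    return ordered[max(rank, 1) - 1]


# Hypothetical usage samples: steady load of 10 with four brief spikes to 500.
usage = [10] * 96 + [500] * 4

peak = max(usage)
p95 = percentile_nearest_rank(usage, 95)
```

Here the peak is 500, but the 95th percentile stays at 10 because the spikes make up less than 5% of the samples.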
Self-hosted UI and API for Ansible, Terraform, and scripts
Semaphore UI provides a web interface and REST API for running Ansible playbooks, Terraform and OpenTofu configurations, Bash scripts, and PowerShell commands from a centralized self-hosted platform. With over 13,000 GitHub stars and 2 million Docker pulls, it replaces AWX and manual terminal execution with a polished dashboard for scheduling, access control, notifications, and execution history across mixed infrastructure automation environments.
IaC orchestration layer for scaling Terraform and OpenTofu
Terragrunt is an infrastructure-as-code orchestration tool that wraps Terraform and OpenTofu to keep configurations DRY, manage remote state, and coordinate multi-module deployments. The 1.0 release introduced stacks, filters, run reports, and backward compatibility guarantees after 900+ releases and tens of millions of infrastructure deployments. It provides a thin orchestration layer that eliminates duplication across environments without replacing the underlying IaC tools.
Run GitHub Actions 2x faster at half the cost on bare-metal gaming CPUs
Blacksmith is a drop-in replacement for GitHub-hosted runners that executes Actions on bare-metal gaming CPUs with higher single-core performance. Migration requires a one-line change in the workflow YAML. Features colocated warm caches, persistent Docker layer caching on NVMe, CI observability with log search, and Firecracker microVM isolation. SOC 2 Type II certified, pay-as-you-go at ~$0.004/min versus GitHub's $0.008/min.
Free open-source local AWS emulator as a drop-in LocalStack replacement
Floci is a free open-source AWS emulator designed as a lightweight drop-in replacement for LocalStack Community Edition. It runs on port 4566 with the same endpoint conventions, supporting S3, SQS, DynamoDB, RDS, ElastiCache, API Gateway, Cognito, IAM, and 20+ other services. The Docker image is 90 MB versus LocalStack's 1 GB, and the emulator starts in 24 milliseconds.
Static linter that catches production bugs in AI-generated code
prodlint is a zero-config static analysis tool with 52 rules targeting production bugs that AI coding tools consistently produce. It catches hallucinated npm imports, missing authentication checks, Prisma writes outside transactions, exposed secrets via NEXT_PUBLIC prefixes, and other patterns specific to code generated by Cursor, Claude Code, Bolt, and v0. Runs in one second via npx with no configuration needed.
Lightweight self-hosted CI/CD engine for the Gitea stack
Woodpecker CI is a community-maintained open-source CI/CD engine forked from Drone CI, designed as the default continuous integration solution for self-hosted Git platforms like Gitea, Forgejo, and Codeberg. With simple YAML pipeline configuration, minimal resource usage (~100MB RAM), and Apache-2.0 licensing, it provides a lightweight alternative to Jenkins and GitHub Actions for developers who prefer infrastructure they control.
AI-powered E2E test generation and maintenance platform
Octomind is an AI-powered testing platform that automatically generates, runs, and maintains end-to-end Playwright tests for web applications. It observes user flows, creates test cases from natural language descriptions, and self-heals tests when UI changes would break traditional selectors. Backed by $4.8M seed funding from Paua Ventures with enterprise production deployments.
Developer productivity platform with merge queues and flaky test detection
Aviator is a developer productivity platform combining merge queues, stacked PRs, automated code review, and flaky test management. Its merge queue prevents broken main branches by testing PRs in order before merging. Flaky test detection identifies unreliable tests causing CI failures. Founded by ex-Google engineers who built internal developer tools at scale. YC-backed with $2.3M seed from Elad Gil. Used by Bosch, Benchling, and Lightspeed.
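The idea of testing PRs in order before merging can be sketched as follows. This is a simplified model, not Aviator's implementation: `process_merge_queue` and `passes_ci` are hypothetical names, and a branch is represented as a plain list of change IDs.

```python
from collections import deque


def process_merge_queue(main, queue, passes_ci):
    """Merge queued PRs one at a time, testing each against the latest main.

    main: list of change IDs already on the main branch.
    queue: PR IDs in the order they were enqueued.
    passes_ci: callable that takes a candidate branch state and returns bool.
    Returns (new_main, rejected).
    """
    merged = list(main)
    rejected = []
    pending = deque(queue)
    while pending:
        pr = pending.popleft()
        candidate = merged + [pr]  # main plus everything merged so far
        if passes_ci(candidate):
            merged = candidate
        else:
            rejected.append(pr)  # eject the PR; main stays green
    return merged, rejected
```

Because every PR is tested on top of the changes merged before it, a PR that only breaks in combination with an earlier one is caught before it reaches main.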
Slack-native incident management with AI SRE agent
Incident.io is a Slack-native incident management platform with an AI SRE that autonomously investigates alerts, correlates deployments with telemetry, and drafts fix pull requests. Used by Buffer (70% fewer critical incidents), Favor (37% MTTR reduction), Intercom, and Productboard. Features include automated workflows, on-call scheduling, post-incident learning, and status pages. Integrates with PagerDuty, Datadog, GitHub, Jira, and 100+ tools.
Unified API, performance, and contract testing DSL
Karate is an open-source testing framework that unifies API testing, performance testing, UI automation, and contract testing in a single BDD-style DSL. Write tests in plain Gherkin-like syntax without any Java knowledge. Built-in assertions, data-driven testing, parallel execution, and HTML reports. 8,200+ GitHub stars, MIT licensed. 7+ years of active development with Global 2000 enterprise adoption for comprehensive API quality assurance.
Meta's open-source LLM security suite with Llama Guard and CodeShield
PurpleLlama is Meta's open-source suite of tools for evaluating and improving LLM safety. It includes Llama Guard models for input/output content safety classification, LlamaFirewall for multi-layer defense, CodeShield for insecure code detection, and CyberSecEval benchmarks for measuring LLM security. Llama Guard 4 supports multimodal safety across text and images. 4,100+ GitHub stars, backed by Meta AI with 44+ contributors.
Mutation testing framework to validate test quality
Stryker Mutator is an open-source mutation testing framework for JavaScript, TypeScript, C#, and Scala that measures the real effectiveness of your test suite. It introduces small code mutations and checks whether tests catch them, revealing gaps that line coverage metrics miss. Supports incremental mode for CI/CD integration.
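The core idea of mutation testing can be shown with a small Python sketch. Stryker itself targets JavaScript, TypeScript, C#, and Scala; the code below is not Stryker's API but an illustration using Python's `ast` module, and every name in it (`is_adult`, `FlipGte`, the two suites) is hypothetical.

```python
import ast

SOURCE = """
def is_adult(age):
    return age >= 18
"""


class FlipGte(ast.NodeTransformer):
    """Mutation operator: replace every '>=' comparison with '>'."""

    def visit_Compare(self, node):
        self.generic_visit(node)
        node.ops = [ast.Gt() if isinstance(op, ast.GtE) else op
                    for op in node.ops]
        return node


def load(tree):
    """Compile a module AST and return its is_adult function."""
    namespace = {}
    exec(compile(tree, "<mutant>", "exec"), namespace)
    return namespace["is_adult"]


def weak_suite(fn):
    """Covers every line but never checks the boundary value."""
    return fn(30) is True and fn(5) is False


def strong_suite(fn):
    """Adds the boundary case age == 18."""
    return weak_suite(fn) and fn(18) is True


original = load(ast.parse(SOURCE))
mutant = load(ast.fix_missing_locations(FlipGte().visit(ast.parse(SOURCE))))

# A mutant is "killed" when the suite fails against it.
weak_kills = not weak_suite(mutant)
strong_kills = not strong_suite(mutant)
```

Both suites pass on the original code and both execute every line, yet only the suite with the boundary check kills the mutant. That surviving mutant is the kind of gap line coverage cannot reveal.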