Best tools for DevOps Automation
Automating CI/CD pipelines, infrastructure provisioning, and deployment workflows
Showing 24 of 257 tools
KubeAI
Kubernetes operator for serving AI inference workloads
KubeAI is an Apache-2.0 Kubernetes operator for deploying and scaling AI inference workloads, including LLMs, embeddings, reranking, and speech-to-text. It gives platform teams OpenAI-compatible endpoints, model proxy/controller primitives, model caching, scale-from-zero behavior, and cluster-native resource management for self-hosted inference on Kubernetes.
Superserve
Open-source Firecracker sandboxes for long-running AI agents
Superserve is an open-source sandbox infrastructure layer for AI agents that need durable computers instead of short-lived shells. It runs isolated Firecracker microVMs, supports pause, resume, snapshot, fork, preview URLs, MCP connectivity, SDK/API control, Docker workloads, and self-hosting, while the hosted service adds pay-as-you-go agent sandboxes for teams.
Linear MCP Server
Official authenticated remote MCP endpoint for Linear issues, projects, comments, and coding-agent workflows.
Linear MCP Server is Linear’s official authenticated remote MCP endpoint for agent access to issues, projects, and comments. It gives Claude, Codex, Cursor, VS Code, Windsurf, Zed, and other clients a centrally hosted way to find, create, and update Linear work items through OAuth-backed MCP without maintaining a local connector or brittle API glue.
Slack MCP Server
Official Slack MCP server for approved workspace search, messaging, canvas, and user-context actions.
Slack MCP Server is Slack’s official remote MCP layer for giving approved AI clients workspace context and controlled actions. It lets agents search messages, files, users, and channels, draft or send messages, read threads, manage canvases, and authenticate through Slack OAuth while workspace admins approve integrations and normal Slack rate limits still apply.
Executor
MCP gateway and integration catalog for AI agents
Executor is an MIT-licensed integration layer and MCP gateway for AI agents. It gives Claude Code, Cursor, Codex, and other MCP-speaking clients one endpoint for connected OpenAPI specs, GraphQL APIs, MCP servers, Google Discovery sources, and custom JavaScript tools, with local, cloud, and self-hosted deployment options for teams centralizing tool access.
Windows-MCP
MCP server for controlling Windows desktops through UIAutomation
Windows-MCP is an open-source MCP server for giving AI agents structured access to Windows desktop automation. It focuses on UIAutomation, snapshots, input control, and Windows-specific app workflows, making it different from general filesystem or shell MCP servers.
kubectl-ai
Google’s open-source Kubernetes assistant that translates natural-language intent into precise cluster operations.
kubectl-ai is an AI-powered Kubernetes assistant from Google Cloud Platform. It acts as an intelligent interface for cluster work, translating operator intent into Kubernetes commands and workflows. The key distinction from reactive diagnosis tools is that kubectl-ai is designed as an interactive natural-language interface for planning and executing Kubernetes operations, with provider configuration and MCP-oriented workflows around the CLI.
Agent Governance Toolkit
Microsoft’s public-preview runtime governance toolkit for policy, identity, sandboxing, audit, and MCP security around AI agents.
Agent Governance Toolkit is Microsoft’s MIT-licensed public-preview toolkit for governing AI agent runtimes. It adds policy enforcement, zero-trust identity, execution sandboxing, audit, reliability, and MCP security-gateway patterns around tool calls and autonomous actions, helping platform teams move beyond prompt-only guardrails while preserving architecture review requirements.
Traceway
OpenTelemetry-native observability with AI tracing, logs, traces, metrics, and session replay — self-hosted in 90 seconds.
Traceway is an open-source, OpenTelemetry-native observability platform that combines logs, traces, metrics, exceptions, session replay, and AI tracing in a single self-hosted system. MIT licensed with no open-core restrictions, it deploys in 90 seconds via Docker Compose and accepts OTLP/HTTP from any OTel SDK without a Collector or per-language vendor SDK.
Judgeval
Open-source post-building layer for agents — tracing, evals, and online monitoring
Judgeval is the open-source post-building layer for AI agents from Judgment Labs, providing OpenTelemetry-based tracing, hosted and custom evaluation scorers, and online behavior monitoring for LLM-powered applications. Instrument any function with a single decorator, score live production traffic against faithfulness and instruction-adherence checks, and feed real-world failures back into reinforcement learning or supervised fine-tuning loops.
TraceRoot
Open-source observability and self-healing layer for AI agents
TraceRoot is a YC S25-backed open-source observability platform purpose-built for AI agents and LLM apps. It combines OpenTelemetry-compatible tracing with an agentic debugging runtime that reads your source code, correlates failures with recent commits, and proposes fix PRs automatically. BYOK support spans seven LLM providers; the entire stack runs self-hosted via Docker Compose, with TraceRoot Cloud available for managed deployments.
GraphBit
Rust-native multi-agent orchestration for production
GraphBit is a Rust-native, multi-agent orchestration framework built for production. It targets the gap between Python-first frameworks like LangGraph and the operational expectations of enterprise systems — predictable memory, low latency, deterministic concurrency, and the ability to embed an agent runtime in services that already run Rust without dragging in a Python interpreter.
OpenSRE
Open-source toolkit for building AI SRE incident response agents
OpenSRE is Tracer Cloud’s open-source public-alpha Python toolkit for building AI SRE agents that investigate and respond to production incidents. It ships 60+ tools across observability, databases, incident management, communications, deployment and protocol integrations, plus simulation/evaluation workflows for benchmarking agent accuracy before live pager use.
chrome-devtools-mcp
Official Chrome DevTools MCP server for coding agents
chrome-devtools-mcp is the Chrome DevTools team's official MCP server that lets coding agents control and inspect a live Chrome browser with first-party Chrome DevTools Protocol fidelity. It exposes Network inspection, Performance traces, Lighthouse audits, console output, and structured DOM snapshots as typed MCP tools, so agents can debug real pages and ship reliable web performance investigations without resorting to brittle DOM scraping.
GenericAgent
Self-evolving local computer agent with a reusable skill tree
GenericAgent is a minimal, self-evolving autonomous agent from a 3.3K-line seed and ~3K core loop that gives LLMs system-level control of a local computer. It writes files, runs shell commands, browses the web, and uses keyboard/mouse/screen/mobile tools, while skill crystallization saves successful runs into a reusable skill tree that cuts token cost on repeats.
CodeBurn
See where your AI coding tokens actually go
Open-source TUI dashboard and CLI that shows where your AI coding tokens actually go, broken down by task type, tool, model, MCP server, and project. CodeBurn reads local session data directly from Claude Code, Codex, Cursor, OpenCode, Pi, and GitHub Copilot — no wrapper, proxy, or API keys — and layers on one-shot success rates so you can see whether the AI nails work first try or burns budget on edit/test/fix retries. Ships with a macOS menu bar widget and CSV/JSON export.
Magika
AI-powered file-type detection at Google scale
Open-source AI-powered file-type detection tool from Google that uses a custom deep-learning model under a few megabytes to identify more than 200 binary and textual content types in milliseconds, even on a single CPU. Magika ships as a CLI, Python package, JavaScript/TypeScript library, and an ONNX model, achieves around 99% accuracy on its test set, and is already used at Google scale across Gmail, Drive, and Safe Browsing as well as by VirusTotal and abuse.ch.
Resolve AI
AI-powered production incident resolution
Resolve AI automates production incident investigation, diagnosis, and remediation acting as an AI SRE that participates in every on-call rotation. Autonomously investigates incidents pursuing multiple hypotheses in parallel, validates against real evidence, creates code snippets and drafts PRs, generates post-mortems, and onboards new teammates with instant answers about code and infrastructure. Drives 5x faster MTTR and 87% faster incident investigations.
Poethepoet
Task runner for Python with Poetry and uv
Poethepoet (poe) is a batteries-included task runner for Python projects that integrates with Poetry and uv package managers. Define tasks in pyproject.toml, compose them in sequential, parallel, or DAG workflows, and execute with full virtual environment context. Supports shell commands, Python scripts, environment variables, .env file loading, and auto-generated shell completion across bash, zsh, and fish for streamlined development workflows.
Concourse
Container-based CI/CD automation system
Concourse is an open-source CI/CD system built on composable primitives: resources for external artifacts, tasks for containerized work units, and jobs for orchestration. All pipelines are declarative YAML with version control, every task runs in an isolated container, and stateless workers enable horizontal scaling. Deployable via BOSH, Helm, Docker Compose, or standalone binary across any infrastructure.
Unleash
Open-source feature flag management platform
Unleash is the largest open-source feature flag platform, enabling teams to decouple deployment from release with gradual rollouts, A/B testing, and trunk-based development. It provides 15+ official SDKs for server and client frameworks, a web-based admin dashboard for managing feature toggles, and supports activation strategies like percentage rollout, user targeting, and environment-based rules. Self-hostable via Docker with PostgreSQL storage.
Arthas
Java diagnostic and troubleshooting tool
Arthas is Alibaba's open-source Java diagnostic tool that lets developers troubleshoot production issues without modifying code or restarting servers. It attaches to running JVM processes to inspect class loading, decompile classes, trace method invocations, monitor performance metrics, and view real-time stack traces. Supports JDK 6+ with both telnet and WebSocket interfaces for local and remote diagnostics across Linux, macOS, and Windows.
Sentrial
Production monitoring platform for AI agent reliability
Sentrial is a YC W26-backed monitoring platform for AI agent reliability in production. It semantically detects loops, hallucinations, tool misuse, and user frustration in real-time, then diagnoses root causes and recommends fixes. The platform claims 70% MTTR reduction via automated remediation including rollback, model retraining triggers, and webhooks. Sentrial positions itself as the Datadog for teams deploying autonomous AI agents at scale.
Sonarly
AI production engineer that auto-triages and fixes alerts
Sonarly is a YC W26-backed AI production engineer that autonomously triages production alerts, deduplicates them by root cause, and sends ready-to-merge pull request fixes. It connects to monitoring tools like Sentry and Datadog, analyzes alert patterns to identify the underlying issue, and generates code fixes or optimization recommendations. Built on Claude APIs, Sonarly reduces mean time to resolution for production incidents while minimizing alert fatigue for engineering teams.