Kubernetes Tools

Kubernetes management, troubleshooting, virtual clusters, cost optimization, and AI-powered K8s operations.

KubeAI

Kubernetes operator for serving AI inference workloads

KubeAI is an Apache-2.0 Kubernetes operator for deploying and scaling AI inference workloads, including LLMs, embeddings, reranking, and speech-to-text. It gives platform teams OpenAI-compatible endpoints, model proxy/controller primitives, model caching, scale-from-zero behavior, and cluster-native resource management for self-hosted inference on Kubernetes.

Open Source

kubectl-ai

Google’s open-source Kubernetes assistant that translates natural-language intent into precise cluster operations.

kubectl-ai is an AI-powered Kubernetes assistant from Google Cloud Platform. It acts as an intelligent interface for cluster work, translating operator intent into Kubernetes commands and workflows. The key distinction from reactive diagnosis tools is that kubectl-ai is designed as an interactive natural-language interface for planning and executing Kubernetes operations, with provider configuration and MCP-oriented workflows around the CLI.

Open Source, Telemetry

Vald

Cloud-native distributed vector search engine built for Kubernetes with automatic indexing and horizontal scaling.

Vald is a highly scalable distributed approximate nearest neighbor (ANN) vector search engine designed for cloud-native, Kubernetes-based architectures. Maintained by LY Corporation and listed in the CNCF Landscape, it uses the NGT algorithm (developed at Yahoo Japan), supports automatic incremental index backup, and handles billion-scale datasets across loosely coupled microservice components that scale horizontally via Helm.

Open Source

KubeVela

Modern application delivery platform for Kubernetes

KubeVela is a CNCF Incubating project that provides a modern application delivery platform built on Kubernetes and the Open Application Model. It abstracts away infrastructure complexity by letting developers define applications declaratively with components, traits, and policies, while platform teams manage delivery workflows. KubeVela supports multi-cluster deployment, canary rollouts, GitOps integration, and an extensible addon system.

Open Source

Krkn

CNCF Sandbox chaos engineering framework for Kubernetes resilience

Krkn is a CNCF Sandbox chaos engineering tool that tests Kubernetes cluster resilience by injecting controlled failures. It simulates pod kills, node failures, network partitions, CPU/memory pressure, and zone outages. Krkn-AI adds AI-powered scenario generation that suggests chaos experiments based on cluster topology. Krkn also supports CI/CD integration for automated resilience testing in deployment pipelines.

Open Source

k3s

Lightweight Kubernetes distribution for edge, IoT, and development

k3s is a CNCF Sandbox lightweight Kubernetes distribution packaged as a single binary under 100 MB. Created by Rancher Labs and now maintained by SUSE, it strips non-essential components and bundles containerd, Flannel, CoreDNS, and Traefik into a minimal but fully conformant Kubernetes distribution. It is well suited to edge computing, IoT, ARM devices, and local development environments.

Open Source

Istio

Leading open-source service mesh for Kubernetes microservices

Istio is the most widely adopted open-source service mesh for Kubernetes, providing traffic management, security, and observability for microservice architectures. It uses Envoy proxy sidecars to intercept and manage service-to-service communication with mutual TLS, fine-grained traffic routing, circuit breaking, and distributed tracing. Istio is a CNCF Graduated project used in production by Google, IBM, and Salesforce.

Open Source

Robusta

CNCF Sandbox Kubernetes alert enrichment and automation platform

Robusta is a CNCF Sandbox project that enriches Kubernetes alerts with diagnostic context and automates remediation workflows. It intercepts Prometheus alerts and attaches relevant logs, pod status, resource metrics, and troubleshooting suggestions before delivering them to Slack, Teams, or PagerDuty. It also supports custom playbooks for automated incident response and AI-powered root cause analysis.

Open Source
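
The enrich-then-deliver flow described above can be sketched in a few lines of Python. This is only an illustration of the general pattern, not Robusta's implementation or API: the function names `enrich_alert` and `route`, the `fetch_logs` and `fetch_pod_status` callbacks, and the sink names are all hypothetical. The only assumption about the input is that an alert carries a `labels` dictionary, as Prometheus alerts do.

```python
def enrich_alert(alert, fetch_logs, fetch_pod_status):
    """Return a copy of the alert with diagnostic context attached.

    fetch_logs and fetch_pod_status are hypothetical callbacks that take
    (namespace, pod) and return whatever context the caller can gather.
    """
    labels = alert.get("labels", {})
    pod = labels.get("pod")
    namespace = labels.get("namespace", "default")

    enriched = dict(alert)  # shallow copy so the original alert is untouched
    enriched["context"] = {}
    if pod:
        enriched["context"]["logs"] = fetch_logs(namespace, pod)
        enriched["context"]["pod_status"] = fetch_pod_status(namespace, pod)
    return enriched


def route(alert, sinks):
    """Pick a delivery sink by severity label, falling back to 'default'."""
    severity = alert.get("labels", {}).get("severity", "info")
    return sinks.get(severity, sinks["default"])


# Example with stubbed context providers (no cluster access needed).
alert = {"labels": {"alertname": "PodCrashLooping", "pod": "api-7d9f",
                    "namespace": "prod", "severity": "critical"}}
enriched = enrich_alert(
    alert,
    fetch_logs=lambda ns, pod: ["panic: nil pointer dereference"],
    fetch_pod_status=lambda ns, pod: "CrashLoopBackOff",
)
sink = route(enriched, {"critical": "pagerduty", "default": "slack"})
```

Passing the context providers in as callbacks keeps the sketch testable without a cluster; a real tool would query the Kubernetes API at that point.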

Botkube

Kubernetes ChatOps bot for Slack, Teams, and Discord

Botkube is a Kubernetes ChatOps platform that brings cluster management into Slack, Microsoft Teams, and Discord. It provides real-time alerts for cluster events, enables kubectl command execution from chat, and supports automated workflows triggered by Kubernetes resource changes. It features a plugin architecture for extensibility and RBAC-based access control for team collaboration.

Open Source

Metoro

AI-powered SRE agent for Kubernetes troubleshooting

Metoro is an AI SRE platform for Kubernetes that combines observability with autonomous troubleshooting. Its Guardian agent monitors cluster health, correlates metrics, logs, and traces to identify root causes, and suggests remediation actions. It also includes an MCP server for integration with AI coding agents and natural-language querying of infrastructure state.

Freemium

Coroot

Zero-instrumentation Kubernetes observability powered by eBPF

Coroot is an open-source observability platform that uses eBPF to automatically instrument Kubernetes applications without code changes. It provides application maps, latency analysis, log correlation, and continuous profiling with automatic anomaly detection. Its agents capture metrics, traces, and logs at the kernel level, replacing the need for manual instrumentation.

Open Source

Kubecost

Kubernetes cost monitoring and optimization platform

Kubecost is an IBM Apptio / Cloudability product for Kubernetes cost visibility, allocation, and optimization, built around the Kubecost/OpenCost ecosystem. It helps map infrastructure spend to Kubernetes namespaces, deployments, pods, labels, and teams. OpenCost remains the vendor-neutral Apache-2.0 open-source project for cloud-native cost allocation with AWS, Azure, GCP, and Prometheus integrations.

Freemium, Open Source
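
To make "mapping infrastructure spend to namespaces" concrete, here is a minimal Python sketch that splits one node's hourly cost across namespaces in proportion to requested CPU and memory. It is a simplified illustration under stated assumptions, not the Kubecost or OpenCost allocation model: the function name `allocate_cost`, the 50/50 CPU-to-memory weighting, the `__idle__` bucket, and all numbers are hypothetical.

```python
from collections import defaultdict


def allocate_cost(pods, node_hourly_cost, node_cpu_cores, node_memory_gib,
                  cpu_weight=0.5):
    """Split a node's hourly cost across namespaces by resource requests.

    Each pod is charged a weighted blend of its share of the node's CPU
    and memory. Unrequested capacity is reported under "__idle__".
    The 50/50 weighting is an assumption made for this sketch.
    """
    memory_weight = 1.0 - cpu_weight
    totals = defaultdict(float)
    for pod in pods:
        cpu_share = pod["cpu_cores"] / node_cpu_cores
        memory_share = pod["memory_gib"] / node_memory_gib
        blended_share = cpu_weight * cpu_share + memory_weight * memory_share
        totals[pod["namespace"]] += node_hourly_cost * blended_share

    # Whatever was not requested by any pod is idle cost.
    totals["__idle__"] = node_hourly_cost - sum(totals.values())
    return dict(totals)


# Hypothetical node: 8 cores, 16 GiB, costing 0.40 per hour.
pods = [
    {"namespace": "checkout", "cpu_cores": 2.0, "memory_gib": 4.0},
    {"namespace": "checkout", "cpu_cores": 1.0, "memory_gib": 2.0},
    {"namespace": "search", "cpu_cores": 1.0, "memory_gib": 2.0},
]
costs = allocate_cost(pods, node_hourly_cost=0.40,
                      node_cpu_cores=8, node_memory_gib=16)
```

The same grouping works for labels, deployments, or teams by swapping the `namespace` key for another pod attribute.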

k0s

Zero-friction single-binary Kubernetes distribution by Mirantis

k0s is a lightweight, CNCF-certified Kubernetes distribution packaged as a single binary with zero host dependencies. Backed by Mirantis, it simplifies cluster deployment by bundling all required components into one executable that works on any Linux system. It supports x86-64, ARM64, and ARMv7 architectures, with automatic upgrades and a built-in control plane load balancer.

Open Source

Cilium

eBPF-based networking, security, and observability for Kubernetes

Cilium is a CNCF Graduated, Apache-2.0 project for Kubernetes networking, security, and observability using eBPF. It can replace kube-proxy, enforce identity-aware L3-L7 network policies, and add Hubble flow observability plus Tetragon runtime-security signals. Managed offerings built on Cilium/eBPF include GKE Dataplane V2 and Azure CNI Powered by Cilium for AKS.

Open Source

ScaleOps

Autonomous Kubernetes and GPU infrastructure optimization

ScaleOps provides autonomous real-time management of Kubernetes and GPU infrastructure, claiming cloud cost reductions of up to 80 percent without manual configuration. Backed by $130 million in Series C funding at an $800 million valuation, it serves enterprises including Adobe, Wiz, DocuSign, and Salesforce. The platform continuously rightsizes pods, optimizes replicas, manages nodes, and allocates GPUs based on live workload demand rather than static configurations.

Freemium

kagent

Kubernetes-native framework for DevOps AI agents

kagent is a Kubernetes-native AI agent framework developed at Solo.io and accepted into the CNCF Sandbox. It provides a structured environment for running DevOps-focused agents directly within Kubernetes clusters, with a dedicated kmcp toolkit for cloud-native operations. Unlike general-purpose agent frameworks, kagent targets platform engineers and SREs who need AI assistance with cluster management, troubleshooting, and infrastructure automation workflows.

Open Source

Teleport Beams

Trusted runtime environments for AI agents in production infrastructure

Teleport Beams provides cryptographically verified, policy-gated access for AI agents to interact with production infrastructure including servers, Kubernetes clusters, and databases. Launched at KubeCon EU 2026, Beams extends Teleport's zero-trust access platform with agent-specific runtime controls, audit trails, and policy enforcement to ensure AI agents operate within defined boundaries when deployed in production environments.

Open Source

Freelens

Free and open-source Kubernetes IDE for managing clusters visually

Freelens is a free open-source Kubernetes IDE that provides a visual desktop interface for managing clusters, workloads, and configurations. Forked from the original Lens project after its licensing change, Freelens offers the same powerful cluster management experience with real-time monitoring, log viewing, and resource editing under the MIT license.

Open Source

mirrord

Run local code inside your Kubernetes cluster without deploying

mirrord lets developers run local processes as if they were inside their Kubernetes cluster by intercepting network traffic, environment variables, and file access at the OS level, without any deployment or configuration changes. Backed by $12.5M in seed funding with investors including Sentry's co-founder, it claims up to 98% faster iteration cycles and 30% fewer production bugs by eliminating the gap between local and cluster environments.

Freemium, Open Source

Crossplane

Kubernetes-native cloud infrastructure control plane

Crossplane is a CNCF Graduated open-source project that extends Kubernetes to manage cloud infrastructure through declarative APIs. Platform teams compose custom infrastructure abstractions as Compositions and publish them as self-service APIs. It provisions resources across AWS, Azure, GCP, and 200+ providers directly from kubectl. It is used by 450+ organizations and has 11,000+ GitHub stars.

Open Source

Kubescape

Open-source Kubernetes security platform for risk analysis and compliance

Kubescape is a CNCF-backed open-source Kubernetes security platform that scans clusters, manifests, and container images for vulnerabilities, misconfigurations, and compliance violations. It checks against NSA-CISA, MITRE ATT&CK, and CIS benchmarks, integrates into CI/CD pipelines, and provides runtime threat detection via eBPF. It also supports SBOM generation. Created by ARMO, it is seeing growing enterprise adoption in cloud-native security.

Open Source

KServe

Kubernetes-native model inference platform

KServe is an open-source Kubernetes-native platform for deploying and managing ML model inference at scale. It provides standardized inference protocols, autoscaling including scale-to-zero, canary rollouts, A/B testing, and multi-model serving. KServe supports all major ML frameworks including TensorFlow, PyTorch, scikit-learn, XGBoost, and LLM runtimes like vLLM and Triton through pluggable serving runtimes.

Open Source

Falco

Cloud native runtime security for Kubernetes

Falco is a CNCF Graduated open-source runtime security tool that detects unexpected behavior and threats across containers, Kubernetes, and cloud workloads in real time. Originally created by Sysdig, Falco monitors Linux kernel syscalls using eBPF and applies customizable detection rules to alert on malicious activity such as container escapes, cryptojacking, unauthorized file access, and anomalous network connections. It supports 50+ alert output channels, including SIEM integration.

Open Source

Devtron

Kubernetes dashboard with 360-degree visibility

Devtron is an open-source Kubernetes management dashboard that provides a 360-degree view of cluster resources with fine-grained RBAC for multi-cluster environments. Its upcoming agentic AI feature automates debugging and cluster optimization, while the current platform offers centralized visibility, GitOps-based deployment workflows, and security policy enforcement across distributed Kubernetes infrastructure.

Open Source