aicoolies logo
Arthas logo

Arthas

Java diagnostic and troubleshooting tool

Share
open-sourceOpen Source
Visit Website →

Arthas is Alibaba's open-source Java diagnostic tool that lets developers troubleshoot production issues without modifying code or restarting servers. It attaches to running JVM processes to inspect class loading, decompile classes, trace method invocations, monitor performance metrics, and view real-time stack traces. Supports JDK 6+ with both telnet and WebSocket interfaces for local and remote diagnostics across Linux, macOS, and Windows.

Arthas solves one of the most persistent challenges in Java operations: diagnosing production issues that cannot be reproduced in development environments. By attaching to a running JVM process as a Java agent, it provides real-time introspection capabilities including class decompilation, method tracing with timing breakdowns, thread analysis, and memory inspection—all without requiring code changes, recompilation, or server restarts. This makes it invaluable for investigating performance degradation, unexpected behavior, and intermittent bugs in live systems.

The diagnostic toolkit includes commands for watching method arguments and return values in real time, profiling CPU-intensive code paths with flame graphs, inspecting and modifying object field values on the fly, and searching for class loader conflicts. It supports both interactive sessions through telnet and browser-based WebSocket connections, enabling remote troubleshooting of servers in data centers or cloud environments. The OGNL expression engine allows complex ad-hoc queries against the running application state.

Originally developed at Alibaba to manage their massive Java infrastructure, Arthas has become one of the most popular Java diagnostic tools in the global developer community with over 35,000 GitHub stars. It supports JDK versions from 6 through the latest releases and runs on Linux, macOS, and Windows. The one-line installation script and intuitive command interface make it accessible to developers at all experience levels, while its depth of instrumentation capabilities satisfies even the most demanding production debugging scenarios.

Pricing

Free and open source under Apache 2.0

Platforms

JVM tool; Linux, macOS, Windows

Categories

Tags

Use Cases

Alternatives

Related Tools

Freestyle logo

Freestyle

Sandboxes for coding agents — Linux VMs, Git, and deploys in one box

Freestyle is YC-backed sandbox infrastructure built for AI coding agents, shipping secure Linux VMs with nested virtualization, Git servers, and one-click web deploys. It lets agents run real workloads, branch repos, and deploy apps under short-lived identities while billing only for active compute. Used in production by vly.ai, Rork, and Vibeflow.

freemium
OpenSRE logo

OpenSRE

Open-source toolkit for building AI SRE incident response agents

OpenSRE is an open-source Python toolkit from Tracer Cloud for building AI SRE agents that investigate and respond to production incidents. It ships with connectors to Prometheus, Grafana, Kubernetes and incident platforms, plus a simulation harness that replays past incidents so teams can benchmark agent accuracy before trusting it on live pager rotations.

open-sourceOpen Source
Twill AI logo

Twill AI

Autonomous coding agents that ship while you sleep

Twill is an autonomous coding agent platform that implements features, fixes bugs, and ships pull requests without manual intervention. Uses structured workflow of research, planning, human review, implementation in isolated sandbox, AI code review, then merge. Supports custom agent configurations with multiple LLM providers, isolated dev environments for verification, and integrations with GitHub, Linear, Sentry, Notion, and cloud platforms for end-to-end engineering automation.

freemium
Baseten logo

Baseten

ML inference platform for production AI models

Baseten is the inference platform for deploying AI models at scale with dedicated and pre-optimized model APIs and performance-optimized infrastructure. Specializes in image generation, transcription, text-to-speech, LLM serving, embeddings, and compound AI workloads. Delivers 75% latency reduction with 415ms cold starts and 3000+ concurrent scaling. Available as managed cloud or self-hosted, trusted by Cursor, Notion, Descript, and Sourcegraph for production inference.

api-usage-based
poethepoet logo

Poethepoet

Task runner for Python with Poetry and uv

Poethepoet (poe) is a batteries-included task runner for Python projects that integrates with Poetry and uv package managers. Define tasks in pyproject.toml, compose them in sequential, parallel, or DAG workflows, and execute with full virtual environment context. Supports shell commands, Python scripts, environment variables, .env file loading, and auto-generated shell completion across bash, zsh, and fish for streamlined development workflows.

open-sourceOpen Source
Elkeid logo

Elkeid

Kernel-space host intrusion detection system

Elkeid is ByteDance's open-source HIDS for hosts, containers, Kubernetes, and serverless workloads. Its kernel-level data collection via Kprobe hooks captures process lineage, privilege escalation attempts, file access patterns, and network connections with minimal overhead. Includes an Agent for telemetry, Detector for rule evaluation, Controller for policy management, and a Dashboard for alerts and investigation.

open-sourceOpen Source