aicoolies logo
OpenLIT logo

OpenLIT

OpenTelemetry-native observability for LLM applications with evals and GPU monitoring

Share
open-sourceOpen Source
Visit Website →

OpenLIT is an open-source AI engineering platform that provides OpenTelemetry-native observability for LLM applications. It combines distributed tracing, evaluation, prompt management, a secrets vault, and GPU telemetry in a single self-hostable stack. With 50+ integrations across LLM providers and frameworks, it lets teams monitor AI applications using their existing observability backends like Grafana, Datadog, or Jaeger.

OpenLIT is an open-source observability platform purpose-built for AI applications that takes an OpenTelemetry-native approach to LLM monitoring. Rather than creating yet another proprietary tracing format, OpenLIT instruments LLM calls as standard OpenTelemetry spans and metrics, which means traces flow directly into whatever observability backend a team already runs. This design philosophy eliminates vendor lock-in and lets AI observability coexist with existing infrastructure monitoring in a single pane of glass.

The platform covers the full AI engineering lifecycle beyond basic tracing. An evaluation framework lets teams define and run quality checks on LLM outputs. A prompt management system provides version control and A/B testing for prompts. A secrets vault protects API keys and sensitive configuration. GPU telemetry tracks utilization, memory, and temperature across inference infrastructure. All of these capabilities share the same data pipeline and dashboard, avoiding the tool sprawl that typically accompanies production LLM deployments.

OpenLIT provides SDK instrumentation for Python, TypeScript, Java, and C# with auto-instrumentation for over 50 LLM providers and frameworks including OpenAI, Anthropic, LangChain, LlamaIndex, and Hugging Face. The self-hosted deployment runs on standard infrastructure with no special requirements. The project is Apache 2.0 licensed and has an active development community shipping regular releases throughout 2026.

Pricing

Free and open-source under Apache 2.0

Platforms

Self-hosted on any platform; Python, TS, Java, C# SDKs

Categories

Tags

Use Cases

Alternatives

Related Tools

Latitude

Sentry-style observability for AI agent conversations

Latitude is an agent observability platform for teams that need to inspect LLM traces, conversations, issues, and evaluation feedback in one workflow. Its public repo and docs position it as a Sentry-style monitor for AI agents, with semantic search, issue detection, annotations, MCP-assisted fixes, and cloud or self-hosted deployment paths for production debugging.

freemiumOpen SourceTelemetry

Spotlight by Backplanes

Session reports for Claude Code and Codex runs

Spotlight by Backplanes turns completed Claude Code and Codex sessions into concise reports for engineering, security, and spend review. The CLI installs on macOS, Linux, or WSL 2, watches sessions after they finish, redacts PII and credentials locally before upload, then summarizes files touched, commands run, external domains reached, scope drift, risky actions, and next-session improvements.

freemiumTelemetry
Agent Governance Toolkit logo

Agent Governance Toolkit

Microsoft’s public-preview runtime governance toolkit for policy, identity, sandboxing, audit, and MCP security around AI agents.

Agent Governance Toolkit is Microsoft’s MIT-licensed public-preview toolkit for governing AI agent runtimes. It adds policy enforcement, zero-trust identity, execution sandboxing, audit, reliability, and MCP security-gateway patterns around tool calls and autonomous actions, helping platform teams move beyond prompt-only guardrails while preserving architecture review requirements.

open-sourceOpen SourceTelemetry
Baz logo

Baz

Telemetry-aware AI code reviewer that checks how pull requests may affect real services.

Baz is an AI code-review platform focused on production-aware pull requests. Instead of only reading the diff, Baz connects code changes to application telemetry so reviewers can understand what endpoints, services, and runtime behavior may be affected. That makes it a useful complement to existing AI PR bots when the question is not just whether a change looks correct, but whether it could break a live system.

freemiumTelemetry
rampart

Rampart

Microsoft’s pytest-native red teaming framework for turning AI agent safety findings into CI tests.

RAMPART is an open-source Microsoft framework for safety and security testing of agentic AI applications. It brings red-team findings into a pytest-native workflow so teams can turn prompt injection, unsafe tool use, and behavioral boundary failures into repeatable regression tests. The strongest aicoolies angle is developer workflow: RAMPART makes agent safety part of CI/CD instead of a one-off security review.

open-sourceOpen Source
Statewright logo

Statewright

State-machine guardrails for controlling which tools AI coding agents can use at each phase.

Statewright is a guardrail layer for AI coding agents that uses explicit state machines to control what an agent can do at each stage of a workflow. Instead of relying only on prompt instructions, teams can model phases such as plan, implement, test, and review, then constrain tool access for clients like Claude Code, Codex, Cursor, opencode, and related MCP workflows.

open-sourceOpen Source

Comparisons