MLflow

Open-source platform for the complete machine learning lifecycle.

open-sourceOpen Source

MLflow is an open-source platform for managing the end-to-end machine learning lifecycle. Covers experiment tracking, model packaging, model registry, and deployment. Created by Databricks and now a Linux Foundation project. Integrates with TensorFlow, PyTorch, scikit-learn, Hugging Face, and all major ML frameworks.

We have a review for this tool

A detailed review by the aicoolies team — click to read

MLflow provides four core components for the ML lifecycle: Tracking (logging parameters, metrics, and artifacts from experiments), Models (packaging ML models in a standard format), Model Registry (centralized model store with versioning and staging), and Projects (packaging ML code for reproducible runs).

The platform is framework-agnostic, supporting TensorFlow, PyTorch, scikit-learn, XGBoost, Hugging Face Transformers, LangChain, OpenAI, and virtually any Python ML library. MLflow also includes LLM evaluation tools and a deployments server for serving models via REST API.

MLflow is free and open source under the Apache 2.0 license. Databricks offers a managed MLflow experience integrated with their data lakehouse platform. Self-hosted deployment is straightforward with pip install and supports PostgreSQL, MySQL, or SQLite backends.

Pricing

Free and open source (Apache 2.0). Managed version included in Databricks.

Platforms

Python-based. Self-hosted on any OS. Managed via Databricks. REST API + Web UI.

Use Cases

AI Model Training Data Engineering

Alternatives

Steel

Open-source browser infrastructure for AI agents at scale

Steel is an open-source browser API purpose-built for AI agents, providing managed headless browser sessions with anti-bot bypass, proxy rotation, CAPTCHA solving, and session persistence. It handles the infrastructure layer that browser automation agents like Browser Use and Stagehand run on top of. Self-hostable or available as a cloud service. Over 6,000 GitHub stars.

open-sourceOpen Source

Trigger.dev

Open-source background jobs and AI workflows for TypeScript

Trigger.dev is an open-source platform for building and deploying background jobs, AI agents, and long-running workflows in TypeScript. It eliminates serverless timeouts with durable task execution, automatic retries, queue-based concurrency control, and elastic scaling. Used by 30,000+ developers at companies like MagicSchool and Icon.com, it processes hundreds of millions of agent runs monthly. Backed by a $16M Series A led by Dalton Caldwell's Standard Capital fund.

freemiumOpen Source

Braintrust

LLM evaluation and prompt engineering platform

Braintrust is an AI observability and evaluation platform for tracing LLM applications, building datasets, running prompt/model experiments, scoring outputs and turning production feedback into regression tests. It fits teams that need repeatable quality gates for AI releases rather than one-off prompt demos.

freemium

Related Tools

KubeAI

Kubernetes operator for serving AI inference workloads

KubeAI is an Apache-2.0 Kubernetes operator for deploying and scaling AI inference workloads, including LLMs, embeddings, reranking, and speech-to-text. It gives platform teams OpenAI-compatible endpoints, model proxy/controller primitives, model caching, scale-from-zero behavior, and cluster-native resource management for self-hosted inference on Kubernetes.

open-sourceOpen Source

Latitude

Sentry-style observability for AI agent conversations

Latitude is an agent observability platform for teams that need to inspect LLM traces, conversations, issues, and evaluation feedback in one workflow. Its public repo and docs position it as a Sentry-style monitor for AI agents, with semantic search, issue detection, annotations, MCP-assisted fixes, and cloud or self-hosted deployment paths for production debugging.

freemiumOpen SourceTelemetry

Spotlight by Backplanes

Session reports for Claude Code and Codex runs

Spotlight by Backplanes turns completed Claude Code and Codex sessions into concise reports for engineering, security, and spend review. The CLI installs on macOS, Linux, or WSL 2, watches sessions after they finish, redacts PII and credentials locally before upload, then summarizes files touched, commands run, external domains reached, scope drift, risky actions, and next-session improvements.

freemiumTelemetry

Traceway

OpenTelemetry-native observability with AI tracing, logs, traces, metrics, and session replay — self-hosted in 90 seconds.

Traceway is an open-source, OpenTelemetry-native observability platform that combines logs, traces, metrics, exceptions, session replay, and AI tracing in a single self-hosted system. MIT licensed with no open-core restrictions, it deploys in 90 seconds via Docker Compose and accepts OTLP/HTTP from any OTel SDK without a Collector or per-language vendor SDK.

open-sourceOpen Source

Judgeval

Open-source post-building layer for agents — tracing, evals, and online monitoring

Judgeval is the open-source post-building layer for AI agents from Judgment Labs, providing OpenTelemetry-based tracing, hosted and custom evaluation scorers, and online behavior monitoring for LLM-powered applications. Instrument any function with a single decorator, score live production traffic against faithfulness and instruction-adherence checks, and feed real-world failures back into reinforcement learning or supervised fine-tuning loops.

open-sourceOpen Source

TraceRoot

Open-source observability and self-healing layer for AI agents

TraceRoot is a YC S25-backed open-source observability platform purpose-built for AI agents and LLM apps. It combines OpenTelemetry-compatible tracing with an agentic debugging runtime that reads your source code, correlates failures with recent commits, and proposes fix PRs automatically. BYOK support spans seven LLM providers; the entire stack runs self-hosted via Docker Compose, with TraceRoot Cloud available for managed deployments.

open-sourceOpen Source

MLflow

Pricing

Platforms

Categories

Tags

Use Cases

Alternatives

Steel

Trigger.dev

Braintrust

Related Tools

KubeAI

Latitude

Spotlight by Backplanes

Traceway

Judgeval

TraceRoot