aicoolies logo
Manifest logo

Manifest

Smart LLM router that cuts inference costs up to 70%

Share
freemiumOpen Source
Visit Website →

Manifest is an open-source smart model router that intelligently routes LLM requests to the cheapest capable model, reducing inference costs by up to 70% without sacrificing output quality. It uses a 23-dimension scoring algorithm to evaluate 300+ models across providers including OpenAI, Anthropic, Google, and DeepSeek, with automatic fallbacks and budget controls. Manifest can be deployed as a cloud service, local plugin, or self-hosted Docker container with transparent routing logic.

Manifest is an open-source intelligent routing layer for LLM API calls that automatically selects the most cost-effective model capable of handling each request. Rather than hardcoding model selections or building custom routing logic, developers point their API calls through Manifest and let its 23-dimension scoring algorithm match requests to the optimal model from a pool of over 300 options across OpenAI, Anthropic, Google, DeepSeek, and other providers. The system analyzes request complexity, required capabilities, and cost constraints to make routing decisions in real time.

The routing engine goes beyond simple load balancing by evaluating models across dimensions including reasoning depth, code generation ability, multilingual support, context window requirements, and latency sensitivity. Automatic fallback chains ensure reliability when a primary model is unavailable or rate-limited, while budget controls let teams set spending limits per project, per user, or per time period. All routing decisions are fully transparent, with detailed logs showing why each model was selected and how much was saved compared to default routing.

Built in TypeScript with over 4,200 commits, Manifest offers flexible deployment options including a hosted cloud dashboard, a local OpenClaw plugin for IDE integration, and self-hosted Docker containers for teams that need to keep API keys and traffic within their infrastructure. The MIT-licensed project has attracted 4,300 GitHub stars and serves teams looking to reduce LLM costs without the engineering overhead of building and maintaining their own model selection infrastructure across multiple providers.

Pricing

Free open source MIT; cloud dashboard available

Platforms

TypeScript, Docker, cloud, local plugin

Categories

Tags

Use Cases

Alternatives

Related Tools

Traceway logo

Traceway

OpenTelemetry-native observability with AI tracing, logs, traces, metrics, and session replay — self-hosted in 90 seconds.

Traceway is an open-source, OpenTelemetry-native observability platform that combines logs, traces, metrics, exceptions, session replay, and AI tracing in a single self-hosted system. MIT licensed with no open-core restrictions, it deploys in 90 seconds via Docker Compose and accepts OTLP/HTTP from any OTel SDK without a Collector or per-language vendor SDK.

open-sourceOpen Source
Judgeval logo

Judgeval

Open-source post-building layer for agents — tracing, evals, and online monitoring

Judgeval is the open-source post-building layer for AI agents from Judgment Labs, providing OpenTelemetry-based tracing, hosted and custom evaluation scorers, and online behavior monitoring for LLM-powered applications. Instrument any function with a single decorator, score live production traffic against faithfulness and instruction-adherence checks, and feed real-world failures back into reinforcement learning or supervised fine-tuning loops.

open-sourceOpen Source
TraceRoot logo

TraceRoot

Open-source observability and self-healing layer for AI agents

TraceRoot is a YC S25-backed open-source observability platform purpose-built for AI agents and LLM apps. It combines OpenTelemetry-compatible tracing with an agentic debugging runtime that reads your source code, correlates failures with recent commits, and proposes fix PRs automatically. BYOK support spans seven LLM providers; the entire stack runs self-hosted via Docker Compose, with TraceRoot Cloud available for managed deployments.

open-sourceOpen Source
OpenSRE logo

OpenSRE

Open-source toolkit for building AI SRE incident response agents

OpenSRE is an open-source Python toolkit from Tracer Cloud for building AI SRE agents that investigate and respond to production incidents. It ships with connectors to Prometheus, Grafana, Kubernetes and incident platforms, plus a simulation harness that replays past incidents so teams can benchmark agent accuracy before trusting it on live pager rotations.

open-sourceOpen Source
Evolver logo

Evolver

Self-evolution engine for AI agents with auditable updates

Evolver is an open-source self-evolution engine for AI agents that turns run logs into auditable, reviewable updates via its Genome Evolution Protocol. Instead of ad hoc prompt tweaking, teams collect traces and Evolver proposes versioned diffs to prompts, tools and workflows that engineers can approve, reject or roll back like code.

open-sourceOpen Source
CodeBurn logo

CodeBurn

See where your AI coding tokens actually go

Open-source TUI dashboard and CLI that shows where your AI coding tokens actually go, broken down by task type, tool, model, MCP server, and project. CodeBurn reads local session data directly from Claude Code, Codex, Cursor, OpenCode, Pi, and GitHub Copilot — no wrapper, proxy, or API keys — and layers on one-shot success rates so you can see whether the AI nails work first try or burns budget on edit/test/fix retries. Ships with a macOS menu bar widget and CSV/JSON export.

freeOpen Source