# self-hosted
151 tools tagged
Showing 24 of 151 tools
Claude Code
Top PickAnthropic's agentic coding CLI
Anthropic's agentic CLI coding tool that delegates complex tasks to Claude directly from the terminal. Understands entire codebases via automatic context gathering, edits multiple files, runs shell commands, and manages Git workflows autonomously. Supports CLAUDE.md for persistent project instructions, integrates with VS Code and JetBrains, and uses Claude Opus/Sonnet with extended thinking for complex architectural decisions. Built for terminal-first developers.
KubeAI
Kubernetes operator for serving AI inference workloads
KubeAI is an Apache-2.0 Kubernetes operator for deploying and scaling AI inference workloads, including LLMs, embeddings, reranking, and speech-to-text. It gives platform teams OpenAI-compatible endpoints, model proxy/controller primitives, model caching, scale-from-zero behavior, and cluster-native resource management for self-hosted inference on Kubernetes.
Superserve
Open-source Firecracker sandboxes for long-running AI agents
Superserve is an open-source sandbox infrastructure layer for AI agents that need durable computers instead of short-lived shells. It runs isolated Firecracker microVMs, supports pause, resume, snapshot, fork, preview URLs, MCP connectivity, SDK/API control, Docker workloads, and self-hosting, while the hosted service adds pay-as-you-go agent sandboxes for teams.
SeekDB
AI-native state store with hybrid vector and full-text search
SeekDB is an open-source AI-native state store from the OceanBase ecosystem that combines MySQL-compatible data access with hybrid vector and full-text retrieval. It targets agent and AI application teams that need embedded or server deployment, copy-on-write style sandboxes, and searchable state without gluing together several separate storage layers.
pgvectorscale
DiskANN-powered vector search extension for PostgreSQL
pgvectorscale is an open-source PostgreSQL extension from Timescale that complements pgvector with DiskANN-based approximate vector search. It is useful for teams that want faster embedding retrieval while keeping vectors, filters, and application data inside the Postgres ecosystem instead of adopting a separate hosted vector database.
CLIProxyAPI
Self-hosted proxy API for routing AI CLI accounts into OpenAI-compatible endpoints
CLIProxyAPI is an open-source Go proxy server that wraps Gemini CLI, Claude Code, OpenAI Codex, Grok Build, and related CLI account flows behind OpenAI/Gemini/Claude-compatible API endpoints. Use it carefully: it can touch OAuth sessions, auth files, logs, and provider account policies, so production use needs credential and ToS review.
Traceway
OpenTelemetry-native observability with AI tracing, logs, traces, metrics, and session replay — self-hosted in 90 seconds.
Traceway is an open-source, OpenTelemetry-native observability platform that combines logs, traces, metrics, exceptions, session replay, and AI tracing in a single self-hosted system. MIT licensed with no open-core restrictions, it deploys in 90 seconds via Docker Compose and accepts OTLP/HTTP from any OTel SDK without a Collector or per-language vendor SDK.
Judgeval
Open-source post-building layer for agents — tracing, evals, and online monitoring
Judgeval is the open-source post-building layer for AI agents from Judgment Labs, providing OpenTelemetry-based tracing, hosted and custom evaluation scorers, and online behavior monitoring for LLM-powered applications. Instrument any function with a single decorator, score live production traffic against faithfulness and instruction-adherence checks, and feed real-world failures back into reinforcement learning or supervised fine-tuning loops.
TraceRoot
Open-source observability and self-healing layer for AI agents
TraceRoot is a YC S25-backed open-source observability platform purpose-built for AI agents and LLM apps. It combines OpenTelemetry-compatible tracing with an agentic debugging runtime that reads your source code, correlates failures with recent commits, and proposes fix PRs automatically. BYOK support spans seven LLM providers; the entire stack runs self-hosted via Docker Compose, with TraceRoot Cloud available for managed deployments.
Requestly
One tool for intercepting, mocking, and replaying HTTP — acquired by BrowserStack
Requestly is a BrowserStack-backed API client, HTTP interceptor, mock server, and session replay tool for frontend and QA teams. Its current product is commercial/API-client led, while the legacy interceptor/open-source code is AGPLv3. The free plan covers individual workflows, and Pro lists at $12/user/month monthly or $9/user/month annually for collaborative QA and frontend debugging teams.
Puck
Open-source visual editor for React — self-hostable core, optional Puck Cloud
Puck is an open-source visual page builder for React that gives teams a drag-and-drop editor backed by their own components. The MIT-licensed core remains self-hostable and persists plain JSON, while Puck Cloud and Puck AI now add optional hosted and agentic editing layers. With 12.8K+ stars, it is the dev-first visual editor to watch.
GraphBit
Rust-native multi-agent orchestration for production
GraphBit is a Rust-native, multi-agent orchestration framework built for production. It targets the gap between Python-first frameworks like LangGraph and the operational expectations of enterprise systems — predictable memory, low latency, deterministic concurrency, and the ability to embed an agent runtime in services that already run Rust without dragging in a Python interpreter.
VectorChord
High-recall Postgres vector search at billion scale
VectorChord is a Postgres extension from the supervc-stack/VectorChord project that brings high-recall vector search to PostgreSQL. As the spiritual successor to pgvecto.rs, it combines IVF indexes with RaBitQ quantization to deliver Pinecone-class performance at billion-vector scale while keeping all data inside a single Postgres database — no separate vector store, no two-system sync, no rewrites when the workload grows.
Infinity
AI-native database for hybrid RAG retrieval
Infinity is an AI-native database from InfiniFlow that unifies dense vectors, sparse vectors, tensors, and full-text search in a single engine. Built for retrieval-augmented generation (RAG) at scale, it powers hybrid search workflows where lexical matching, semantic similarity, and reranking all happen against one storage layer instead of four loosely coupled services.
Open SWE
Open-source async coding agent you can run in your own sandbox
Open-source framework from LangChain AI for building your organization's internal coding agent — the same pattern Stripe's Minions, Ramp's Inspect, and Coinbase's Cloudbot follow. Built on LangGraph and Deep Agents, Open SWE runs each task in an isolated cloud sandbox (Modal, Daytona, Runloop, or LangSmith), invokes from Slack, Linear, or GitHub, orchestrates subagents, and opens pull requests autonomously — customizable end-to-end for your codebase and conventions.
Devbox
Instant isolated dev environments powered by Nix
Devbox is an open-source command-line tool that creates instant, reproducible development environments using Nix packages without requiring you to learn Nix. Define your project dependencies in a simple devbox.json file and get isolated shells with access to over 400,000 package versions. It eliminates dependency conflicts between projects and ensures every team member works in an identical environment, with support for devcontainers, Docker, and cloud deployment.
NocoDB
Open-source Airtable alternative with database power
NocoDB is a free, self-hostable open-source platform that turns any database into a smart spreadsheet interface. It offers grid, gallery, form, Kanban, and calendar views with support for rich field types including links, lookups, rollups, and formulas. NocoDB provides role-based access control, REST APIs, workflow automation, and integrations with services like Slack and Discord — making it a powerful Airtable alternative for teams who want full data ownership.
LocalSend
Open-source cross-platform file sharing over local network
LocalSend is a free, open-source application for secure peer-to-peer file and message sharing between nearby devices over your local network. It works on Windows, macOS, Linux, Android, iOS, and Fire OS without requiring an internet connection or third-party servers. Each device generates TLS/SSL certificates for encrypted HTTPS communication, making it a privacy-first alternative to AirDrop that works across all operating systems.
PostHog
Open-source product analytics, session replay, and feature flags
PostHog is an open-source product and data tools platform for analytics, session replay, feature flags, experiments, surveys, error tracking, web analytics, data warehouse, CDP and LLM observability workflows. It suits developer-led teams that want one integrated product OS instead of many separate tools.
ZITADEL
Open-source identity management with built-in multi-tenancy
ZITADEL is an open-source identity and access management platform that handles authentication, authorization, and user management for B2B and B2C applications. It supports OIDC, SAML, OAuth 2.0, Passkeys, MFA, and passwordless login out of the box. Built with multi-tenancy as a core feature, ZITADEL lets you manage multiple organizations with delegated admin access, custom branding, and isolated identity stores—all from a single deployment.
Hasura
Instant GraphQL and REST APIs on any database
Hasura auto-generates real-time GraphQL and REST APIs directly from your database schema—PostgreSQL, MySQL, SQL Server, MongoDB, and more. It provides fine-grained row-level and column-level access control, event triggers on database changes, remote schema stitching, and real-time subscriptions out of the box. Available as a managed cloud service or self-hosted, Hasura eliminates weeks of boilerplate API development while maintaining full control over authorization logic.
TaxHacker
Self-hosted AI accounting for freelancers and small teams
TaxHacker is an open-source, self-hosted AI accounting app that automatically extracts financial data from receipts, invoices, and bank statements using LLMs. It supports 170+ currencies and 14 cryptocurrencies with historical exchange rate conversion, multi-project accounting, and custom AI extraction fields. Works with OpenAI, Gemini, Mistral, or local models via Ollama—deploy with Docker and keep all financial data under your control.
llama-swap
Hot-swap between local LLM models via OpenAI-compatible API
llama-swap is an open-source tool that manages multiple local LLM models behind a single OpenAI-compatible API endpoint. It automatically loads and unloads models on demand, letting developers hot-swap between different models without restarting services. With 3.1K+ GitHub stars, it solves the common pain point of running multiple specialized models on limited hardware.
Bifrost
50x faster LLM gateway with MCP support, built in Go
Bifrost is a high-performance open-source AI gateway built from scratch in Go. Unifies access to 15+ providers and 1,000+ models through a single OpenAI-compatible API with only 11 microsecond overhead per request at 5K RPS — 50x faster than LiteLLM. Features automatic failover, load balancing, semantic caching, and functions as both MCP client and MCP server. Apache 2.0 licensed.