aicoolies logo
Khoj logo

Khoj

Open-source AI second brain with deep research and RAG

Share
freemiumOpen Source
Visit Website →

Khoj is an open-source personal AI app that serves as a self-hostable second brain. It connects to your documents — PDFs, Markdown, Notion, Word — and uses RAG to answer questions grounded in your knowledge base. Supports any local or cloud LLM including Llama, Claude, GPT, and Gemini. Features custom agents, scheduled automations, deep research mode, semantic search, and Obsidian, Emacs, and WhatsApp integrations. Over 33,000 GitHub stars, YC-backed.

Khoj occupies a unique position in the AI tools landscape as a personal knowledge companion that bridges your private documents with the reasoning capabilities of large language models. Unlike generic chat interfaces that rely solely on training data, Khoj indexes your files using semantic embeddings and retrieves relevant context before generating answers, dramatically reducing hallucinations when working with your own content. The system supports a wide range of document formats including PDFs, Markdown files, Notion pages, Word documents, and org-mode files, making it compatible with most knowledge workers' existing workflows without requiring format conversion.

The platform's agent system allows users to create specialized AI assistants with custom knowledge bases, personas, and tool access. A research agent might have access to academic papers and web search, while a project assistant might draw only from specific repository documentation. Khoj's scheduled automation feature enables recurring research tasks that deliver results as personal newsletters or notifications. The deep research mode performs multi-step investigation across both local documents and web sources, synthesizing findings into comprehensive reports.

Khoj scales from a fully offline, on-device deployment using local models through Ollama to a cloud-hosted enterprise installation with team management and SSO. The self-hosted option runs via Docker or pip with complete data sovereignty, while the hosted version at app.khoj.dev offers a free tier for individual users. With over 33,000 GitHub stars, Y Combinator backing, native Obsidian and Emacs plugins, and support for image generation and voice interaction, Khoj has established itself as the leading open-source alternative to commercial AI assistants for knowledge-intensive work.

Pricing

Free self-hosted; cloud free tier available; paid plans for teams

Platforms

Web, Desktop, Obsidian, Emacs, WhatsApp — Docker or pip self-host

Categories

Tags

Use Cases

Alternatives

Open WebUI logo

Open WebUI

Self-hosted AI platform with ChatGPT-like interface for local and cloud LLMs.

Extensible, self-hosted AI platform with 290M+ Docker pulls and 124K+ GitHub stars. Supports Ollama, OpenAI-compatible APIs, and any Chat Completions backend. Features built-in RAG, multi-user RBAC, voice/video calls, Python function workspace, model builder, and web browsing. Runs entirely offline with enterprise features including SSO and audit logging.

free
AnythingLLM logo

AnythingLLM

All-in-one self-hosted AI app with RAG, agents, and multi-user support

AnythingLLM is an open-source, privacy-first AI application that turns any document into an interactive knowledge base. It bundles document ingestion, vector storage (built-in LanceDB), RAG pipelines, AI agents, and multi-user access into a single deployable package. Supports 30+ LLM providers including OpenAI, Anthropic, Ollama, and local models. With 62K+ GitHub stars and MIT license, it runs as a desktop app or Docker container with zero configuration required out of the box.

freemiumOpen Source
LibreChat logo

LibreChat

Self-hosted multi-model AI chat platform

LibreChat is an open-source ChatGPT-like interface with 35K+ GitHub stars supporting multiple AI providers in a single self-hosted platform. Connect OpenAI, Anthropic, Google, Mistral, local models via Ollama, and custom endpoints simultaneously. Features conversation branching, file uploads, code interpreter, plugins, presets, multi-user support with RBAC, and LDAP/SSO authentication. Privacy-focused alternative to commercial AI chat services with full data ownership.

open-sourceOpen Source
Ollama logo

Ollama

Run LLMs locally with one command

Tool for running large language models locally on your machine with a simple CLI interface. Download and run Llama 3, Mistral, Gemma, Phi, Code Llama, and dozens of other open-source models with a single command. Features model management, GPU acceleration (NVIDIA/AMD/Apple Silicon), OpenAI-compatible API server, Modelfile for customization, and multi-model switching. Ideal for offline AI development, privacy-sensitive use cases, and local testing. 120K+ GitHub stars.

open-sourceOpen Source

Related Tools

KubeAI

Kubernetes operator for serving AI inference workloads

KubeAI is an Apache-2.0 Kubernetes operator for deploying and scaling AI inference workloads, including LLMs, embeddings, reranking, and speech-to-text. It gives platform teams OpenAI-compatible endpoints, model proxy/controller primitives, model caching, scale-from-zero behavior, and cluster-native resource management for self-hosted inference on Kubernetes.

open-sourceOpen Source

Notion MCP Server

Official Notion MCP server for AI-agent workspace access

Notion MCP Server is Notion's official MIT-licensed MCP server for connecting AI assistants to Notion workspaces. It supports the vendor-backed remote OAuth path and tools designed for page, workspace, and Markdown-style operations, making it a safer default than unofficial Notion bridges for teams already using Notion for docs, projects, or internal knowledge bases.

open-sourceOpen SourceTelemetry

Linear MCP Server

Official authenticated remote MCP endpoint for Linear issues, projects, comments, and coding-agent workflows.

Linear MCP Server is Linear’s official authenticated remote MCP endpoint for agent access to issues, projects, and comments. It gives Claude, Codex, Cursor, VS Code, Windsurf, Zed, and other clients a centrally hosted way to find, create, and update Linear work items through OAuth-backed MCP without maintaining a local connector or brittle API glue.

freemiumTelemetry

Slack MCP Server

Official Slack MCP server for approved workspace search, messaging, canvas, and user-context actions.

Slack MCP Server is Slack’s official remote MCP layer for giving approved AI clients workspace context and controlled actions. It lets agents search messages, files, users, and channels, draft or send messages, read threads, manage canvases, and authenticate through Slack OAuth while workspace admins approve integrations and normal Slack rate limits still apply.

freemiumTelemetry

Spotlight by Backplanes

Session reports for Claude Code and Codex runs

Spotlight by Backplanes turns completed Claude Code and Codex sessions into concise reports for engineering, security, and spend review. The CLI installs on macOS, Linux, or WSL 2, watches sessions after they finish, redacts PII and credentials locally before upload, then summarizes files touched, commands run, external domains reached, scope drift, risky actions, and next-session improvements.

freemiumTelemetry
agmsg logo

agmsg

Cross-agent messaging for CLI coding agents

agmsg is an MIT-licensed Bash and SQLite messaging layer for CLI coding agents. It lets Claude Code, Codex, Gemini CLI, GitHub Copilot CLI, Antigravity, OpenCode, Hermes, and other terminal agents exchange messages through a shared local database instead of relying on a human copy-paste relay. It is intentionally not MCP, not a broker, and not a subagent framework.

open-sourceOpen Source