aicoolies logo

Kotaemon

RAG-based document QA with multi-user support and agent reasoning

Share
open-sourceOpen Source
Visit Website →

Kotaemon is an open-source RAG-powered document question-answering interface backed by Cinnamon AI. It supports multi-user workspaces with access controls, advanced retrieval pipelines including hybrid search and knowledge graph extraction, and agentic reasoning for complex multi-step queries. The web UI handles PDFs, Office documents, and images with citations pointing to exact source passages, making it suitable for both individual research and team knowledge management.

Kotaemon provides a production-ready web interface for document question-answering powered by Retrieval-Augmented Generation. Unlike simpler RAG demos, it implements multi-user authentication with workspace isolation, letting organizations deploy a shared knowledge base where different teams maintain separate document collections with appropriate access controls. The retrieval pipeline supports multiple strategies including dense vector search, sparse BM25 matching, and hybrid combinations that balance semantic understanding with keyword precision.

The agentic reasoning mode enables complex multi-step queries that require synthesizing information across multiple documents or performing intermediate reasoning before generating final answers. Citation support links every response to specific source passages with page numbers and highlighted excerpts, giving users confidence in the accuracy of generated answers. Document processing handles PDFs with OCR for scanned pages, Microsoft Office formats, and images with text extraction, covering the document types commonly found in enterprise knowledge bases.

Backed by Cinnamon AI, a well-funded Japanese technology company, Kotaemon has grown to over 25,000 GitHub stars with 200,000+ Docker pulls. The Apache 2.0 license allows unrestricted commercial use, and the Docker-based deployment provides a straightforward path to self-hosted operation. Human-in-the-loop feedback mechanisms let users rate answer quality, creating a feedback signal for continuous improvement of retrieval and generation quality over time.

Pricing

Free and open source under Apache 2.0

Platforms

Docker on any server; web-based interface

Categories

Tags

Use Cases

Alternatives

Related Tools

Hermes Agent logo

Hermes Agent

Top Pick

Open-source AI agent framework with persistent memory, reusable skills, tools, and messaging gateways

Hermes Agent is an open-source AI agent framework with persistent memory, reusable skills, 40+ tools, cron jobs, and messaging gateways.

open-sourceOpen Source

Accomplish Coworker

Open-source desktop AI coworker for browsing and code execution.

Accomplish Coworker is an MIT-licensed open-source AI coworker that runs on the desktop, combining computer-use style browsing with code execution so agents can research, implement, run, and debug workflows in one local environment.

open-sourceOpen SourceTelemetry

Headroom

Context compression for LLM apps and coding agents

Headroom is an Apache-2.0 context compression layer for LLM apps and coding agents. It compresses tool output, logs, files, RAG chunks, and agent history through a local library, proxy, wrapper, or MCP server, with retrieval hooks for bringing originals back when needed. Treat its savings numbers as Headroom-reported benchmarks, not independent aicoolies measurements.

open-sourceOpen SourceTelemetry

Codebase Memory MCP

Codebase knowledge graph MCP server for AI coding agents

Codebase Memory MCP is an MIT-licensed MCP server that turns a repository into a persistent code knowledge graph for AI coding agents. It gives Claude Code, Cursor, Codex-style agents, and other MCP clients structural queries for functions, classes, call chains, routes, and architecture, helping them explore large projects without repeatedly rereading files or relying only on broad search.

open-sourceOpen SourceTelemetry

KubeAI

Kubernetes operator for serving AI inference workloads

KubeAI is an Apache-2.0 Kubernetes operator for deploying and scaling AI inference workloads, including LLMs, embeddings, reranking, and speech-to-text. It gives platform teams OpenAI-compatible endpoints, model proxy/controller primitives, model caching, scale-from-zero behavior, and cluster-native resource management for self-hosted inference on Kubernetes.

open-sourceOpen Source
BeeAI Framework logo

BeeAI Framework

Python and TypeScript framework for production multi-agent systems

BeeAI Framework is an Apache-2.0 toolkit for building production-ready AI agents and multi-agent systems in Python and TypeScript. Its docs cover agents, tools, RAG, memory, workflows, backend providers, serving, and A2A/MCP integration surfaces, making it a vendor-neutral option for teams comparing LangGraph, CrewAI, Mastra, and related agent runtimes.

open-sourceOpen SourceTelemetry

Used in Stacks