aicoolies logo

LightRAG

Knowledge graph-powered RAG framework from HKU

Share
open-sourceOpen Source
Visit Website →

LightRAG is a research-backed RAG framework from Hong Kong University that combines knowledge graph structures with vector search for more contextual retrieval. Published at EMNLP 2025, it extracts entities and relationships from documents to build a structured knowledge graph, then uses dual-level retrieval across both graph and vector representations with five query modes: naive, local, global, hybrid, and mix.

We have a review for this tool

A detailed review by the aicoolies team — click to read

LightRAG is a retrieval-augmented generation framework from HKU Data Science that combines knowledge-graph structures with vector retrieval. Published around the EMNLP 2025 paper with roughly 36K+ GitHub stars at write time, it addresses a core limitation of ordinary RAG systems: flat vector chunks often miss the relationships between entities, systems, people, and concepts.

The framework extracts entities and relationships from documents, stores them alongside vector representations, and supports multiple retrieval modes for local graph traversal, global graph context, hybrid retrieval, and broader mixed strategies. It also supports incremental updates so new documents can be added without rebuilding an entire knowledge graph, which is important for living knowledge bases.

LightRAG is MIT licensed and works with a broad set of storage backends including PostgreSQL, MongoDB, Neo4j, Milvus, Qdrant, ChromaDB, Faiss, and related options. The ecosystem includes RAG-Anything for multimodal documents, tables, formulas, and images. Extraction quality depends on the selected LLM and corpus complexity, so production teams should validate graph quality instead of assuming a fixed model-size rule.

Pricing

Free and open source (MIT). Bring your own LLM API key for entity extraction and queries.

Platforms

Python package via pip or uv. Docker and Kubernetes deployment. Web UI included. Works with any LLM provider.

Categories

Tags

Use Cases

Alternatives

LangChain logo

LangChain

Framework for LLM applications

The most widely-used framework for building LLM-powered applications, available in Python and JavaScript. Provides abstractions for chains, agents, RAG, memory, tool usage, and structured output. Integrates with 100+ LLM providers, vector stores, document loaders, and tools. LangSmith offers tracing and evaluation. LangGraph enables stateful, multi-agent workflows with cycles. 100K+ GitHub stars. The de facto standard for LLM application development despite growing alternatives like LlamaIndex.

open-sourceOpen Source
LlamaIndex logo

LlamaIndex

Data framework for LLM applications

Leading Python framework for building LLM-powered applications with focus on data-aware and agentic workflows. Provides tools for RAG (Retrieval-Augmented Generation), document indexing, vector store integrations, query engines, and multi-agent orchestration. 150+ data connectors for various sources. Works with OpenAI, Anthropic, local models, and more. Includes LlamaHub for community tools and LlamaCloud for managed RAG pipelines. 50K+ GitHub stars.

open-sourceOpen Source
RAGFlow logo

RAGFlow

Deep document understanding RAG engine

RAGFlow is an open-source RAG engine with 76K+ GitHub stars that provides deep document understanding for building knowledge-based AI applications. Optimizes chunking for 20+ document types including PDFs, Word docs, presentations, and images using layout-aware parsing. Features template-based chunking strategies, citation with source references, multi-recall retrieval combining keyword and semantic search, and a visual knowledge base management interface with drag-and-drop document upload.

open-sourceOpen Source

Related Tools

Hermes Agent logo

Hermes Agent

Top Pick

Open-source AI agent framework with persistent memory, reusable skills, tools, and messaging gateways

Hermes Agent is an open-source AI agent framework with persistent memory, reusable skills, 40+ tools, cron jobs, and messaging gateways.

open-sourceOpen Source

Accomplish Coworker

Open-source desktop AI coworker for browsing and code execution.

Accomplish Coworker is an MIT-licensed open-source AI coworker that runs on the desktop, combining computer-use style browsing with code execution so agents can research, implement, run, and debug workflows in one local environment.

open-sourceOpen SourceTelemetry

Headroom

Context compression for LLM apps and coding agents

Headroom is an Apache-2.0 context compression layer for LLM apps and coding agents. It compresses tool output, logs, files, RAG chunks, and agent history through a local library, proxy, wrapper, or MCP server, with retrieval hooks for bringing originals back when needed. Treat its savings numbers as Headroom-reported benchmarks, not independent aicoolies measurements.

open-sourceOpen SourceTelemetry

Codebase Memory MCP

Codebase knowledge graph MCP server for AI coding agents

Codebase Memory MCP is an MIT-licensed MCP server that turns a repository into a persistent code knowledge graph for AI coding agents. It gives Claude Code, Cursor, Codex-style agents, and other MCP clients structural queries for functions, classes, call chains, routes, and architecture, helping them explore large projects without repeatedly rereading files or relying only on broad search.

open-sourceOpen SourceTelemetry
BeeAI Framework logo

BeeAI Framework

Python and TypeScript framework for production multi-agent systems

BeeAI Framework is an Apache-2.0 toolkit for building production-ready AI agents and multi-agent systems in Python and TypeScript. Its docs cover agents, tools, RAG, memory, workflows, backend providers, serving, and A2A/MCP integration surfaces, making it a vendor-neutral option for teams comparing LangGraph, CrewAI, Mastra, and related agent runtimes.

open-sourceOpen SourceTelemetry
Klavis AI logo

Klavis AI

MCP integration platform for agent tool use at scale

Klavis AI is an Apache-2.0 MCP integration platform for teams connecting AI agents to external SaaS tools and APIs. The public repo and official docs position it as infrastructure for reliable tool access at scale, so it fits teams that want reusable MCP connectors without treating every integration as a one-off script or custom OAuth maintenance project.

open-sourceOpen SourceTelemetry

Used in Stacks

Comparisons