Docs generators, knowledge bases, and developer reference tools
Showing 24 of 49 tools
AI-ready PDF parser with benchmark-leading accuracy
OpenDataLoader PDF is a high-performance parser that extracts structured, AI-ready data from PDFs with industry-leading 0.907 benchmark accuracy. Combines deterministic local processing with optional AI hybrid mode for complex layouts, OCR support across 80+ languages, formula extraction in LaTeX, chart descriptions, and built-in prompt injection filtering. Available as Python, Node.js, and Java SDKs for seamless RAG pipeline and data preparation integration.
CI-friendly database documentation generator
tbls is an open-source database documentation tool that automatically generates schema documentation in Markdown, with built-in linting to enforce documentation standards and coverage metrics for tables and columns. It supports 13+ databases including PostgreSQL, MySQL, BigQuery, Snowflake, MongoDB, and ClickHouse. Designed for CI integration with GitHub Actions support, tbls runs schema diff detection and documentation enforcement as part of automated pipelines.
Context engineering patterns for AI coding assistants
Context Engineering Intro is an open-source repository by Cole Medin providing structured context engineering patterns for AI coding assistants. Built around Claude Code, it includes .claude command files, PRP templates, and the WISC framework for managing AI context in coding sessions. The repo shows how to structure project context and rules so AI assistants produce reliable, architecture-aware code. With 13K+ GitHub stars, it is a go-to reference for context-first AI coding.
Programmable Markdown typesetting for docs, books, and slides
Quarkdown is a Turing-complete Markdown typesetting system that compiles a single source into print-ready books, academic papers, knowledge bases, or interactive presentations. It extends Markdown with a built-in scripting language featuring functions, variables, and a standard library for full document control. Supports HTML, PDF, and plain text output with live preview and real-time reloading during authoring.
On-device hybrid search engine for your docs and notes
QMD is an on-device search engine built by Tobi Lütke (Shopify CEO) that indexes markdown notes, meeting transcripts, and documentation locally. It combines BM25 full-text search, vector semantic search, and LLM-powered re-ranking into a single hybrid pipeline. Ships with a built-in MCP server for seamless integration with Claude Code, Cursor, and other AI editors. All processing happens on your machine via node-llama-cpp with GGUF models — zero cloud dependency.
MIT-licensed API documentation framework by Zuplo
Zudoku is an MIT-licensed API documentation framework built by the team behind Zuplo API gateway. It generates developer portals from OpenAPI specs with features like interactive API playground, markdown content pages, authentication integration, and search. Designed as a free alternative to ReadMe and Stoplight with customizable themes and plugin architecture.
Modern API documentation platform and OpenAPI reference generator
Scalar is an API documentation platform that generates beautiful, interactive API references from OpenAPI specifications. Selected as the default API docs renderer in .NET 9, it replaces Swagger UI with a modern interface featuring dark mode, request examples in multiple languages, and a built-in API client. Available as open-source packages for any framework or as a hosted platform.
Self-hosted collaborative wiki and documentation platform
Docmost is an open-source collaborative wiki offering real-time editing, spaces with granular permissions, page history, and rich diagram support via Draw.io, Excalidraw, and Mermaid. Designed as a self-hosted Confluence and Notion alternative for teams needing data sovereignty, it provides nested pages, comments, and full-text search. Over 19,700 GitHub stars and 52 releases demonstrate strong community adoption.
Open standard for guiding AI coding agents at the repository level
AGENTS.md is an open standard format adopted by 60,000+ open-source projects for providing AI coding agents with repository-level instructions. With 20,000+ GitHub stars, it has been adopted by GitHub Copilot, OpenAI Codex, Google Gemini CLI, and multiple IDEs as the de facto way to communicate project context and coding conventions to AI agents.
No-code knowledge base platform with visual AI workflow and built-in RAG
FastGPT is an open-source no-code AI knowledge base platform with 27,000+ GitHub stars and 500,000+ users worldwide. It combines visual workflow orchestration, built-in RAG pipelines, QA-pair extraction, and API-aligned completions into a single deployable stack that runs on just 2GB RAM via Docker one-liner deployment.
Auto-generates interactive ER diagrams from database schemas
Liam ERD automatically generates beautiful, interactive entity-relationship diagrams from existing database schemas. It supports Rails schema.rb, PostgreSQL, Prisma, and raw SQL — with zero configuration required. A web version handles public repos instantly, while the CLI processes private repositories with CI/CD integration for keeping documentation in sync.
Instant MCP server for any GitHub repository
GitMCP is a free, open-source remote MCP server that transforms any GitHub repository or GitHub Pages site into an AI-accessible documentation hub. Just replace github.com with gitmcp.io in any repo URL to give AI assistants grounded context about that project — eliminating code hallucinations with zero configuration required.
AI documentation tool with video-to-docs conversion
Trupeer is an AI documentation platform that converts screen recordings and videos into structured technical documentation with AI-generated voiceovers. It automates the creation of step-by-step guides, API documentation, and SaaS help articles by analyzing video content and generating written docs with annotated screenshots. Useful for teams creating onboarding materials and product documentation.
Auto-generate API docs from live production traffic
Levo.ai generates OpenAPI specifications and API documentation automatically from live production traffic using eBPF-based instrumentation. It captures real request/response patterns to create always-accurate API docs that eliminate documentation drift. Features include automated API discovery, security testing of documented endpoints, and Postman collection generation without requiring code changes.
Generate production-ready SDKs from API specifications
Stainless generates idiomatic, production-ready client SDKs from OpenAPI specifications with features like automatic pagination, retries, streaming, and proper error handling baked in. Used by Anthropic, OpenAI, Cloudflare, and Lithic to maintain their official SDKs. Produces TypeScript, Python, Go, Java, and Kotlin SDKs that match hand-written quality with automated maintenance and versioning.
Generate SDKs and API documentation from OpenAPI specs
Fern is a platform that generates idiomatic client SDKs and beautiful API documentation from OpenAPI specifications. It produces type-safe SDKs for TypeScript, Python, Java, Go, Ruby, and C# with features like pagination, retries, and streaming built in. The documentation includes API reference, guides, and an AI-powered search assistant. Used by companies like Cohere, ElevenLabs, and Merge.
Instant AI API documentation from code snippets
DevDocsAI is a fast-utility AI documentation generator where developers paste code snippets and receive structured API documentation in 30 seconds. It targets individual makers and small teams who need quick documentation for specific code segments without the overhead of setting up a full documentation platform, providing rapid turnaround for function and endpoint documentation.
AI-powered marketing-ready READMEs from code
PitchDocs is an AI documentation generator that scans codebases to produce marketing-ready READMEs, changelogs, and AI context files like llms.txt. It bridges the gap between raw source code and consumer-ready technical communication, targeting the growing need for high-quality README files that serve both human developers and AI agents accessing documentation.
AI localization engine for software and docs
Lingo.dev is a Y Combinator F24-backed AI localization engine that automates translation of software interfaces and technical documentation into multiple languages while maintaining technical context. It preserves code variables, formatting tokens, and technical terminology that generic translation tools mangle, solving the global reach problem for developer-facing products and documentation.
IDE documentation that stays synced with code
Swimm uses AI to keep code documentation in sync with real-time code changes, providing interactive walkthroughs directly in VS Code and JetBrains IDEs. It solves the stale documentation problem by making docs part of the development workflow rather than a separate artifact, automatically detecting when code changes invalidate existing documentation and suggesting updates.
Developer-focused documentation with AI assistance
Archbee is a developer-focused documentation platform with AI assistance for writing technical guides, API documentation, and internal knowledge bases. Built specifically for engineering teams rather than general content writers, it provides specialized code blocks, team collaboration spaces, API reference generators, and version-controlled documentation alongside AI-powered writing suggestions.
AI documentation generator from code repositories
DocuWriter.ai creates technical documentation from code repositories by scanning files, understanding patterns, and generating structured documentation including API references, function descriptions, and architectural overviews. It supports batch processing for entire repositories, serving as a zero-to-one documentation tool for undocumented codebases that need initial documentation coverage quickly.
Auto-generate SDKs and docs from backend routes
Speakeasy uses AI to parse backend API routes and auto-generate production-ready SDKs in multiple languages alongside OpenAPI documentation. It ensures that client libraries and documentation always match the current backend implementation, eliminating the drift between API code, SDK behavior, and documentation that plagues most API-first teams as their services evolve.
AI-powered Stripe-style API documentation
Theneo is a Y Combinator-backed tool that generates high-quality, Stripe-style API documentation in seconds using AI. It transforms OpenAPI specs, Postman collections, or raw code into interactive developer portals with code playgrounds, multi-language SDK examples, and beautiful visual design, helping API-first teams create professional documentation without spending weeks on manual writing.