Open-source metadata platform for data discovery
DataHub is an open-source metadata platform for data discovery, governance, and observability, originally developed at LinkedIn. It provides a centralized catalog with 80+ integrations for data warehouses, lakes, dashboards, and ML platforms. DataHub offers real-time metadata ingestion, column-level lineage tracking, automated quality checks, and fine-grained access policies. Used by 3,000+ organizations in production. Apache 2.0 licensed with 11.8K+ GitHub stars.