Database platforms, management tools, ORMs, and migration utilities
Showing 24 of 71 tools
TypeScript ORM that feels like SQL
Lightweight, serverless-ready TypeScript ORM with zero dependencies bringing SQL-like syntax into TypeScript for maximum type-safety. Maps queries closely to raw SQL with full auto-completion and type inference. Features Drizzle Kit for migrations, Drizzle Studio for visual browsing, supports PostgreSQL, MySQL, SQLite, and Turso. Runs on Node.js, Bun, Deno, Cloudflare Workers, and Vercel Edge. 26K+ GitHub stars, growing fast as a Prisma alternative.
Context retrieval layer for AI agents and RAG
Airweave is an open-source context retrieval platform that connects AI agents and RAG systems to 50+ apps and databases through a unified search interface. It continuously syncs data from sources like Notion, Slack, GitHub, and databases, making it searchable through LLM-friendly APIs. Airweave includes Python and TypeScript SDKs, MCP support, and a CLI for managing data connections.
Google Zanzibar-inspired authorization database
SpiceDB is an open-source authorization database inspired by Google's Zanzibar system, providing relationship-based access control (ReBAC) at scale. It defines permissions through a schema language that models relationships between users, resources, and roles, then evaluates authorization checks in single-digit milliseconds. Used by companies like Netflix and GitHub, SpiceDB handles millions of permission checks per second.
Full-text and vector search engine in under 2KB
Orama is a complete search engine and RAG pipeline that runs in browsers, servers, and edge environments in under 2KB. It provides full-text search, vector search, and hybrid search with built-in faceting, filters, geo-search, and typo tolerance. Orama requires no external dependencies and works entirely client-side for instant search experiences, or server-side with Node.js and Deno for larger datasets.
Open-source high-performance key/value datastore
Valkey is a Linux Foundation-backed open-source fork of Redis 7.2.4 under BSD 3-Clause license, created after Redis changed to restrictive licensing. It provides a drop-in Redis replacement for caching, message queues, and real-time data workloads. Valkey 9 adds JSON, Bloom filters, vector search, cluster scaling to 2000 nodes, and over one billion requests per second in benchmarks. Backed by AWS, Google, Oracle, and Ericsson.
Python library for declarative data loading that LLMs can generate
dlt (data load tool) is a Python library for building data pipelines with declarative, schema-aware loading that is simple enough for LLMs to generate correctly. It extracts data from APIs, databases, and files, normalizes nested structures, handles schema evolution, and loads into warehouses and lakes. Supports 30+ destinations including BigQuery, Snowflake, DuckDB, and PostgreSQL. Over 5,200 GitHub stars.
Managed Postgres platform with 200+ extensions as pre-built stacks
Tembo is a managed PostgreSQL platform that packages 200+ Postgres extensions into purpose-built stacks for specific workloads. Stacks include OLAP analytics, vector search, message queues, geospatial, and machine learning, turning PostgreSQL into a specialized database for each use case. Eliminates the need for separate Redis, Elasticsearch, or Kafka instances alongside Postgres.
Postgres sync engine for local-first and real-time applications
ElectricSQL is a sync engine that keeps local application state synchronized with PostgreSQL in real-time. It enables local-first architectures where apps work offline with instant responsiveness, syncing data bidirectionally when connectivity is available. Supports partial replication with shape-based subscriptions to sync only relevant data subsets to each client.
Graph-relational database with EdgeQL query language, formerly EdgeDB
Gel (formerly EdgeDB) is a graph-relational database that combines the relational model with graph database traversal capabilities through its EdgeQL query language. Built on PostgreSQL, it eliminates the object-relational impedance mismatch with a type system that maps directly to application data models. Features built-in migrations, authentication, and an interactive web UI.
Elasticsearch-quality full-text and hybrid search inside Postgres
ParadeDB brings Elasticsearch-quality full-text search, BM25 ranking, and hybrid vector-keyword search directly into PostgreSQL as native extensions. Backed by a 12 million dollar Series A with over 500,000 Docker deployments, it eliminates the overhead of running separate search infrastructure. Teams get powerful search within their existing Postgres stack without managing additional clusters.
Multi-model database for the AI era — document, graph, vector, and relational in one
SurrealDB is a multi-model database that natively combines document, graph, relational, key-value, and vector storage in a single engine. It eliminates the need for separate databases by handling structured queries, graph traversals, full-text search, and vector similarity in one SQL-like query language called SurrealQL. Built in Rust for performance and safety, it supports real-time subscriptions, row-level permissions, and embedded or distributed deployment modes.
Git for data — version-controlled SQL database with branch, merge, and diff
Dolt is a SQL database that implements Git-style version control directly on your data. Every write creates a commit, and you can branch, merge, diff, and revert tables just like source code. It speaks the MySQL wire protocol so existing MySQL clients, ORMs, and tools work out of the box. Dolt is used for AI training data management, reproducible analytics, collaborative data editing, and agent memory stores.
Conversational data analysis with natural language queries over databases
PandasAI enables natural-language queries against databases, data lakes, CSVs, and parquet files using LLMs and RAG pipelines. With 23,400+ GitHub stars, it bridges the gap between database tools and AI by letting developers and analysts interact with data conversationally, supporting SQL, PostgreSQL, and various file formats.
High-performance S3-compatible object storage built in Rust
RustFS is an open-source distributed object storage system built entirely in Rust, offering 2.3x faster performance than MinIO for small object payloads. It provides full S3 API compatibility, enabling seamless migration from MinIO, Ceph, and AWS S3 with existing SDKs and CLI tools. Released under Apache 2.0 license, it avoids MinIO's restrictive AGPL terms. Features include distributed architecture, erasure coding, WORM compliance, encryption via RustyVault, and a web management console.
Fully managed RAG-as-a-Service platform for enterprise AI applications
Ragie is a managed retrieval-augmented generation platform that handles document ingestion, indexing, and retrieval so developers can build grounded AI applications without managing vector databases or chunking pipelines. It connects to Google Drive, Notion, Slack, Confluence, and other enterprise data sources with simple APIs for hybrid search and entity extraction.
Financial transactions database designed for mission-critical safety and speed
TigerBeetle is a purpose-built database for financial transactions that prioritizes safety and performance above all else. Written in Zig, it provides strict debit-credit consistency, serializable isolation, append-only immutability, and multi-cloud high availability. Designed for ledgers, payment systems, and any application where losing or duplicating a transaction is catastrophic.
High-performance data engine for multimodal AI workloads
Daft is a high-performance distributed data engine designed specifically for AI and multimodal workloads. It processes structured data alongside images, audio, video, and embeddings natively, outperforming Spark and Polars on AI-specific data pipelines. Built in Rust with a Python API, Daft handles the data engineering challenges unique to machine learning workflows.
SQL-native memory infrastructure for AI agents and applications
Memori is an AI memory engine that provides persistent, queryable memory for agents and applications using SQL-native storage. It stores structured memories with semantic search, temporal awareness, and relationship tracking, enabling AI systems to remember user preferences, past interactions, and contextual facts across sessions. With 12,900 GitHub stars, it offers a database-native approach to the agent memory problem.
Build real-time temporal knowledge graphs for AI agents
Graphiti is an open-source Python framework by Zep for building temporally-aware knowledge graphs for AI agents. It continuously integrates conversations, business data, and external information into queryable graphs with bi-temporal tracking. The hybrid retrieval combines semantic search, BM25 keywords, and graph traversal for sub-300ms queries without LLM calls at retrieval time.
BM25 full-text search extension for PostgreSQL
pg_textsearch is a PostgreSQL extension from Timescale that adds BM25 relevance-ranked full-text search directly inside Postgres. Using the same ranking algorithm as Elasticsearch and Lucene, it provides search-engine quality results without requiring a separate search cluster — particularly valuable for developers building RAG pipelines on PostgreSQL who want semantic-quality ranking alongside pgvector.
Auto-generates interactive ER diagrams from database schemas
Liam ERD automatically generates beautiful, interactive entity-relationship diagrams from existing database schemas. It supports Rails schema.rb, PostgreSQL, Prisma, and raw SQL — with zero configuration required. A web version handles public repos instantly, while the CLI processes private repositories with CI/CD integration for keeping documentation in sync.
Open data platform wrapping any SQL database with instant APIs
Directus is a database-first open data platform that wraps any existing SQL database with instant REST and GraphQL APIs, a no-code admin dashboard, and built-in authentication. Unlike traditional headless CMS platforms that impose their own data model, Directus introspects your existing schema and works on top of it — supporting PostgreSQL, MySQL, SQLite, MS SQL, MariaDB, and CockroachDB out of the box.
Serverless vector and full-text search on object storage
turbopuffer is a serverless vector and full-text search engine built on object storage that delivers 10x lower costs than traditional vector databases. Used by Anthropic, Cursor, Notion, and Atlassian for production search workloads. Manages 2+ trillion vectors across 8+ petabytes with automatic scaling and no infrastructure management. Funded by Thrive Capital.
Open-source ELT platform with 350+ data connectors
Airbyte is an open-source ELT platform with 350+ pre-built connectors for syncing data from any source to warehouses, lakes, and AI pipelines. It handles incremental syncs, schema evolution, and change data capture with a connector builder for custom integrations. Used by DoorDash, Replit, and thousands of data teams. Over 15,000 GitHub stars and $150M+ in funding.