aicoolies logo
Dolt logo

Dolt

Git for data — version-controlled SQL database with branch, merge, and diff

Share
open-sourceOpen Source
Visit Website →

Dolt is a SQL database that implements Git-style version control directly on structured data. Table changes can be staged, committed, branched, merged, diffed, and reverted through SQL workflows and a Git-like CLI. It speaks the MySQL wire protocol so existing MySQL clients, ORMs, and tools can connect with minimal driver changes. Dolt is used for AI training data management, reproducible analytics, collaborative data editing, and agent-memory experiments.

We have a review for this tool

A detailed review by the aicoolies team — click to read

Dolt is an open-source database that brings the full Git versioning model to structured data. Under the hood it implements a storage engine built on prolly trees that makes branch, merge, diff, blame, and log operations first-class SQL-accessible workflows rather than external tooling. MySQL wire protocol compatibility means applications and tools that already connect to MySQL can usually connect to Dolt without changing database drivers, then use Dolt's commit, branch, and diff model for tables.

The versioning primitives map directly to Git concepts. Developers create branches to experiment with data transformations, diff branches to see exactly which rows changed, merge branches with automatic three-way conflict resolution, and tag releases for reproducibility. The system call interface exposes these operations as stored procedures, so version control happens inside SQL transactions rather than through a separate CLI. DoltHub provides a hosted collaboration platform where teams can fork, clone, and submit pull requests on databases.

Dolt has found strong adoption in AI and ML workflows where training data lineage and reproducibility are critical. Teams use branches to A/B test dataset modifications, diff to audit data quality changes, and the full commit history as an audit trail. The same branch-and-merge model can also support agent-memory experiments when teams need isolated working state and reviewed merges. Dolt is Apache 2.0 licensed with about 23K GitHub stars and a commercial hosted offering called Hosted Dolt.

Pricing

Free open-source core; Hosted Dolt managed service available

Platforms

Windows, Linux, macOS, Docker

Categories

Tags

Use Cases

Alternatives

Neon logo

Neon

Serverless Postgres

Serverless Postgres platform separating storage and compute for branching, autoscaling, read replicas, instant restore, and scale-to-zero workloads. Neon works with standard PostgreSQL clients and ORMs, supports extensions such as pgvector, and sits inside a broader Neon backend platform with Auth, Data API, Functions, Object Storage, and AI Gateway features.

freemiumOpen Source
PlanetScale logo

PlanetScale

MySQL-compatible serverless database

Relational database platform for MySQL and Postgres with Vitess-backed MySQL scale, PlanetScale Postgres, query insights, deploy-request workflows, and Database Traffic Control. It fits production teams that need managed relational performance, safe schema changes, replicas, and database expertise rather than a simple hobby database.

paid
Turso logo

Turso

SQLite for production

Edge-hosted distributed database built on libSQL (an open-source fork of SQLite) designed for low-latency data access worldwide. Features multi-region replication, embedded replicas that sync to your application server for microsecond reads, database branching for development workflows, and point-in-time recovery. Ideal for edge computing, serverless functions, and mobile apps. Compatible with SQLite ecosystem tooling. Generous free tier with 9GB storage and 500 databases.

freemiumOpen Source
lakeFS logo

lakeFS

Git-like version control for data lakes and object storage

lakeFS is an open-source platform that brings Git-like branching, committing, and merging to data lakes and object storage. It works on top of S3, GCS, Azure Blob, and MinIO, enabling teams to create isolated data branches for experimentation, run CI/CD for data pipelines, and maintain full data lineage. Acquired DVC in 2025, uniting data version control for both small and enterprise-scale workloads.

freemiumOpen Source

Related Tools

Supabase MCP

MCP server for connecting AI assistants to Supabase projects

Supabase MCP is Supabase's Apache-2.0 server for connecting AI assistants to Supabase projects. It can expose database, configuration, and project-management workflows to MCP clients such as Cursor, Claude, and Windsurf, while the official docs emphasize permission and security review before production use, SQL changes, or high-privilege database access.

open-sourceOpen SourceTelemetry
Ardent logo

Ardent

Database branching for coding agents

Ardent is a Postgres database branching platform built for coding-agent workflows. It creates isolated database copies in seconds so Claude Code, Codex, Cursor, or human developers can test migrations, clean data, reproduce bugs, and run risky experiments without touching production. The strongest fit is teams already using Postgres who need agent-safe dev/test databases rather than another generic serverless database.

freemium
VectorChord logo

VectorChord

High-recall Postgres vector search at billion scale

VectorChord is a Postgres extension from the supervc-stack/VectorChord project that brings high-recall vector search to PostgreSQL. As the spiritual successor to pgvecto.rs, it combines IVF indexes with RaBitQ quantization to deliver Pinecone-class performance at billion-vector scale while keeping all data inside a single Postgres database — no separate vector store, no two-system sync, no rewrites when the workload grows.

open-sourceOpen Source
Infinity logo

Infinity

AI-native database for hybrid RAG retrieval

Infinity is an AI-native database from InfiniFlow that unifies dense vectors, sparse vectors, tensors, and full-text search in a single engine. Built for retrieval-augmented generation (RAG) at scale, it powers hybrid search workflows where lexical matching, semantic similarity, and reranking all happen against one storage layer instead of four loosely coupled services.

open-sourceOpen Source
sqlite-vec logo

sqlite-vec

Vector search extension for SQLite that runs anywhere

sqlite-vec is a lightweight vector search extension for SQLite written in pure C with zero dependencies. It brings nearest-neighbor search capabilities directly into SQLite databases, enabling AI applications to store and query embeddings without running a separate vector database. The extension works everywhere SQLite runs including Linux, macOS, Windows, WebAssembly in browsers, and even Raspberry Pi devices. Sponsored by Mozilla Builders, Fly.io, and Turso.

freeOpen Source
WeKnora logo

WeKnora

Enterprise RAG framework by Tencent

WeKnora is a Tencent-developed LLM-powered knowledge management and Q&A framework for enterprise document understanding and semantic retrieval. Supports 10+ document formats including PDF, Word, Excel, and images with seamless IM platform integration for WeCom, Feishu, Slack, and Telegram. Offers Quick Q&A mode using RAG pipelines and Intelligent Reasoning mode with ReACT agents for complex multi-step reasoning tasks across organizational knowledge bases.

open-sourceOpen Source

Used in Stacks

Comparisons