Vespa

Hybrid search and ML ranking engine at scale

open-sourceOpen Source

Vespa is an open-source serving engine with 6K+ GitHub stars for hybrid search combining vector similarity, BM25 text ranking, and structured filtering in a single query. Built by Yahoo for web-scale, it handles billions of documents with millisecond latency. Features real-time indexing, ML model serving, tensor computation, and ACID-compliant writes. Supports custom ranking models, query federation, and geographic search. Used for recommendation systems, personalization, and RAG.

Vespa combines vector, text, and structured search in one engine. Billions of documents, millisecond latency. Built by Yahoo for web-scale.

Real-time indexing, built-in ML model serving, tensor computation. ACID-compliant partial updates.

Custom ranking models, query federation, geographic search. Self-hosted or Vespa Cloud managed.

Pricing

Free open-source / Vespa Cloud available

Platforms

Self-hosted, Docker, Vespa Cloud

Use Cases

Data Engineering Backend API Design

Alternatives

LanceDB

Embedded vector database for multimodal AI with petabyte scale

LanceDB is an open-source embedded vector database built on the Lance columnar format for multimodal AI. It delivers near in-memory performance from disk with zero-copy architecture, supporting vector search, full-text search, and SQL. Native SDKs for Python, TypeScript, and Rust integrate with LangChain, LlamaIndex, and DuckDB. Backed by a $30M Series A, used by Harvey AI and Runway, with 18,000+ GitHub stars.

open-sourceOpen Source

turbopuffer

Serverless vector and full-text search on object storage

turbopuffer is a serverless vector and full-text search engine built on object storage and vendor-positioned as roughly 10x cheaper than traditional vector databases. Used by Anthropic, Cursor, Notion, and Atlassian for production search workloads. Official site reports 4T+ documents, 10M+ writes/s, and 25k+ queries/s in production systems. Funded by Thrive Capital.

paid

Ragie

Fully managed RAG-as-a-Service platform for enterprise AI applications

Ragie is a managed retrieval-augmented generation platform that handles document ingestion, indexing, and retrieval so developers can build grounded AI applications without managing vector databases or chunking pipelines. It connects to Google Drive, Notion, Slack, Confluence, and other enterprise data sources with simple APIs for hybrid search and entity extraction.

api-usage-based

Related Tools

Supabase MCP

MCP server for connecting AI assistants to Supabase projects

Supabase MCP is Supabase's Apache-2.0 server for connecting AI assistants to Supabase projects. It can expose database, configuration, and project-management workflows to MCP clients such as Cursor, Claude, and Windsurf, while the official docs emphasize permission and security review before production use, SQL changes, or high-privilege database access.

open-sourceOpen SourceTelemetry

Deep Lake

AI data runtime for multimodal datasets and vector search

Deep Lake is an open-source AI data runtime from Activeloop for storing, versioning, and querying multimodal data and embeddings. It fits teams building RAG, training, evaluation, or dataset-heavy agent workflows that need a bridge between vector search, structured metadata, and large image, text, audio, or video collections.

open-sourceOpen Source

SeekDB

AI-native state store with hybrid vector and full-text search

SeekDB is an open-source AI-native state store from the OceanBase ecosystem that combines MySQL-compatible data access with hybrid vector and full-text retrieval. It targets agent and AI application teams that need embedded or server deployment, copy-on-write style sandboxes, and searchable state without gluing together several separate storage layers.

open-sourceOpen Source

pgvectorscale

DiskANN-powered vector search extension for PostgreSQL

pgvectorscale is an open-source PostgreSQL extension from Timescale that complements pgvector with DiskANN-based approximate vector search. It is useful for teams that want faster embedding retrieval while keeping vectors, filters, and application data inside the Postgres ecosystem instead of adopting a separate hosted vector database.

open-sourceOpen Source

Ardent

Database branching for coding agents

Ardent is a Postgres database branching platform built for coding-agent workflows. It creates isolated database copies in seconds so Claude Code, Codex, Cursor, or human developers can test migrations, clean data, reproduce bugs, and run risky experiments without touching production. The strongest fit is teams already using Postgres who need agent-safe dev/test databases rather than another generic serverless database.

freemium

Vald

Cloud-native distributed vector search engine built for Kubernetes with automatic indexing and horizontal scaling.

Vald is a highly scalable distributed approximate nearest neighbor (ANN) vector search engine designed for cloud-native, Kubernetes-based architectures. Maintained by LY Corporation and listed in the CNCF Landscape, it uses the NGT algorithm (developed at Yahoo Japan), supports automatic incremental index backup, and handles billion-scale datasets across loosely coupled microservice components that scale horizontally via Helm.

open-sourceOpen Source

Used in Stacks

Kubernetes FinOps Stack

A complete Kubernetes cost optimization and operations stack combining AI-powered cost reduction, open-source cost monitoring, intelligent troubleshooting, and virtual cluster management. These tools work together to cut cloud spending while maintaining performance and reliability.

varies

Vespa

Pricing

Platforms

Categories

Tags

Use Cases

Alternatives

LanceDB

turbopuffer

Ragie

Related Tools

Supabase MCP

Deep Lake

SeekDB

pgvectorscale

Ardent

Vald

Used in Stacks