aicoolies logo
Vald logo

Vald

Cloud-native distributed vector search engine built for Kubernetes with automatic indexing and horizontal scaling.

Share
open-sourceOpen Source
Visit Website →

Vald is a highly scalable distributed approximate nearest neighbor (ANN) vector search engine designed for cloud-native, Kubernetes-based architectures. Maintained by LY Corporation and listed in the CNCF Landscape, it uses the NGT algorithm (developed at Yahoo Japan), supports automatic incremental index backup, and handles billion-scale datasets across loosely coupled microservice components that scale horizontally via Helm.

Vald is a cloud-native distributed vector search engine designed from the ground up for Kubernetes. Maintained by LY Corporation (the Japanese tech company behind LINE and Yahoo! Japan) and listed in the CNCF Landscape, it uses the NGT (Neighborhood Graph and Tree) algorithm — developed internally at Yahoo Japan and consistently among the fastest approximate nearest neighbor algorithms in benchmark comparisons. Vald is Apache 2.0 licensed, built in Go, and has shipped over 110 releases since launch.

Architecturally, Vald breaks the vector search workload into loosely coupled Kubernetes components — an LB Gateway for request routing, a Discoverer for service discovery, Agent pods that hold index shards in memory, and an Index Manager that orchestrates updates and backups. Each component scales independently via Helm charts, and the system supports automatic incremental index backup to object storage so recovery does not require full re-indexing. Billion-scale vector deployments are a native design goal rather than a stretch case, and horizontal scaling happens without downtime.

For teams that already operate Kubernetes as a first-class platform and need a vector engine that scales with the same primitives as the rest of their infrastructure, Vald offers a production-validated, sovereignty-friendly alternative to managed vector databases. Official gRPC SDKs cover Go, Python, Node.js, and Java. The tradeoff is operational: Vald assumes a team comfortable writing Helm values and managing pod lifecycles, which is steeper than running a single-binary engine like Qdrant or an embedded library like FAISS or hnswlib. For ML platform teams at scale, that operational surface is the point rather than the cost.

Pricing

Free and open-source (Apache 2.0). Self-hosted on Kubernetes via Helm charts.

Platforms

Kubernetes (Helm). Official gRPC SDKs for Go, Python, Node.js, and Java.

Categories

Tags

Use Cases

Alternatives

Qdrant logo

Qdrant

High-performance vector database written in Rust for similarity search at scale.

Qdrant is a high-performance vector similarity search engine and database written in Rust. Designed for production-grade AI applications with advanced filtering, payload indexing, and distributed deployment. Supports billion-scale vector collections with sub-second query times. Popular choice for RAG, recommendation systems, and anomaly detection.

freemiumOpen Source
Milvus logo

Milvus

GPU-accelerated open-source vector database

Milvus is an open-source vector database with 33K+ GitHub stars for billion-scale similarity search. Features GPU-accelerated indexing, hybrid search combining vector and scalar filtering, multi-tenancy, partitioning, and horizontal scaling. Supports HNSW, IVF, DiskANN, and GPU index types. SDKs for Python, Java, Go, and Node.js. Zilliz Cloud offers a managed version. A production-grade foundation for RAG pipelines and recommendation systems at enterprise scale.

open-sourceOpen Source
FAISS logo

FAISS

Library for efficient similarity search and clustering of dense vectors at billion-scale.

FAISS is Meta AI Research's open-source library for efficient similarity search and clustering of dense vectors. It implements approximate nearest-neighbor algorithms designed to scale to billions of vectors, with optimized indexes that fit in RAM and GPU acceleration for the largest workloads. Engineering teams use FAISS as the retrieval primitive underneath custom RAG pipelines, recommendation systems, and large-scale embedding search infrastructure.

free
hnswlib logo

hnswlib

Header-only C++ implementation of HNSW for fast approximate nearest-neighbor search.

hnswlib is a header-only C++ library implementing the Hierarchical Navigable Small World (HNSW) graph algorithm for approximate nearest-neighbor search, with Python bindings and a tiny dependency footprint. Originally developed by the nmslib team, it has become the default HNSW implementation embedded inside many vector databases and search products. Engineers use it directly when they want HNSW retrieval without pulling in a heavyweight vector DB.

free

Related Tools

Deep Lake

AI data runtime for multimodal datasets and vector search

Deep Lake is an open-source AI data runtime from Activeloop for storing, versioning, and querying multimodal data and embeddings. It fits teams building RAG, training, evaluation, or dataset-heavy agent workflows that need a bridge between vector search, structured metadata, and large image, text, audio, or video collections.

open-sourceOpen Source

SeekDB

AI-native state store with hybrid vector and full-text search

SeekDB is an open-source AI-native state store from the OceanBase ecosystem that combines MySQL-compatible data access with hybrid vector and full-text retrieval. It targets agent and AI application teams that need embedded or server deployment, copy-on-write style sandboxes, and searchable state without gluing together several separate storage layers.

open-sourceOpen Source

pgvectorscale

DiskANN-powered vector search extension for PostgreSQL

pgvectorscale is an open-source PostgreSQL extension from Timescale that complements pgvector with DiskANN-based approximate vector search. It is useful for teams that want faster embedding retrieval while keeping vectors, filters, and application data inside the Postgres ecosystem instead of adopting a separate hosted vector database.

open-sourceOpen Source

kubectl-ai

Google’s open-source Kubernetes assistant that translates natural-language intent into precise cluster operations.

kubectl-ai is an AI-powered Kubernetes assistant from Google Cloud Platform. It acts as an intelligent interface for cluster work, translating operator intent into Kubernetes commands and workflows. The key distinction from reactive diagnosis tools is that kubectl-ai is designed as an interactive natural-language interface for planning and executing Kubernetes operations, with provider configuration and MCP-oriented workflows around the CLI.

open-sourceOpen SourceTelemetry
Marqo logo

Marqo

Embedding-first search and discovery engine for AI-powered product experiences.

Marqo is an open-source tensor search engine that combines embedding generation and vector search in a single API, removing the need to manage separate embedding pipelines and vector databases. Built for product discovery and multi-modal search, it lets teams index text, images, and structured data together, returning ranked results based on semantic similarity rather than keyword overlap.

freemium
VectorChord logo

VectorChord

High-recall Postgres vector search at billion scale

VectorChord is a Postgres extension from the supervc-stack/VectorChord project that brings high-recall vector search to PostgreSQL. As the spiritual successor to pgvecto.rs, it combines IVF indexes with RaBitQ quantization to deliver Pinecone-class performance at billion-vector scale while keeping all data inside a single Postgres database — no separate vector store, no two-system sync, no rewrites when the workload grows.

open-sourceOpen Source

Comparisons