Vald is a cloud-native distributed vector search engine designed from the ground up for Kubernetes. Maintained by LY Corporation (the Japanese tech company behind LINE and Yahoo! Japan) and listed in the CNCF Landscape, it uses the NGT (Neighborhood Graph and Tree) algorithm — developed internally at Yahoo Japan and consistently among the fastest approximate nearest neighbor algorithms in benchmark comparisons. Vald is Apache 2.0 licensed, built in Go, and has shipped over 110 releases since launch.
Architecturally, Vald breaks the vector search workload into loosely coupled Kubernetes components — an LB Gateway for request routing, a Discoverer for service discovery, Agent pods that hold index shards in memory, and an Index Manager that orchestrates updates and backups. Each component scales independently via Helm charts, and the system supports automatic incremental index backup to object storage so recovery does not require full re-indexing. Billion-scale vector deployments are a native design goal rather than a stretch case, and horizontal scaling happens without downtime.
For teams that already operate Kubernetes as a first-class platform and need a vector engine that scales with the same primitives as the rest of their infrastructure, Vald offers a production-validated, sovereignty-friendly alternative to managed vector databases. Official gRPC SDKs cover Go, Python, Node.js, and Java. The tradeoff is operational: Vald assumes a team comfortable writing Helm values and managing pod lifecycles, which is steeper than running a single-binary engine like Qdrant or an embedded library like FAISS or hnswlib. For ML platform teams at scale, that operational surface is the point rather than the cost.
