Milvus is the stronger choice for most teams building GenAI and similarity search applications. It offers broader index support, hybrid search, flexible deployment from laptop to distributed cluster, and a managed cloud option. Vald is a better fit for Kubernetes-native teams that value asynchronous indexing and a lightweight, focused vector search engine without the overhead of a full database layer.
| Feature | Vald | Milvus |
|---|---|---|
| ANN Algorithm | NGT (Neighborhood Graph and Tree) by Yahoo Japan Research | Multiple: IVF_FLAT, HNSW, DiskANN, SCANN, IVF_SQ8 |
| Architecture | Kubernetes-native with distributed agents; each agent holds an index shard | Disaggregated storage/compute with fully stateless query, data, and index nodes |
| Deployment Flexibility | Kubernetes required for all deployments; Helm chart provided | Milvus Lite (pip install), Standalone (Docker), Distributed (K8s), Zilliz Cloud (managed) |
| Query Features | Pure vector similarity search with custom gRPC ingress/egress filters | Hybrid search, metadata filtering, multi-vector search, boolean expressions |
| Indexing Strategy | Asynchronous auto-indexing with zero stop-the-world pauses | Segment-based with background compaction; multiple index algorithm choices |
| Best For | Kubernetes-native teams needing a lightweight vector search engine with zero-downtime indexing | Teams building GenAI apps that need hybrid search, flexible deployment, and a mature ecosystem |
| Feature | Vald | Milvus |
|---|---|---|
| **Core Architecture** | | |
| Primary ANN algorithm | NGT (Neighborhood Graph and Tree) | Multiple: IVF_FLAT, IVF_SQ8, HNSW, SCANN, DiskANN |
| Storage-compute separation | No; agents hold both index and compute | Yes; fully disaggregated since Milvus 2.0 |
| Stateless components | Partially; gateway and filter components are stateless | Fully stateless query, data, and index nodes |
| Coordination service | Kubernetes-native service discovery | etcd for metadata and service coordination |
| **Indexing & Search** | | |
| Async auto-indexing | Built-in; no stop-the-world pauses during index builds | Segment-based with background compaction |
| Index replication | Automatic multi-agent replication with rebalancing | Replica groups with configurable replication factor |
| Metadata filtering | Custom ingress/egress gRPC filters only | Native boolean expressions on scalar fields |
| Hybrid search | Not natively supported | Combines vector similarity with scalar filtering |
| Multi-vector search | Not supported | Supported across multiple vector fields per collection |
| **Deployment & Operations** | | |
| Kubernetes-native deployment | Required; Helm chart provided | Supported via Helm and Milvus Operator |
| Lightweight local mode | Not available; requires full Kubernetes cluster | Milvus Lite runs in-process via pip install |
| Managed cloud offering | No managed service available | Zilliz Cloud (serverless and dedicated clusters) |
| Index backup and recovery | Auto-backup to object storage or persistent volumes | Snapshots via MinIO/S3 object storage |
| **Developer Experience** | | |
| SDK languages | Go, Java, Node.js, Python | Python, Java, Go, Node.js, C# (plus a RESTful API) |
| API protocol | gRPC only | gRPC and RESTful HTTP |
| Community size | Smaller niche community backed by Yahoo Japan (vdaas) | Large community with 30K+ GitHub stars and active contributors |
Choose Vald if:
Your infrastructure is already Kubernetes-native and you need a focused, distributed vector search engine with zero-downtime asynchronous indexing. Vald works well for teams that want fine-grained control over index sharding and replication without the complexity of a full-featured database system.
Choose Milvus if:
You need a production-grade vector database with hybrid search, metadata filtering, multiple index algorithms, and a path from local prototyping (Milvus Lite) to distributed deployment or managed cloud (Zilliz Cloud). Milvus is the better choice for GenAI applications, RAG pipelines, and teams that want a mature ecosystem with extensive tooling.
This verdict is based on general use cases. Your specific requirements, existing tech stack, and team expertise should guide your final decision.
Milvus is the stronger choice for RAG (Retrieval Augmented Generation) applications. It provides native metadata filtering that lets you scope vector searches by document source, timestamp, or any scalar attribute, which is essential for grounding LLM responses in relevant context. Milvus also supports hybrid search combining dense vectors with sparse vectors or keyword matching, improving retrieval precision. Vald handles the vector search component effectively but lacks built-in metadata filtering and hybrid search, meaning you would need to build those layers yourself on top of Vald's gRPC filter system.
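To make the filtering pattern concrete, here is a minimal, framework-free sketch of metadata-scoped vector search in plain Python (brute-force cosine similarity; all field names and data are illustrative, not the Milvus API):

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

# Toy corpus: each entry pairs an embedding with scalar metadata.
docs = [
    {"id": 1, "vector": [0.9, 0.1], "source": "wiki", "year": 2023},
    {"id": 2, "vector": [0.8, 0.2], "source": "blog", "year": 2021},
    {"id": 3, "vector": [0.1, 0.9], "source": "wiki", "year": 2024},
]

def filtered_search(query, docs, predicate, top_k=2):
    """Scope the candidate set with a metadata predicate, then rank by similarity."""
    candidates = [d for d in docs if predicate(d)]
    return sorted(candidates, key=lambda d: cosine(query, d["vector"]), reverse=True)[:top_k]

# Equivalent in spirit to a Milvus filter expression like: source == "wiki" and year >= 2023
hits = filtered_search([1.0, 0.0], docs, lambda d: d["source"] == "wiki" and d["year"] >= 2023)
print([h["id"] for h in hits])  # → [1, 3]
```

Milvus evaluates this kind of predicate natively on indexed scalar fields; with Vald you would implement the predicate yourself inside an ingress/egress gRPC filter.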
Yes, both engines are designed to scale to billions of vectors. Vald achieves this by distributing index shards across multiple Kubernetes agents, where each agent holds a different portion of the index. Milvus scales through its disaggregated architecture, where query nodes, data nodes, and index nodes can be scaled independently. In practice, Milvus has been benchmarked more extensively at this scale and offers more tuning options through its multiple index types (IVF, HNSW, DiskANN). Vald's NGT-based approach is highly performant but provides fewer knobs for optimizing recall-vs-latency tradeoffs at extreme scale.
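Vald's shard distribution can be pictured with a simple hash-based routing sketch (illustrative only; Vald's actual agent placement and rebalancing logic is more sophisticated than a static hash):

```python
import hashlib

NUM_AGENTS = 4  # illustrative agent count

def agent_for(vector_id: str, num_agents: int = NUM_AGENTS) -> int:
    """Route a vector ID to an agent by hashing; each agent indexes only its shard."""
    digest = hashlib.sha256(vector_id.encode()).digest()
    return int.from_bytes(digest[:8], "big") % num_agents

# At query time, a gateway fans the search out to all agents and merges their top-k results.
shards = {}
for vid in (f"vec-{i}" for i in range(1000)):
    shards.setdefault(agent_for(vid), []).append(vid)

print({agent: len(ids) for agent, ids in sorted(shards.items())})
```

The same fan-out-and-merge idea underlies Milvus's query nodes, except Milvus routes by segment rather than by per-vector hash, and each node type scales independently.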
Vald requires Kubernetes for all deployments. It is fundamentally designed around Kubernetes primitives for service discovery, scaling, and agent management, and there is no standalone binary or Docker Compose option. Milvus is more flexible: Milvus Lite runs in-process with a simple pip install for prototyping, Milvus Standalone runs as a single Docker container for small-scale production, and Milvus Distributed uses Kubernetes for full horizontal scaling. If you do not have a Kubernetes cluster, Milvus is the only viable option of the two.
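The three Milvus deployment paths look roughly like this (commands paraphrased from typical Milvus installation guides; verify image tags, compose files, and chart names against the current docs for your version):

```shell
# Prototyping: Milvus Lite runs in-process, installed with the Python SDK
pip install -U pymilvus

# Small-scale production: Milvus Standalone via Docker Compose
# (fetch the compose file for your target version from milvus.io first)
docker compose up -d

# Full horizontal scaling: Milvus Distributed on Kubernetes via Helm
helm repo add milvus https://zilliztech.github.io/milvus-helm/
helm install my-milvus milvus/milvus
```

Vald, by contrast, offers only the Helm-on-Kubernetes path.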
Vald uses Yahoo Japan's NGT algorithm exclusively and performs asynchronous auto-indexing. This means new vectors are searchable without requiring a stop-the-world pause to rebuild the index graph. The distributed index is spread across agents, and rebalancing happens automatically when agents join or leave the cluster. Milvus takes a segment-based approach where data is first buffered in a growing segment, then sealed and indexed in the background. Milvus supports multiple index algorithms including IVF_FLAT for brute-force accuracy, HNSW for low-latency graph search, and DiskANN for cost-efficient billion-scale deployments using SSD storage.
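As a sketch of what choosing among Milvus's index types looks like in practice, here are index-parameter shapes in the style of the Milvus Python SDK's `create_index()` call (field names follow common Milvus usage; values are illustrative starting points, not tuned recommendations, so check the SDK docs for your version):

```python
# Index-parameter dictionaries in the style of Milvus's create_index() call.

hnsw_index = {
    "index_type": "HNSW",        # low-latency in-memory graph search
    "metric_type": "COSINE",
    "params": {"M": 16, "efConstruction": 200},
}

ivf_flat_index = {
    "index_type": "IVF_FLAT",    # cluster-then-scan; exact distances within probed lists
    "metric_type": "L2",
    "params": {"nlist": 1024},
}

diskann_index = {
    "index_type": "DISKANN",     # SSD-resident graph for billion-scale collections
    "metric_type": "IP",
}

for cfg in (hnsw_index, ivf_flat_index, diskann_index):
    print(cfg["index_type"], cfg["metric_type"])
```

Vald exposes no equivalent choice: every agent builds an NGT graph, and tuning happens through NGT's own parameters (such as edge counts and search epsilon) in the Helm values.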
Milvus has a significantly larger community and ecosystem. It is backed by Zilliz, has over 30,000 GitHub stars, and integrates with popular AI frameworks including LangChain, LlamaIndex, Haystack, and Semantic Kernel. Milvus also has dedicated tools like Attu (GUI management), Birdwatcher (diagnostics), and VDBBench (benchmarking). Vald is maintained by the vdaas organization (Yahoo Japan's research division) and has a smaller but focused community. Vald's ecosystem is more minimal, with fewer third-party integrations and tools. For teams that value extensive documentation, community support, and framework integrations, Milvus has a clear advantage.