Milvus is an open source (Apache-2.0 licensed) vector database built to power embedding similarity search and AI applications. Milvus's architecture is designed to handle large-scale vector datasets and includes various deployment modes: Milvus Standalone, and Milvus Distributed, to accommodate different data scale needs.
Milvus supports two deployment modes to accommodate different scale requirements:
A lightweight deployment suitable for development and testing:
A distributed deployment for production workloads with multiple specialized components:
Access Layer
Compute Layer
Coordination Layer
Storage Layer
KubeBlocks simplifies Milvus operations with comprehensive lifecycle management:
Feature | Description |
---|---|
Horizontal Scaling | Scale replicas in/out to adjust capacity |
Vertical Scaling | Adjust CPU/memory resources for Milvus instances |
Restart Operations | Controlled cluster restarts with minimal disruption |
Start/Stop | Temporarily suspend/resume cluster operations |
Custom Services | Expose specialized database endpoints |
Replica Management | Safely decommission or rebuild specific replicas |
Version Upgrades | Perform minor version upgrades seamlessly |
Advanced Scheduling | Customize pod placement and resource allocation |
Monitoring | Integrated Prometheus metrics collection |
Logging | Centralized logs via Loki Stack |
KubeBlocks Milvus Addon supports these Milvus versions:
Major Version | Supported Minor Versions |
---|---|
2.3 | 2.3.2 |
The list of supported versions can be found by following command:
kubectl get cmpv milvus