Milvus is an open-source (Apache-2.0 licensed) vector database built to power embedding similarity search and AI applications. Its architecture is designed to handle large-scale vector datasets and offers two deployment modes, Milvus Standalone and Milvus Distributed, to accommodate different data scales.
Milvus supports two deployment modes to accommodate different scale requirements:
Milvus Standalone: a lightweight deployment suitable for development and testing.
Milvus Distributed: a distributed deployment for production workloads, built from multiple specialized components organized into four layers:
- Access Layer
- Compute Layer
- Coordination Layer
- Storage Layer
KubeBlocks simplifies Milvus operations with comprehensive lifecycle management:
| Feature | Description |
|---|---|
| Horizontal Scaling | Scale replicas in/out to adjust capacity |
| Vertical Scaling | Adjust CPU/memory resources for Milvus instances |
| Restart Operations | Controlled cluster restarts with minimal disruption |
| Start/Stop | Temporarily suspend/resume cluster operations |
| Custom Services | Expose specialized database endpoints |
| Replica Management | Safely decommission or rebuild specific replicas |
| Version Upgrades | Perform minor version upgrades seamlessly |
| Advanced Scheduling | Customize pod placement and resource allocation |
| Monitoring | Integrated Prometheus metrics collection |
| Logging | Centralized logs via Loki Stack |
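As an illustration of the Horizontal Scaling row above, the following sketch renders an `OpsRequest` manifest that asks KubeBlocks to add replicas to one component. It rests on several assumptions that are not stated in this document: the `operations.kubeblocks.io/v1alpha1` API group, the `horizontalScaling[].scaleOut.replicaChanges` field layout, and the helper name `render_scale_out_ops` are all illustrative and may differ in your installed KubeBlocks release. Running `kubectl explain opsrequest.spec` against your cluster is the safest way to confirm the schema before applying anything.

```shell
#!/usr/bin/env bash
# Sketch only: prints an OpsRequest manifest to stdout; it does not contact a cluster.
# ASSUMPTION: apiVersion and field names follow a recent KubeBlocks release;
# verify them with `kubectl explain opsrequest.spec` before use.

# render_scale_out_ops <cluster> <component> [replicas]
# (hypothetical helper; cluster and component names are placeholders)
render_scale_out_ops() {
  local cluster="$1" component="$2" replicas="${3:-1}"
  if [ -z "$cluster" ] || [ -z "$component" ]; then
    echo "usage: render_scale_out_ops <cluster> <component> [replicas]" >&2
    return 2
  fi
  cat <<EOF
apiVersion: operations.kubeblocks.io/v1alpha1
kind: OpsRequest
metadata:
  name: ${cluster}-scale-out
spec:
  clusterName: ${cluster}
  type: HorizontalScaling
  horizontalScaling:
  - componentName: ${component}
    scaleOut:
      replicaChanges: ${replicas}
EOF
}

# Example (placeholder names): review the output, then pipe it to kubectl.
#   render_scale_out_ops my-milvus querynode 1 | kubectl apply -n demo -f -
```

Rendering the manifest separately from applying it keeps the step reviewable: the YAML can be inspected or committed to version control before it reaches the cluster.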
The KubeBlocks Milvus Addon supports the following Milvus versions:
| Major Version | Supported Minor Versions |
|---|---|
| 2.3 | 2.3.2 |
To list the supported versions, run the following command:
kubectl get cmpv milvus
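If only the version strings are needed, for example in a script, the same resource can be queried with a JSONPath expression. The sketch below assumes that the ComponentVersion resource lists its releases under `.spec.releases[*].serviceVersion`; that field path and the helper name `list_milvus_service_versions` are assumptions, so check the actual layout with `kubectl get cmpv milvus -o yaml` first.

```shell
#!/usr/bin/env bash
# Sketch only: prints one supported Milvus service version per line.
# ASSUMPTION: versions are stored at .spec.releases[*].serviceVersion
# in the ComponentVersion resource; confirm with `kubectl get cmpv milvus -o yaml`.

# list_milvus_service_versions (hypothetical helper name)
list_milvus_service_versions() {
  # Fail early with a clear message when kubectl is unavailable.
  if ! command -v kubectl >/dev/null 2>&1; then
    echo "kubectl not found in PATH" >&2
    return 127
  fi
  # jsonpath prints the matches separated by spaces; tr puts each on its own line.
  kubectl get cmpv milvus \
    -o jsonpath='{.spec.releases[*].serviceVersion}' | tr ' ' '\n'
}

# Usage:
#   list_milvus_service_versions
```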