Requirements
The following requirements must be met to provision and operate clusters using the One-Click GPU PaaS Kubernetes deployment model.
Configuration Parameters (PaaS Admin)¶
PaaS Admins define the following in compute profiles:
- Kubernetes Version: Pre-selected in the profile
- Control Plane Node Configuration:
- CPU-only or mixed CPU/GPU setup
- Node count (e.g., 1, 3, or 5 for HA)
- Worker Node Configuration:
- GPU-enabled or CPU-only nodes
- Node SKU (CPU cores, memory, GPU type)
- Storage configuration
- CNI Plugin: Defaults to Calico; can be changed in the profile
- Networking & Security: Proxy settings, public IP allocation, firewall rules
- Blueprint Selection: Standard or GPU-enabled blueprint for workloads and monitoring
- Advanced Settings Exposure: Controls which fields are visible to end users
Input Parameters (End User)¶
End users provide only minimal inputs when launching a cluster:
- Cluster Name: Unique name for the new Kubernetes cluster
- Compute Profile: Select from pre-published shared profiles created by the PaaS Admin
- Worker Node Count: Number of worker nodes to provision
- (Optional) Cluster Labels: Add custom labels for workload grouping
- Deployment Action: Single-click Deploy initiates provisioning
All other settings (CPU/GPU type, storage, network, node sizing) are pre-configured in the compute profile by the PaaS Admin.
Infrastructure Requirements¶
- Compute Profiles: At least one published profile must be available to end users
- Rafay Inventory Integration: Bare metal or VM resources must exist and match profile specifications
- GPU Inventory: For GPU workloads, GPU-equipped nodes must be available
Operational Constraints¶
- Master node count and compute profile selection are immutable after cluster creation
- All nodes within a cluster must match the SKU defined in the selected compute profile
- Public IP assignments and dedicated control plane settings are fixed post-deployment