Requirements

Rafay automatically provisions the specified GPU infrastructure in the GPU Cloud's datacenter. The automation:

  • Provisions physical GPU servers or GPU-enabled VMs
  • Sets up networking, storage, security groups, and VPC isolation
  • Provisions a production-grade Kubernetes cluster (e.g., Rafay MKS) with control plane and worker nodes
  • Deploys and configures cluster add-ons, monitoring, logging, and observability components

The following combinations have been extensively validated. Newer versions of Run:AI and Kubernetes should also work unless noted otherwise.

Component                 Version
Run:AI Application        Run:AI v2.23 or higher
Kubernetes Distribution   Rafay Managed Kubernetes Service (MKS) with Kubernetes v1.33 or higher
Rafay Platform            v3.1-37 or higher
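
The minimum versions listed above can be checked before a deployment with a small preflight script. The following is a minimal sketch, not part of the Rafay or Run:AI tooling: the function names and the dictionary keys (`runai`, `kubernetes`, `rafay_platform`) are hypothetical, and it assumes that a Rafay Platform version such as v3.1-37 can be compared numerically component by component. How the installed versions are collected is left to the caller.

```python
import re

# Minimum validated versions, copied from the table above.
# The keys are hypothetical labels chosen for this sketch.
MINIMUMS = {
    "runai": "v2.23",
    "kubernetes": "v1.33",
    "rafay_platform": "v3.1-37",
}


def parse_version(text):
    """Extract the numeric components of a version string as a tuple of ints.

    'v1.33.2' becomes (1, 33, 2) and 'v3.1-37' becomes (3, 1, 37).
    Assumption: every number in the string is significant, so pre-release
    suffixes such as '-rc1' are NOT handled correctly by this sketch.
    """
    numbers = re.findall(r"\d+", text)
    if not numbers:
        raise ValueError(f"no numeric components in {text!r}")
    return tuple(int(n) for n in numbers)


def meets_minimum(installed, minimum):
    """Return True if the installed version is at least the minimum."""
    have = parse_version(installed)
    need = parse_version(minimum)
    # Pad the shorter tuple with zeros so 'v1.33' compares equal to 'v1.33.0'.
    width = max(len(have), len(need))
    have += (0,) * (width - len(have))
    need += (0,) * (width - len(need))
    return have >= need


def check_environment(installed):
    """Return the components that are missing or below the minimum version.

    `installed` maps the same keys as MINIMUMS to version strings.
    """
    failures = []
    for component, minimum in MINIMUMS.items():
        version = installed.get(component)
        if version is None or not meets_minimum(version, minimum):
            failures.append(component)
    return failures
```

An empty list from `check_environment` means every component meets the validated minimum; any returned name identifies a component to upgrade or report before proceeding.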

Important

Contact Rafay CS for access to the software artifacts for this SKU. Review the technical approach section to understand the details behind the automation.