Overview
Deliver Run:AI as a scalable, repeatable SKU for users allowing them to configure and deploy both the underlying infrastructure resoures and the Run:AI application with just a few clicks.
Deliver Run:AI as a self-service SKU. Users will get access to a fully configured Run:AI environment—complete with GPU infrastructure, a Kubernetes cluster, necessary k8s operators, and a ready-to-use Run:AI tenant—all deployed automatically.
Rationale¶
For GPU Clouds, SKU-based managed services offer tremendous benefits:
- Predictable, standardized offerings for customers
- Reduced complexity, since the SKU hides all underlying infrastructure
- Faster onboarding, enabling customers to begin using Run:AI in minutes
- Higher margins, by offering value-added services instead of raw compute
- Scalability, allowing dozens or hundreds of customers/tenants to onboard seamlessly
Rafay helps transform Run:AI from a manually deployed application and infrastructure into a self-service SKU that GPU Cloud providers can expose to customers with confidence. By automating everything—from provisioning GPU infrastructure to tenant creation to cluster onboarding—Rafay ensures that customers can begin using Run:AI within minutes of selecting a SKU.
For customers, it means instant access to Run:AI. For cloud operators, this means:
- Higher operational efficiency
- Scalable onboarding of new customers
- Stronger differentiation in the GPU Cloud market
- A future-proof platform for expanding GPU-accelerated services
Turning Run:AI into a SKU transforms it from a complex integration into a consumption-ready product.
The experience begins in the GPU Cloud provider’s marketplace or self-service portal. Customers simply choose the Run:AI SKU, which can come in variants such as:
| SKU | Description | Cost |
|---|---|---|
| 1 | Run:AI Small — 4 GPUs (e.g., L40S or A100) | $1/hr |
| 2 | Run:AI Medium — 8 GPUs (e.g., H100) | $3/hr |
| 3 | Run:AI Large - 2× H100 nodes | $5/hr |
Each SKU can be pre-defined by the cloud provider. Rafay will orchestrate required infrastucture, deploying and configuring required sofware etc. An illustrative example is shown below.

