
AI/ML and GenAI

Core Platform

The Core Platform delivers the essential building blocks for constructing a production-grade GPU cloud, including dynamic GPU partitioning, multi-tenant isolation, policy enforcement, and automated lifecycle management. These capabilities enable reliable and cost-efficient execution of AI/ML and GenAI workloads across diverse infrastructure environments.


AI Infrastructure SKUs

Rafay’s AI infrastructure SKUs deliver turnkey, production-grade compute environments, ranging from bare metal servers and GPU-enabled VMs to Kubernetes-based clusters and serverless execution frameworks. Each SKU is optimized for multi-tenancy, automated lifecycle management, and high-performance AI/ML workloads, so teams can build and scale modern AI platforms.


AIML App SKUs

Rafay’s AIML App SKUs provide fully managed, GPU-accelerated environments for interactive development, distributed training, automated MLOps pipelines, and real-time inference. Each SKU is optimized for multi-tenancy, lifecycle automation, and operational consistency, enabling organizations to streamline the entire AI/ML model lifecycle from experimentation to production.


App Marketplace

The App Marketplace enables service providers to publish open-source or custom applications that can be consumed by end users across hundreds of tenants, with all deployments running on the provider’s managed infrastructure. This model gives every tenant a one-click, self-service experience while ensuring consistency, security, and operational efficiency.

  • Overview


  • Kubernetes Apps


    Prepackaged or custom applications deployable on Kubernetes clusters using manifests, Helm charts, or other Kubernetes-native formats.

  • Docker Apps


    Containerized applications deployable via Docker images from Docker Hub or private registries, enabling fast and portable deployments.