Get Started

Cloud Native Automation (CNA)

  • Environment Management


    Define, version, and manage environment blueprints—ensuring consistency, governance, and automation for rapid and reliable environment provisioning on any infrastructure.

    Get Started

  • Kubernetes Management


    Provision and manage a fleet of Kubernetes clusters with ease using Rafay's Kubernetes Management capabilities.

    By Cluster Type | By Capability

  • End User Self Service


    Encapsulate your custom applications and deliver them as a 1-click end-user experience using Rafay's PaaS.

    Get Started


AI/ML Tooling

  • Jupyter Notebooks


    Provide your data scientists and researchers with seamless access to Jupyter Notebooks, enabling rapid experimentation.

    Get Started

  • Serverless Inference (Hourly Metering)


    Provide users with access to on-demand, serverless inference powered by open-source LLMs with hourly metering.

    Get Started

  • Token Factory


    Configure and use Rafay Token Factory for secure token creation and lifecycle management. Learn the essential setup steps, key workflows, and best practices to generate, manage, and distribute tokens for multiple tenants.

    Part-1 | Part-2 | Part-3

  • Training (Ray)


    Provide users with access to an on-demand MLOps training environment powered by Ray.

    Get Started

  • MLOps (Kubeflow)


    Provide users with access to a managed MLOps platform based on Kubeflow, pre-integrated with MLflow and other tools.

    Get Started


AI Infrastructure

  • Virtual Machines (VM)


    Provide users with self-service access to Virtual Machines (VMs) with one or more GPUs.

  • Bare Metal Servers


    Provide users with self-service access to Bare Metal Servers with one or more GPUs.

  • Kubernetes Clusters


    Provide users with self-service access to Managed Kubernetes Clusters with one or more GPUs.

  • Virtual Kubernetes Clusters


    Provide users with self-service access to Virtual Kubernetes Clusters with one or more GPUs.

  • Developer Pods


    Provide users with serverless compute options for flexible and ephemeral workloads.

    Basics | Fractional GPU

  • SLURM on Bare Metal


    Provide users with access to a high-performance SLURM cluster deployed on Bare Metal Servers — ideal for traditional HPC and tightly coupled AI/ML workloads.