Overview

A MachineHealthCheck is a Cluster API resource that enables users to establish criteria for identifying when Machines within a Cluster are in an unhealthy state. It is created in the context of a management cluster and applies to a specific workload cluster.

When configuring a MachineHealthCheck, users specify the conditions to watch on each Machine's Node and a timeout for each condition. If any of these conditions persists for longer than its timeout, the affected Machine is remediated. By default, remediation creates a new Machine to replace the unhealthy one.
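The timeout rule described above can be illustrated with a minimal Python sketch. It is a simplified model of the decision only, not the Cluster API controller's actual code: the class names `UnhealthyCondition` and `NodeCondition` and the function `needs_remediation` are hypothetical, and the sketch assumes each Node condition records the time it last changed status.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass(frozen=True)
class UnhealthyCondition:
    """A health-check rule (hypothetical model): condition type, status, and timeout."""
    type: str
    status: str
    timeout: timedelta


@dataclass(frozen=True)
class NodeCondition:
    """A condition observed on a Node (assumed to carry its last transition time)."""
    type: str
    status: str
    last_transition_time: datetime


def needs_remediation(node_conditions, unhealthy_conditions, now):
    """Return True if any rule matches a Node condition that has
    held its status for at least the rule's timeout."""
    for rule in unhealthy_conditions:
        for cond in node_conditions:
            matches = cond.type == rule.type and cond.status == rule.status
            if matches and now - cond.last_transition_time >= rule.timeout:
                return True
    return False
```

For example, with a rule of type `Ready`, status `Unknown`, and a 300-second timeout, a Node whose `Ready` condition has been `Unknown` for 300 seconds or more would be flagged, while one that has been `Unknown` for a shorter period would not.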

Automation Matrix

The matrix below shows which actions (create, update, and delete) are supported for a MachineHealthCheck across the available deployment methods: the interactive UI, declarative RCTL commands (CLI), API-driven automation, and Terraform.

Action    UI     CLI    API    Terraform
Create    Yes    Yes    Yes    No
Update    Yes    Yes    Yes    No
Delete    Yes    Yes    Yes    No