Kured is a Kubernetes DaemonSet that performs safe automatic node reboots when the package management system of the underlying OS indicates that one is needed.
- Watches for the presence of a reboot sentinel file, e.g. /var/run/reboot-required, or the successful run of a sentinel command
- Utilizes a lock in the API server to ensure only one node reboots at a time
- Optionally defers reboots in the presence of active Prometheus alerts or selected pods
- Cordons & drains worker nodes before reboot, uncordoning them after
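
The behaviour listed above can be pictured as a small decision loop. The following is only an illustrative sketch, not Kured's actual implementation: the names `reboot_required`, `InMemoryLock`, `reboot_step` and `after_reboot` are hypothetical, the lock is an in-memory stand-in for the one held in the API server, the cordon, drain and reboot actions are recorded as log entries rather than performed, and the ordering of the checks is an assumption.

```python
import os
import subprocess
from dataclasses import dataclass
from typing import Callable, List, Optional


def reboot_required(sentinel_file: str = "/var/run/reboot-required",
                    sentinel_command: Optional[List[str]] = None) -> bool:
    """Return True if a reboot is indicated.

    Assumption: when a sentinel command is given it takes precedence over the
    file, and a zero exit status means "reboot needed".
    """
    if sentinel_command is not None:
        return subprocess.run(sentinel_command).returncode == 0
    return os.path.exists(sentinel_file)


@dataclass
class InMemoryLock:
    """Hypothetical stand-in for the cluster-wide lock kept in the API server."""
    holder: Optional[str] = None

    def acquire(self, node: str) -> bool:
        # Succeeds if the lock is free or already held by this node.
        if self.holder in (None, node):
            self.holder = node
            return True
        return False

    def release(self, node: str) -> None:
        if self.holder == node:
            self.holder = None


def reboot_step(node: str, lock: InMemoryLock,
                needs_reboot: Callable[[], bool],
                blocked: Callable[[], bool],
                log: List[str]) -> str:
    """Run one pass of the loop for `node` and return the action taken."""
    if not needs_reboot():
        return "no-reboot-needed"
    if blocked():
        # e.g. an active Prometheus alert or a selected pod is present
        return "deferred"
    if not lock.acquire(node):
        # another node is rebooting; try again on the next pass
        return "lock-held-elsewhere"
    log.append(f"cordon+drain {node}")
    log.append(f"reboot {node}")
    return "rebooting"


def after_reboot(node: str, lock: InMemoryLock, log: List[str]) -> None:
    """Once the node is back, uncordon it and release the lock if it holds it."""
    if lock.holder == node:
        log.append(f"uncordon {node}")
        lock.release(node)


if __name__ == "__main__":
    lock, log = InMemoryLock(), []
    print(reboot_step("node-a", lock, lambda: True, lambda: False, log))
    print(reboot_step("node-b", lock, lambda: True, lambda: False, log))
    after_reboot("node-a", lock, log)
    print(log)
```

Because the lock is released only after the node has been uncordoned, a second node that also needs a reboot keeps waiting until the first one is schedulable again, which is the property the lock is meant to provide.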