Session

Zero-Downtime Control Plane Replacement: A Kubernetes Survival Guide

Sometimes Control plane node replacement may be necessary. These situations include security incidents and compromises, compliance and regulatory requirements, infrastructure modernization, and operating system and platform updates. A significant concern is ensuring the uninterrupted operation of stateful set workloads.Replacing control plane nodes in a Kubernetes cluster presents several challenges like API server or complete cluster unavailability, risking etcd data corruption without reliable backups. Network and load balancer reconfigurations, careful timing for rolling updates, and rollback plans are also necessary. This talk details a proven approach for replacing ALL control plane nodes. We will also demonstrate how automation can be leveraged to streamline the process, enhance robustness, and guarantee zero downtime. We will provide a detailed walkthrough of how to accomplish these essential steps using a combination of shell scripts and Ansible playbooks.

Nikhil Pathak

Senior Platform Engineer, DigitalOcean

Actions

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Jump to top