Speaker

Michael Shen

Senior Site Reliability Engineer @ Red Hat

Michael Shen is passionate about fostering continuous learning organizations, especially as it relates to DevOps culture. He's an active Kubernetes contributor in the Kubebuilder and Cluster-API projects and currently works as a Senior Site Reliability Engineer at Red Hat.

SLOwly Burning Out - Avoiding Common Pitfalls When Setting SLOs

As systems reach production, engineering teams holding a pager can focus on the value those systems provide to customers by setting relevant SLOs and responding to alerts. However, as systems change over time, they may gain more points of failure or grow in complexity, or customers may simply use them differently.

If SLOs aren't kept up-to-date, teams can find themselves responding to more and more alerts that are increasingly hidden from customers. Even the best teams can find themselves firefighting toilsome alerts, without time to improve the system as a whole.

In this talk, based on a true story, you will learn about pitfalls encountered when setting SLOs and how they directly affected both the day-to-day developer experience of engineers and the systems being worked on at Red Hat. We'll also discuss how avoiding and climbing out of these pitfalls can bring about a better understanding of the system and reduce burnout.
