Session

Site Reliability Engineering is a lasting journey, not a finalized roadmap!

SRE is an industry wide, value-based approach for product reliability. In this talk, we'll go over the SRE journey for Cloud IAM products and how we measure their impact on other Cerner products. Using Newrelic as our guiding compass, associates who have an interest in their product's reliability can take our lessons learned whether they are just starting their journey or have fully adopted SRE implementations.

We'll discuss what mindset is needed to embark on this journey and what we need to take along: Do we have to wait for site reliable engineers to make products reliable? Is Newrelic enough on its own? What kinds of tooling and automation will it take to get started? Next we'll discuss common patterns for those on their first steps towards measuring and improving reliability: What the heck is an Apdex and why is it so important for your product? How do deployment markers make the business better aware of changes? Is there a difference between DevOps and SRE? How do we make COEs (correction of events) a consistent part of the journey? And what are the risks to falling off of the path? Finally, we'll discuss what a mature SRE implementation will look like at Cerner: How can we establish SLOs and Error Budgets? Can we detect failures before end user impact? Wherever you are on your journey, we hope to equip you as you set out on your adventures!

Kyle Lipke

Senior System Engineer

Actions

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Jump to top