Session
Leading with Reliability: Applying SRE Principles to Build Stronger Engineering Organizations
Service Reliability Engineering (SRE) has long been the discipline responsible for keeping complex systems healthy, resilient, and predictable under pressure. But the real power of SRE lies not just in the tools, dashboards, or operational frameworks—it lies in its philosophy: focusing on what matters most, measuring the right things, and making intentional trade-offs.
As engineering leaders, we can apply these principles far beyond production environments. This talk explores how core SRE concepts can become high-leverage leadership tools for shaping team culture, guiding prioritization, and driving meaningful business outcomes.
We begin with service criticality, expanding the traditional technical lens to view the entire end-to-end customer journey. Instead of assessing components in isolation, we’ll explore how to map dependencies across teams and systems to surface the true bottlenecks and organizational weak points that impact users.
From there, we’ll look at Service-Level Indicators (SLIs) and reinterpret them at the business level. What does “reliability” mean when framed through customer expectations rather than CPU metrics? How can engineering leaders define measurable signals that reflect whether the product is delivering on its intended value?
Next, we’ll dig into Service-Level Objectives (SLOs)—not as uptime percentages, but as promises to customers. We'll discuss how leaders can craft SLOs that articulate what “good enough” looks like for the business, and how these objectives guide healthier conversations around trade-offs, investment, and risk.
Finally, we’ll explore error budgets as a strategic leadership mechanism. Error budgets offer a structured way to balance innovation and stability, negotiate between delivery teams and product, and make aligned decisions about when to push forward and when to fix foundational issues.
Attendees will leave with a toolkit for adopting SRE thinking at the organizational level—helping them connect engineering decisions to business impact, create a culture of reliability, and lead teams that deliver value with clarity and confidence.
Site Reliability Engineering offers powerful frameworks for managing system health—but these ideas don’t have to stay confined to production. This talk shows engineering leaders how to translate SRE concepts such as service criticality, SLIs/SLOs, and error budgets into organizational tools that improve decision-making, clarify priorities, and strengthen alignment between engineering and the business.
Maxim Schepelin
Engineering leader at Booking.com
Amsterdam, The Netherlands
Links
Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.
Jump to top