Session

Architecting the Error Budget: How Senior Leadership Teams (SLTs) Use SRE Metrics to Balance Innovation and Risk

You will learn why the Error Budget is the financial and operational expression of resilience, and how to use it to make data-driven decisions about reliability, feature velocity, and risk tolerance across your product portfolio.

Key Takeaways for Leaders:
The Cost of Perfection: Understand why chasing 100% availability is economically irrational and how to define a Maximum Tolerable Downtime (MTD) that aligns with customer willingness to pay and competitive market standards.

The Error Budget as Capital: Learn to view the Error Budget as the shared currency between Development (velocity) and Operations (stability). We will cover how to govern this budget to enforce a disciplined trade-off between shipping new features and improving resilience.

SLOs for Business Alignment: Discover how to establish user-centric Service Level Objectives (SLOs) that directly map to your organization's business Key Performance Indicators (KPIs), such as customer retention, conversion rates, and revenue.

Driving Proactive Investment: Use Error Budget consumption metrics to proactively justify and prioritize investment in reliability work (like automation, testing, and Chaos Engineering) before system failures force a costly, reactive halt to feature development.
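
To make the takeaways above concrete, here is a minimal sketch of the arithmetic behind an Error Budget. It assumes a time-based availability SLO measured over a rolling window. The names `ErrorBudget` and `release_policy`, and the 75% and 100% thresholds, are hypothetical choices for illustration and are not taken from the session itself.

```python
from dataclasses import dataclass

MINUTES_PER_DAY = 24 * 60


@dataclass(frozen=True)
class ErrorBudget:
    """Time-based error budget for an availability SLO (illustrative only)."""

    slo_target: float  # e.g. 0.999 for "three nines"
    window_days: int   # e.g. 30 for a rolling 30-day window

    def __post_init__(self) -> None:
        # A 100% target leaves no budget at all, so it is rejected here.
        if not 0.0 < self.slo_target < 1.0:
            raise ValueError("slo_target must be strictly between 0 and 1")
        if self.window_days <= 0:
            raise ValueError("window_days must be positive")

    @property
    def allowed_downtime_minutes(self) -> float:
        # Budget = (1 - SLO) * length of the window.
        return (1.0 - self.slo_target) * self.window_days * MINUTES_PER_DAY

    def consumed_fraction(self, downtime_minutes: float) -> float:
        # Share of the budget already spent in this window.
        return downtime_minutes / self.allowed_downtime_minutes


def release_policy(
    budget: ErrorBudget,
    downtime_minutes: float,
    caution_threshold: float = 0.75,  # assumed threshold, tune per product
    freeze_threshold: float = 1.0,    # assumed: budget fully spent
) -> str:
    """Map budget consumption to a hypothetical governance decision."""
    spent = budget.consumed_fraction(downtime_minutes)
    if spent >= freeze_threshold:
        return "freeze"   # pause feature releases, prioritize reliability work
    if spent >= caution_threshold:
        return "caution"  # slow down, review risky launches
    return "ship"         # budget is healthy, keep shipping features


if __name__ == "__main__":
    budget = ErrorBudget(slo_target=0.999, window_days=30)
    print(f"Allowed downtime: {budget.allowed_downtime_minutes:.1f} minutes")
    print(release_policy(budget, downtime_minutes=40.0))
```

Under these assumptions, a 99.9% target over 30 days allows roughly 43.2 minutes of downtime. Rejecting a 100% target in the constructor mirrors the first takeaway: with no budget to spend, every release becomes an unacceptable risk.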

This session will equip you with the managerial vocabulary and framework needed to lead SRE adoption and embed organizational resilience into your quarterly planning and resource allocation cycles.

Venkata Srinivas Kantamneni

Richmond, Virginia, United States
