Failure is Inevitable: Engineering for Resilience in Distributed Systems
In the cloud-native world, we often treat "uptime" as the ultimate goal. But in distributed systems, failure is not just a possibility; it is an inevitability. These systems are inherently complex, and their components will fail. The real challenge is not preventing every single failure but building applications that withstand and recover from failures gracefully, moving beyond mere uptime to true resilience.
This session is tailored for developers and architects keen on understanding how to bake resilience directly into their applications from the very first line of code. We'll explore practical patterns and design philosophies that move beyond basic error handling, helping you create robust systems that not only survive chaos but thrive in it.
We'll dive into technical strategies and demonstrate how applying specific patterns can significantly improve your application's fault tolerance and graceful degradation. This isn't just about operations; it's about empowering developers to design for the messy reality of production.
Key Takeaways for Attendees:
Understand Resilience Patterns: Explore common distributed system resilience patterns like Circuit Breakers, Bulkheads, Retries with Exponential Backoff, and Throttling, and when to apply them.
Code for Failure Scenarios: Learn practical coding techniques and library choices (e.g., Polly for .NET, Resilience4j for Java, Go's context package) to implement these patterns effectively.
Graceful Degradation Strategies: Discover how to design your application to shed non-essential load or provide reduced functionality during partial outages, maintaining core user experience.
Test for Resilience (Chaos Engineering Light): Get an introduction to developer-friendly approaches to testing resilience, including basic chaos engineering principles you can apply without a dedicated team.
Observability for Resilience: Learn how robust observability (metrics, logging, tracing) becomes the cornerstone for verifying and understanding your application's resilient behavior in real time.
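As a rough illustration of two of the patterns named above, here is a minimal Python sketch of a circuit breaker combined with retries using exponential backoff and full jitter. It is not taken from Polly, Resilience4j, or any other library mentioned in this abstract; the names CircuitBreaker, CircuitOpenError, backoff_delays, and retry, along with every default value, are assumptions made for illustration only.

```python
import random
import time


class CircuitOpenError(Exception):
    """Raised when the breaker rejects a call without attempting it."""


class CircuitBreaker:
    """Illustrative breaker: opens after N consecutive failures."""

    def __init__(self, failure_threshold=3, reset_timeout=30.0,
                 clock=time.monotonic):
        self.failure_threshold = failure_threshold  # assumed default
        self.reset_timeout = reset_timeout          # seconds, assumed default
        self.clock = clock                          # injectable for tests
        self.failures = 0
        self.opened_at = None  # None means the circuit is closed

    def call(self, fn):
        if self.opened_at is not None:
            if self.clock() - self.opened_at < self.reset_timeout:
                raise CircuitOpenError("circuit open, failing fast")
            # Timeout elapsed: half-open, so let one trial call through.
        try:
            result = fn()
        except Exception:
            self.failures += 1
            half_open = self.opened_at is not None
            if half_open or self.failures >= self.failure_threshold:
                self.opened_at = self.clock()  # open (or re-open) the circuit
            raise
        # Success closes the circuit and resets the failure count.
        self.failures = 0
        self.opened_at = None
        return result


def backoff_delays(attempts, base=0.1, cap=5.0):
    """Upper bound of the sleep before each retry: base * 2**n, capped."""
    return [min(cap, base * 2 ** n) for n in range(attempts - 1)]


def retry(fn, attempts=4, base=0.1, cap=5.0,
          sleep=time.sleep, rng=random.random):
    """Call fn up to `attempts` times, sleeping with full jitter between tries."""
    delays = backoff_delays(attempts, base, cap)
    for attempt in range(attempts):
        try:
            return fn()
        except CircuitOpenError:
            raise  # do not keep hammering a dependency the breaker gave up on
        except Exception:
            if attempt == attempts - 1:
                raise  # out of attempts: surface the last error
            sleep(rng() * delays[attempt])  # random wait in [0, delay)
```

One way to combine the two is retry(lambda: breaker.call(fetch)), where fetch is a hypothetical function wrapping a remote call: transient errors are retried with growing, randomized delays, while an open circuit fails fast instead of adding load to a struggling dependency.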
Join us to learn how to build applications that are not just functional but truly antifragile, ready to face the inevitable storms of cloud-native production environments.