Session

Self-Healing Systems: The Delicate Balance Between Resilience, Availability and Cost

The core platform systems that support mission-critical applications are expected to be highly available and resilient 24/7/365. These systems are built with self-healing capabilities to automatically detect and recover from failures with minimal human intervention. These systems aim to maintain seamless service delivery even when components fail.
In this talk, we’ll share insights, experiences, and innovative solutions for striking a delicate balance between resilience, availability, and cost. Attendees will gain insights into often-overlooked trade-offs involved in building self-healing architectures, from over-provisioning and redundancy to observability and failover strategies.
Key Takeaways:
• Understand the design triangle of self-healing systems for sustainable balance of availability and cost
• Understand the practical trade-offs between resilience and cost in self-healing system design.
• Architecting smarter systems that recover gracefully without burning through your budget
• Building cost-aware healing strategies that degrade gracefully

This talk is inspired by the challenges we faced when architecting and running distributed platforms that demanded self-healing capabilities. Designing and operating platforms that require self-healing capability requires careful assessment of resilience, availability, and cost considerations for a balanced approach, taking into account tradeoffs and risks.

Aman Sardana

Discover Financial Services, Expert Application Architect

Chicago, Illinois, United States

Actions

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Jump to top