Session
Fault Tree Analysis Applied to Apache Flink
As Flink's adoption grows, more developers ask our small Flink infrastructure team whether their applications will meet specific reliability guarantees. For example, can Flink keep data freshness under 5 minutes?
This session shows how fault tree analysis, a long-established reliability technique, can guide Flink platform and application developers who want to tune and monitor their Flink-based solutions without over-promising and under-delivering to their users.
We present a calculator and a step-by-step guide that we developed to show what can be tuned to improve Flink application reliability. Throughout the session, we visualize failure probabilities by growing a fault tree, systematically exposing Flink's strengths and weaknesses.
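To give a rough sense of the arithmetic behind a fault tree, here is a minimal Python sketch. It is not the calculator presented in the session. The event names and probabilities are hypothetical placeholders, and the sketch assumes all basic events are statistically independent, which real Flink failure modes may not be. An AND gate multiplies the failure probabilities of its children. An OR gate takes the complement of all children surviving.

```python
from dataclasses import dataclass, field
from typing import List, Union


@dataclass
class BasicEvent:
    """A leaf of the tree: one failure mode with an estimated probability."""
    name: str
    probability: float


@dataclass
class Gate:
    """An internal node that combines child failures with AND or OR logic."""
    name: str
    kind: str  # "AND" or "OR"
    children: List[Union["Gate", BasicEvent]] = field(default_factory=list)


def failure_probability(node: Union[Gate, BasicEvent]) -> float:
    """Probability that `node` occurs, assuming independent basic events."""
    if isinstance(node, BasicEvent):
        return node.probability
    child_probs = [failure_probability(child) for child in node.children]
    if node.kind == "AND":
        # All children must fail together.
        result = 1.0
        for p in child_probs:
            result *= p
        return result
    if node.kind == "OR":
        # At least one child fails: 1 minus the chance that none fail.
        none_fail = 1.0
        for p in child_probs:
            none_fail *= 1.0 - p
        return 1.0 - none_fail
    raise ValueError(f"unknown gate kind: {node.kind!r}")


# Hypothetical tree for "data freshness exceeds 5 minutes".
# All numbers are illustrative, not measurements.
tree = Gate("freshness_sla_missed", "OR", [
    BasicEvent("source_outage", 0.01),
    Gate("crash_with_slow_recovery", "AND", [
        BasicEvent("task_manager_crash", 0.05),
        BasicEvent("recovery_exceeds_5_min", 0.2),
    ]),
])

p_top = failure_probability(tree)
print(f"P(freshness SLA missed) = {p_top:.4f}")
```

Changing a single leaf, for example lowering the hypothetical `recovery_exceeds_5_min` estimate, and recomputing the top event shows which tuning knob moves the overall probability the most.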

Andrey Falko
Lyft, Staff Software Engineer
Oakland, California, United States