Session

Latency Forensics: Uncovering Hidden P99 Bottlenecks in Kubernetes

Modern systems rarely fail at average latency — they fail in the long tail.

Applications that appear healthy in staging environments often experience severe P99 degradation in production due to noisy neighbors, burst amplification, CPU throttling, cache invalidation storms, thread contention, and distributed coordination delays.

This session explores a “Containerized Time Travel” approach for reproducing production-like latency behavior inside Kubernetes environments before deployment.

Modern distributed systems rarely fail because of average latency.

They fail because hidden bottlenecks silently amplify tail latency under realistic production conditions.

Applications that appear healthy in staging environments often experience severe P99 degradation in production due to CPU throttling, noisy neighbors, retry storms, queue amplification, cache invalidation cascades, and distributed coordination delays.

This session explores a “Latency Forensics” approach for uncovering hidden performance bottlenecks inside Kubernetes environments using replay-driven performance engineering and observability correlation techniques.

We will examine how realistic workload reconstruction helps teams:

reproduce production-like request behavior
expose hidden latency amplifiers
correlate infrastructure and application bottlenecks
benchmark workloads under realistic contention scenarios
uncover distributed-system side effects invisible in synthetic tests

Sravanthi Naga

Senior Engineering Manager - Pega Systems

Hyderābād, India

Actions

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Jump to top