When Logging Becomes The Outage: Escaping the ECS Logging Trap

Logging systems are typically treated as passive observers of production workloads, but in containerized environments a logging pipeline can quietly become a critical dependency. In one of our large ECS environments, a downstream logging disruption created a cascading reliability risk: because workloads ran their log drivers in blocking mode, application containers began stalling once the log buffers filled, turning the observability pipeline into a potential outage trigger.

This session walks through the real reliability problem, the investigation process, and the architectural changes that followed. We’ll explore how blocking logging modes interact with downstream failures, why this configuration can introduce hidden reliability risks, and how switching to non-blocking logging changes system behavior during logging outages.

The talk will cover practical strategies for building resilient logging pipelines in ECS environments, including buffer management, failure isolation, and protecting application workloads from observability dependencies. Attendees will walk away with a better understanding of how to design logging architectures that support reliability rather than accidentally becoming a source of downtime.

Rahul Tanniru

Senior Vice President of Software Engineering, JPMorgan Chase

Dallas, Texas, United States
