Session

AI SRE: Building Incident-Response Agents That Start the RCA Before You Do

PagerDuty goes off. Before a human fully opens the laptop, an AI SRE agent can already be pulling telemetry, checking dashboards, correlating logs, and drafting an incident summary. This talk shows how to design an AI incident-response workflow that integrates with tools like PagerDuty, Datadog, and New Relic to accelerate triage without bypassing safety.

What would be covered:
• event trigger from alert to investigation
• gathering evidence from observability systems
• forming a first-pass RCA hypothesis
• drafting timelines and incident summaries
• keeping humans in the approval loop

Ishan Shah

PayPal, Software Engineer | Distributed Systems, AI, and Platform Engineering

San Francisco, California, United States

Actions

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Jump to top