Session
AI SRE: Building Incident-Response Agents That Start the RCA Before You Do
PagerDuty goes off. Before a human fully opens the laptop, an AI SRE agent can already be pulling telemetry, checking dashboards, correlating logs, and drafting an incident summary. This talk shows how to design an AI incident-response workflow that integrates with tools like PagerDuty, Datadog, and New Relic to accelerate triage without bypassing safety.
What would be covered:
• event trigger from alert to investigation
• gathering evidence from observability systems
• forming a first-pass RCA hypothesis
• drafting timelines and incident summaries
• keeping humans in the approval loop
Ishan Shah
PayPal, Software Engineer | Distributed Systems, AI, and Platform Engineering
San Francisco, California, United States
Links
Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.
Jump to top