Session
Building resilient DevOps pipelines with AI for scalable cloud reliability
In this session, we will explore how DevOps practices are enhanced by multi-agent LLM systems to automate and optimize incident triage in cloud environments. Drawing from my work at Microsoft Research, I’ll share how I implemented a multi-agent framework that reduced Mean Time to Mitigation (MTTM) by 35% in large-scale cloud services. By leveraging semantic routing, adaptive SLAs, and context-aware prioritization, we transformed traditional manual workflows into dynamic, AI-powered systems. This session will focus on the intersection of DevOps, incident management, and machine learning, showing how AI-driven automation can streamline cloud reliability operations, enhance system performance, and minimize downtime. Attendees will gain practical insights on how to integrate AI-based solutions into their DevOps pipelines to ensure faster incident resolution and improve overall system scalability and resilience.

Salma Shaik
Research Software Engineering @ Microsoft AI
Bengaluru, India
Links
Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.
Jump to top