Session

Stop Grepping, Start Reasoning: Skill-Based Agentic SRE 🐉

Let’s be honest: debugging production at scale is soul-crushing manual labor.
At Google, we decided making thousands of SREs act like human search engines in 2026 was a bug, not a feature. Enter the Agentic SRE Extension—a skill-based framework that lives in your terminal and actually knows its way around Kubernetes.
🚀 In this high-energy deep dive, I’m showing you exactly how we use modular skills and MCP to chain diagnostic tools, analyze metric regressions, and execute safe mitigations (Rollbacks, Throttling) without the "deleted production" anxiety.
🔥 We’ll dissect the "Outage Investigator" agent's logic loop, see it draft a technical postmortem in seconds, and discuss why we built this as a portable framework whose skills can be easily leveraged by any modern AI harness. You’ll leave with the code to wire up your own Kubernetes stack and let the agent's skills do the heavy lifting while you drink espresso.
No fluff, no 101s. Just AI agents, specialized skills, and less toil.


Target Audience: SREs, DevOps Practitioners, Platform Engineers, and Developers interested in AI-assisted operations.

Preferred Session Duration: 30 minutes (40m if I can do a demo).
Technical Requirements: Internet connection to execute a live terminal-based outage investigation demo on a cloud Kubernetes cluster.
First Public Delivery: This will be the first public deep-dive into the skill-based agentic framework methodology and live demonstration of the "Outage Investigator" logic loop.

Riccardo Carlesso

Developer Advocate, google cloud

Zürich, Switzerland

Actions

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Jump to top