Speaker

Mihai Zaharia

Mihai Zaharia

Principal Cloud Developer

Actions

Mihai Zaharia – Principal Cloud Developer, Orange Services
Designs and automates large-scale telco cloud platforms based on Kubernetes and OpenStack.
As maintainer of the open-source Sylva Project, he participates in the development of cloud-native observability and automation stacks used in telco environments.
His work spans Kubernetes operators, GitOps-driven deployments (FluxCD, ClusterAPI), and secure multi-tenant logging systems — now evolving toward AI-powered root cause analysis.

Teaching Machines to Debug Machines: Building an AI-Powered RCA Agent for Sylva-Core

In modern cloud-native environments, operators face hundreds of alerts daily from Kubernetes clusters, network functions, and infrastructure components. Traditional approaches require manual investigation, correlation across multiple tools, and, let's face it, tribal knowledge that doesn't quite scale well.

This talk focuses on our journey building an RCA agent that autonomously investigates and tries to get the root cause of alerts in a Kubernetes-based telco cloud environment.
As the base infrastructure we used `sylva-core` to deploy our k8s clusters and manage the monitoring stack.

We'll go through the steps that we took to build this , focusing on:
- Choosing the right framework .
- Prompt engineering and design patterns.
- Model Context Protocol
- Memory systems

We'll walk through real examples of the agent investigating Kubernetes pod failures, correlating them with similar past incidents, and providing actionable RCA without human intervention.

Mihai Zaharia

Principal Cloud Developer

Actions

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Jump to top