Hajed Khlifi
Principal solutions architect- Cloud & AI
Luxembourg
Actions
Solutions Architect specializes in the management of large-scale systems and the development of cutting-edge solutions based on platform engineering , Infrastructure and operations best practices. My work is focused on cloud-native architectures , Disconnected cloud and lately I am mainly working on AI Infrastructure and operations. I am an international speaker, researcher and also an active contributor to the Cloud Native Computing Foundation (CNCF) and Nvidia AI institute community.
Area of Expertise
Topics
GPU is not Monolithic : Packing LLMs with MIGs on Kubernetes
Most of the LLM workloads now are deployed on Kubernetes clusters with GPU nodes and let's be honest this is the most expensive resource in the cluster. Currently, using GPUs in passthrough mode locks a single model to an entire GPU, leading to severe underutilization (~30%). In this talk I will explain how to manage GPU resources in an efficient way and attendees will understand how GPU cards are configured in a Kubernetes cluster, what is the difference between the three main Nvidia GPU installation modes: Passthrough, vGPU and MIG and how everything works behind the scene. I will demonstrate how Multi instance GPUs are the best solution for packing LLMs on Kubernetes and how it should be used in an advanced case scenario like packing multiple LLMs in the same cluster sharing the same GPUs without causing the noisy neighbor problem.
Breaking Barriers with Dapr: Simplified and Portable Cloud-Native Architectures
Microservices are often built with different technologies, leading to complexity in management and integration. This session demonstrates how Dapr simplifies multi-stack microservices across cloud environments. We will explore an e-commerce application built using Golang, Java, Python, and Vue.js, that we are going to deploy live (1) on premises (Docker-Compose), then on (2) AWS, then (3) Azure and finally orchestrated in a (4) multi-cloud setup. Attendees will see how Dapr’s features (service invocation, state management, pub/sub) abstract complexities, enable easy development and migration across environments. This session highlights how Dapr’s "Lift and Shift" approach facilitates seamless cloud transitions without re-architecture, making it an ideal solution for modern, multi-stack microservices.
Container Days London Sessionize Event Upcoming
ContainerDays Conference 2025 Sessionize Event
KCD New York 2025 Sessionize Event
Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.
Jump to top