Speaker

Hajed Khlifi

Hajed Khlifi

Principal solutions architect- Cloud & AI

Luxembourg

Actions

Solutions Architect specializes in the management of large-scale systems and the development of cutting-edge solutions based on platform engineering , Infrastructure and operations best practices. My work is focused on cloud-native architectures , Disconnected cloud and lately I am mainly working on AI Infrastructure and operations. I am an international speaker, researcher and also an active contributor to the Cloud Native Computing Foundation (CNCF) and Nvidia AI institute community.

Area of Expertise

  • Information & Communications Technology

Topics

  • Cloud & DevOps
  • Cloud Architecture
  • Cloud Native
  • Kubernetes
  • CICD Pipeline
  • Microservices Architectures
  • Serverless
  • Docker
  • Dapr
  • AI
  • LLMOps
  • GPU
  • Nvidia vGPU
  • Nvidia MIG
  • Red hat OpenShift

GPU is not Monolithic : Packing LLMs with MIGs on Kubernetes

Most of the LLM workloads now are deployed on Kubernetes clusters with GPU nodes and let's be honest this is the most expensive resource in the cluster. Currently, using GPUs in passthrough mode locks a single model to an entire GPU, leading to severe underutilization (~30%). In this talk I will explain how to manage GPU resources in an efficient way and attendees will understand how GPU cards are configured in a Kubernetes cluster, what is the difference between the three main Nvidia GPU installation modes: Passthrough, vGPU and MIG and how everything works behind the scene. I will demonstrate how Multi instance GPUs are the best solution for packing LLMs on Kubernetes and how it should be used in an advanced case scenario like packing multiple LLMs in the same cluster sharing the same GPUs without causing the noisy neighbor problem.

Breaking Barriers with Dapr: Simplified and Portable Cloud-Native Architectures

Microservices are often built with different technologies, leading to complexity in management and integration. This session demonstrates how Dapr simplifies multi-stack microservices across cloud environments. We will explore an e-commerce application built using Golang, Java, Python, and Vue.js, that we are going to deploy live (1) on premises (Docker-Compose), then on (2) AWS, then (3) Azure and finally orchestrated in a (4) multi-cloud setup. Attendees will see how Dapr’s features (service invocation, state management, pub/sub) abstract complexities, enable easy development and migration across environments. This session highlights how Dapr’s "Lift and Shift" approach facilitates seamless cloud transitions without re-architecture, making it an ideal solution for modern, multi-stack microservices.

Container Days London Sessionize Event Upcoming

February 2026 London, United Kingdom

ContainerDays Conference 2025 Sessionize Event

September 2025 Hamburg, Germany

KCD New York 2025 Sessionize Event

June 2025 New York City, New York, United States

Hajed Khlifi

Principal solutions architect- Cloud & AI

Luxembourg

Actions

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Jump to top