Building a Scalable Data Lakehouse: Real-Time Analytics with Apache Druid and Iceberg on Kubernetes

Imagine analyzing streaming data in real time while ensuring your storage solutions perform efficiently and scale seamlessly. Join us to explore an architecture that meets modern data demands through advanced cloud-native technologies.

We’ll discuss best practices for resource management in Kubernetes, focusing on CPU, memory, and storage for Druid and Iceberg. Learn about setting resource requests and limits, along with using Horizontal Pod Autoscaler (HPA) to dynamically scale Druid nodes for optimal performance and cost efficiency.

We’ll also address challenges like data consistency and operational overhead in using Druid and Iceberg. By offering practical solutions, you’ll gain insights to enhance your cloud-native architecture's robustness and ensure it meets future data needs effectively.

Shekhar Prasad Rajak

Data/AI , Platform Engg, Open Source

Actions

View Speaker Profile

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Session

Building a Scalable Data Lakehouse: Real-Time Analytics with Apache Druid and Iceberg on Kubernetes

Shekhar Prasad Rajak

Links

Actions