Turbocharging Your Data Lake: Real-World Apache Iceberg Performance Tuning

Want to make your Apache Iceberg tables blazingly fast ? Join me for an in-depth session packed with practical strategies to fine-tune your data lake for top-tier performance and scalability. We’ll walk through critical table maintenance procedures—such as metadata optimization, handling the small-file problem through smart compaction, and visualizing bottle necks in our table —alongside battle-tested best practices for both streaming and batch processing workloads.
We’ll also dive into a key question many teams overlook: Does the catalog layer matter? Spoiler: it does. The choice between catalogs can have real implications on write/read performance, and multi-engine compatibility.
Plus, discover how Puffin files are reshaping how metadata is stored and queried, unlocking new ways to accelerate your analytical workloads. Expect actionable insights, compelling benchmarks, and real-world takeaways to help you lower latency, reduce I/O, and keep compute costs in check.

Amit Gilad

Lakesphere, CTO

Actions

View Speaker Profile

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Session

Turbocharging Your Data Lake: Real-World Apache Iceberg Performance Tuning

Amit Gilad

Links

Actions