Session
An Apache Spark query's journey through the layers of Microsoft Fabric
Abstract: Join us for an exciting deep dive into the heart of Apache Spark! We'll take you on a journey to see exactly how your Spark queries get executed, both within Apache Spark itself and through the different layers of Microsoft Fabric. Here's what we'll explore together:
* Spark SQL and Catalyst: A breakdown of how Spark SQL works hand in hand with the Catalyst optimizer to make your queries smarter and faster.
* A Note on Tungsten: Discover how Tungsten boosts Spark’s performance with better memory management and lightning-fast execution.
* A Note on Fabric's Native Execution Engine: See how it brings the power of C++ to Spark for even faster query execution.
* Delta Lake: See how Delta Lake makes your data lakes more reliable and scalable, ensuring your data is always in top shape.
* Parquet Files: Learn why Parquet’s columnar storage is a game-changer for efficient data storage and quick retrieval.
We'll look into the official Apache Spark source code on GitHub, giving you a real, hands-on look at what's happening under the hood.
By the end of this session, you'll have a clearer understanding of how your queries run and some tools and tips to help you solve problems and optimize your Spark jobs for both speed and cost.