Spark Performance tuning in Spark 3 - is it still needed?

The aim is to pinpoint Spark troubleshooting and performance tuning techniques which are tricky and not well understood. Also they are relevant even in the newest versions of Spark.

Are some technical aspects of Apache Spark tricky? Are you struggling with performance or troubleshooting? Did you expect that Spark 3.x will solve all your problems? But it’s not the case?
We’ll highlight the nitty gritty details beyond the SQL. In a digestible manner. All to truly help you get your top Apache Spark issues resolved and get the most of your ecosystem. Briefly? We’ll share the top takeaways on avoiding failures from our longstanding experience with Spark. Skewed data, Cartesian join, executor fountaining - we will cover that all.

Marcin Szymaniuk

CEO, Data Engineers at TantusData

Berlin, Germany

Actions

View Speaker Profile

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Session

Spark Performance tuning in Spark 3 - is it still needed?

Marcin Szymaniuk

Links

Actions