Spark ETL To Derive Sales Insights on Azure HDInsight And Power BI

In this session, we will review how easy it is to set up an end-to-end ETL data pipeline that runs on StreamSets Transformer to perform extract, transform, and load (ETL) operations. The pipeline will run on Apache Spark for Azure HDInsight cluster to extract raw data and transform it (cleanse and curate) before storing it in multiple destinations for efficient downstream analysis. The pipeline will also leverage technologies like Azure Data Lake Storage Gen2 and Azure SQL database, and the curated data will be queried and visualized in Power BI.

Dash D

Director of Platform and Technical Evangelism

San Francisco, California, United States

Actions

View Speaker Profile

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Session

Spark ETL To Derive Sales Insights on Azure HDInsight And Power BI

Dash D

Links

Actions