Sally Dabbah

Empowering innovation through Azure’s boundless possibilities

Herzliya, Israel

Meet Sally! Sally is a Data Engineer with expertise in Azure Cloud Analytics Services and years of experience under her belt.
Since earning her B.Sc. degree in Software Engineering, Sally has become a significant voice in Azure Cloud Analytics Services, publishing more than ten posts on the Microsoft Tech Community blog. What excites Sally most about her work is the opportunity to build Proofs of Concept (POCs) that help Microsoft customers unlock their business potential. By doing so, she remains true to her ultimate mission: to empower innovation through Azure's boundless possibilities. Away from her professional endeavors, Sally finds joy in baking pastries, a hobby she shares with her followers on Instagram for inspiration.

Awards

  • Most Active Speaker 2023

Area of Expertise

  • Business & Management
  • Government, Social Sector & Education
  • Information & Communications Technology
  • Media & Information
  • Real Estate & Architecture

Topics

  • Microsoft Azure
  • Azure Data Engineering
  • Microsoft Fabric
  • Azure Data Factory
  • Azure Synapse Analytics
  • migration to cloud
  • Apache Airflow

Efficient Data Partitioning with Microsoft Fabric: Best Practices and Implementation Guide

This session introduces Microsoft Fabric, Microsoft's SaaS analytics service, and its main components, including OneLake, Data Factory, Synapse Data Engineering, and more.
After the brief introduction, I will explain partitioning in a data lakehouse, the difference between a lakehouse and a warehouse, and how to organize data using the medallion architecture pattern. Data engineers use this pattern to store data logically in multiple layers and refine it step by step until it reaches the Gold layer, the final refinement the data needs before it is displayed in Power BI or handed to a third party, such as data scientists for ML operations.
A short demo then shows how to implement the architecture in Microsoft Fabric.
The session ends with about 10 minutes for Q&A.
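
As a rough illustration of the partitioning idea in this abstract, the sketch below shows how rows map to Hive-style partition folders, which is the layout a partitioned lakehouse table uses on disk. It is a minimal sketch, not the session's demo: the sample rows, the Tables/sales path, and the helper names are hypothetical. The write_partitioned function assumes a PySpark DataFrame, for example inside a Fabric notebook, and is not called here.

```python
from collections import defaultdict


def partition_path(base, row, keys):
    """Build a Hive-style folder path such as base/year=2024/month=1."""
    parts = [f"{key}={row[key]}" for key in keys]
    return "/".join([base] + parts)


def group_by_partition(rows, base, keys):
    """Group rows by the folder they would land in when partitioned by keys."""
    folders = defaultdict(list)
    for row in rows:
        folders[partition_path(base, row, keys)].append(row)
    return dict(folders)


def write_partitioned(df, path, keys=("year", "month")):
    """Write a Spark DataFrame as a partitioned Delta table.

    Assumes df is a PySpark DataFrame; path and keys are placeholders.
    """
    df.write.format("delta").mode("overwrite").partitionBy(*keys).save(path)


# Hypothetical sample rows; not data from the session.
sales = [
    {"year": 2024, "month": 1, "amount": 10.0},
    {"year": 2024, "month": 1, "amount": 5.5},
    {"year": 2024, "month": 2, "amount": 7.0},
]
layout = group_by_partition(sales, "Tables/sales", ["year", "month"])
for folder in sorted(layout):
    print(folder, len(layout[folder]))
```

Queries that filter on the partition columns only need to read the matching folders, which is the main reason to partition on columns that appear often in filters.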

Mastering Data Workflows: Microsoft Fabric's Metadata Pipeline Solution with the Medallion Architect

Microsoft Fabric provides both Data Lakehouse and Data Warehouse platforms for Data Analytics. In a separate post, I illustrated a Metadata Driven Pipeline pattern for Microsoft Fabric following the medallion architecture with Fabric Data Lakehouses used for both the Bronze and Gold layers and SQL views over tables for the Silver layer. Fabric Data Lakehouse is perfect for landing data into the Bronze layer. And if I can host my star schema in a Fabric Data Lakehouse, why consider Fabric Data Warehouse as the Gold layer? Because Fabric Data Warehouse has some features that may be a better fit for your organization.
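
To make the metadata-driven idea more concrete, here is a minimal sketch in plain Python. The control entries, table names, and field names are hypothetical and do not come from the post mentioned above. The point is only that the pipeline logic stays generic while a metadata list decides which tables are loaded and how.

```python
# Hypothetical control metadata: one entry per table to land in the Bronze layer.
PIPELINE_METADATA = [
    {"source": "crm.customers", "target": "bronze.customers", "load": "full"},
    {
        "source": "erp.orders",
        "target": "bronze.orders",
        "load": "incremental",
        "watermark_column": "modified_at",
    },
]


def build_extract_query(entry, last_watermark=None):
    """Return the extract query for one metadata entry."""
    query = f"SELECT * FROM {entry['source']}"
    if entry["load"] == "incremental" and last_watermark is not None:
        query += f" WHERE {entry['watermark_column']} > '{last_watermark}'"
    return query


def plan_run(metadata, watermarks):
    """Turn metadata into a list of (target, query) steps without running them."""
    return [
        (entry["target"], build_extract_query(entry, watermarks.get(entry["target"])))
        for entry in metadata
    ]


steps = plan_run(PIPELINE_METADATA, {"bronze.orders": "2024-01-01"})
for target, query in steps:
    print(target, "<-", query)
```

Adding a new table then means adding a metadata entry rather than building a new pipeline.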

Microsoft Fabric: All About The Data Analytics Platform

A brief introduction to Fabric components and main concepts (OneLake, OneSecurity, V-Order optimization, and more), followed by a short demo of how to run ETL processes using different Fabric components.

Step-by-Step Guide: Building ETL Workflows in Microsoft Fabric

In this session, we will delve into building robust ETL (Extract, Transform, Load) pipelines using Microsoft Fabric. This demonstration aims to enhance our data insights by leveraging powerful features of Microsoft Fabric.

Throughout the session, participants will explore the full capabilities of Microsoft Fabric, focusing on:

Workspace: Setting up collaborative environments for efficient data management.
Lakehouse and Warehouse: Constructing the Medallion architecture for unified data storage and processing.
Dataflow Gen2: Designing scalable and efficient data processing workflows.
Pipelines: Automating the flow of data from source to destination.
Notebooks: Using interactive notebooks for exploratory data analysis and pipeline development.
SQL Endpoints: Integrating structured query capabilities for advanced data manipulation.
Datasets: Organizing and managing data assets for streamlined access and analysis.
Power BI Reports: Visualizing insights through interactive and insightful reports.

By the end of the session, attendees will have gained practical experience in harnessing these features to build resilient ETL pipelines, empowering them to extract maximum value from their data resources.

Prerequisites
Internet connectivity
Your own active Azure Subscription
Your own Microsoft Fabric Trial

Mastering the Art: Orchestrating ADF with the Power of Workflow Orchestration Manager

An in-depth session in which we tackle the challenge of dynamically invoking Azure Data Factory (ADF) pipelines using Apache Airflow. While ADF does not natively support this functionality, Apache Airflow provides a powerful solution to orchestrate these pipelines effectively.

What You'll Learn:
Running ADF Pipelines from Airflow:

Setup and Configuration: We will guide you through the prerequisites, including necessary tools and accounts (Azure, ADF, Airflow), and provide a step-by-step configuration process for integrating Airflow with ADF.
Creating a DAG (Directed Acyclic Graph): Learn how to create a DAG in Airflow and configure tasks to invoke ADF pipelines.
Execution and Monitoring: Experience a live demonstration of executing the DAG, monitoring, and managing the pipeline execution.
Running Custom Modules in Airflow:

Module Development: Discover how to create custom modules tailored for specific tasks.
Integration with Airflow: Understand the process of adding custom modules to Airflow DAGs.
Execution and Monitoring: See a live demo of executing custom modules and observing the execution process.
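
The first demo's steps (configuring a connection, defining a DAG, and adding a task that invokes an ADF pipeline) could look roughly like the sketch below. It is not the session's demo code. It assumes Airflow 2.4 or later with the Microsoft Azure provider package installed, and the connection ID, resource group, factory, and pipeline names are placeholders. The imports are guarded so the helper function still runs where Airflow is not installed.

```python
from datetime import datetime


def adf_task_kwargs(pipeline_name, parameters=None):
    """Collect the arguments for one ADF pipeline run task.

    Connection ID, resource group, and factory name are placeholders.
    """
    return {
        "task_id": f"run_{pipeline_name}",
        "pipeline_name": pipeline_name,
        "azure_data_factory_conn_id": "azure_data_factory_default",
        "resource_group_name": "my-resource-group",
        "factory_name": "my-data-factory",
        "parameters": parameters or {},
        "wait_for_termination": True,  # block until the ADF run finishes
    }


try:
    from airflow import DAG
    from airflow.providers.microsoft.azure.operators.data_factory import (
        AzureDataFactoryRunPipelineOperator,
    )
except ImportError:  # Airflow or the Azure provider is not installed
    DAG = None

if DAG is not None:
    with DAG(
        dag_id="run_adf_pipeline_example",
        start_date=datetime(2024, 1, 1),
        schedule=None,  # no schedule: trigger the DAG manually
        catchup=False,
    ) as dag:
        run_pipeline = AzureDataFactoryRunPipelineOperator(
            **adf_task_kwargs("copy_sales_data", {"run_date": "2024-01-01"})
        )
```

The Airflow connection referenced by azure_data_factory_conn_id has to exist and hold credentials with permission to run pipelines in the target factory.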

Session Highlights:
Overcome the limitation of ADF's native capabilities by leveraging Apache Airflow.
Gain hands-on experience through two detailed demos.
Enhance your data orchestration skills with practical, real-world applications.

This session is perfect for data engineers, developers, and IT professionals looking to expand their knowledge and capabilities in data pipeline orchestration using cutting-edge tools.

Key Takeaways from PySpark Notebooks in Microsoft Fabric

In this session, I will guide you step by step through the process of working with Notebooks and extracting data from APIs. Here's the approach we'll take:

Introduction to Notebooks and APIs: I will explain how Notebooks in Fabric provide a fast, efficient solution for data transformation and how APIs can serve as valuable data sources.

Setting Up the Environment: We'll start by installing and importing the essential libraries needed for working with PySpark and interacting with APIs.

Extracting Data from APIs: I'll demonstrate how to fetch data from an API, process it, and bring it into a DataFrame for further manipulation.

Creating a User Defined Function (UDF): I'll show you how to create a custom UDF in PySpark to handle more advanced data transformations. We will walk through the steps of defining the function, converting it into a UDF, and applying it to your DataFrame.

Complex Data Transformations: Finally, I will guide you on how to use UDFs to perform more complex transformations on your data, making it ready for other Fabric tasks.

Copilot: Understand how to use Copilot in notebooks for Data Engineering workloads to generate code snippets, provide explanation for existing code, suggest data visualizations, suggest analytical machine learning models, and more.

By the end of this session, you’ll have a clear understanding of how to leverage Notebooks and APIs in Fabric to perform complex data transformations with PySpark and UDFs.
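
The UDF steps described above (define a function, convert it into a UDF, apply it to a DataFrame) can be sketched as follows. This is an illustration under stated assumptions, not the session's notebook: the response shape with a top-level "results" list, the city column, and all function names are hypothetical. PySpark is imported only inside add_clean_city, so the plain Python parts run without Spark.

```python
import json


def records_from_payload(payload_text):
    """Parse an API response body and return its list of records.

    Assumes the JSON has a top-level "results" list; real APIs differ.
    """
    return json.loads(payload_text).get("results", [])


def clean_city(value):
    """Collapse extra whitespace and title-case a city name; keep None as None."""
    if value is None:
        return None
    return " ".join(value.split()).title()


def add_clean_city(df):
    """Apply clean_city to the city column of a Spark DataFrame through a UDF.

    Imports PySpark lazily so the functions above work without Spark.
    """
    from pyspark.sql.functions import col, udf
    from pyspark.sql.types import StringType

    clean_city_udf = udf(clean_city, StringType())
    return df.withColumn("city_clean", clean_city_udf(col("city")))


# Hypothetical response body standing in for a real API call.
payload = '{"results": [{"id": 1, "city": "  new   york "}, {"id": 2, "city": null}]}'
rows = records_from_payload(payload)
print([clean_city(row["city"]) for row in rows])
```

Testing the function on plain Python values first, as in the last lines, makes it easier to debug before wrapping it in a UDF. Built-in Spark functions are usually faster than Python UDFs, so a UDF is most useful when no built-in covers the transformation.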

This session is suitable for beginners, although basic knowledge of Azure Synapse Analytics or Microsoft Fabric is recommended. While the session is primarily aimed at data engineers, data architects and stakeholders are also welcome to attend.

Triangle Area SQL Server User Group (TriPASS) User group Sessionize Event Upcoming

Not scheduled yet.

M365 Community Days MTL Octobre 2024 Sessionize Event

October 2024 Montréal, Canada

SQLSaturday - Minnesota 2024 Sessionize Event

September 2024 Saint Paul, Minnesota, United States

Experts Live Europe 2024 Sessionize Event

September 2024 Budapest, Hungary

SQLSaturday Denver 2024 Sessionize Event

August 2024 Denver, Colorado, United States

(Virtual) Kansas City SQL Server Users Group User group Sessionize Event

August 2024

SQLSaturday Wellington 2024 Sessionize Event

August 2024 Wellington, New Zealand

Data Toboggan - Cool Runnings 2024 Sessionize Event

July 2024

Data Platform Next Step 2024 Sessionize Event

June 2024 Copenhagen, Denmark

Data Point Prague Sessionize Event

May 2024 Prague, Czechia

Visual Studio Live! Las Vegas 2024 Sessionize Event

March 2024 Las Vegas, Nevada, United States

South Florida Women in Tech User group Sessionize Event

February 2024

Data.TLV Summit 2024 Sessionize Event

February 2024 Rishon LeTsiyyon, Israel

Power BI & Fabric Summit 2024 Sessionize Event

February 2024

Microsoft Fabric Usergroup Denmark 2024 H1 User group Sessionize Event

February 2024

Microsoft Learn Zero to Hero Community User group Sessionize Event

January 2024

Azure User Group Sweden User group Sessionize Event

December 2023

Data Toboggan - Alpine Coaster 2023 Sessionize Event

November 2023

2023 SQL Saturday Silicon Valley (SQLSatSV) Sessionize Event

October 2023 San Jose, California, United States

Azure Back to School 2023 Sessionize Event

September 2023
