Doug Leal

Director of Consulting (Data & Analytics) at CGI

Orlando, Florida, United States

Enterprise Architect with 20+ years of experience executing key data-centric solutions, big data, cloud, analytics, and data engineering strategies and implementations in both enterprise and consulting capacities. Trusted advisor of enterprise data architecture for existing and new platforms. Machine Learning enthusiast.

Awards

  • Most Active Speaker 2023

Area of Expertise

  • Information & Communications Technology

Topics

  • All things data
  • Microsoft Data Platform
  • Azure Data Lake
  • Data Analytics
  • Databricks
  • Apache Spark
  • Azure Data & AI
  • Databases
  • Data Warehousing
  • ETL/ELT

Optimizing Delta Tables

Delta tables are frequently used for data lakes, where data is ingested via streaming or in large batches. During this session, we will dive into techniques for optimizing Delta tables, with a particular focus on essential strategies for data lake partitioning, data skipping and z-ordering to enhance performance. In addition, we will review important features of Delta tables, including time travel, vacuum, and many more!

Seminar for College Students

This two-hour seminar is composed of four sessions of approximately 30 minutes each. You'll hear about the many different job titles and technologies in the data field, how to get ready for your first interview, and how to participate in your profession now, with plenty of time for Q&A. No preparation is needed. Join us to hear professionals working in IT share ideas and answer questions relevant to students who are focusing their education and getting ready for their first job in IT! See your professor about extra credit for attending. Note that this seminar is held concurrently with SQLSaturday; you're welcome to attend those sessions after the seminar completes.

End to End MLOps using Azure Machine Learning

The first segment of this beginner-level session will provide an introduction to MLOps and an overview of the machine learning lifecycle, spanning data preparation, model training, deployment, and maintenance.
The subsequent part will shift the focus to hands-on MLOps practices, harnessing the capabilities of Azure ML for seamless integration into your machine learning projects.
Participants will explore various MLOps techniques, including data versioning, experiments, and model tracking, which are essential for ensuring reproducibility and scalability. Moreover, we will explore operationalizing models using Azure ML, empowering attendees to efficiently manage and scale model deployments across their organizations.
This session is suitable for anyone interested in MLOps or who wants to learn more about Azure ML's capabilities.

Building Data Lake on Azure

Your organization is building a Data Lake on Azure and you are on the team implementing it... How do you organize the information in the Data Lake? What are the security best practices to protect your data? What paths do companies take to build an effective solution? In this session, you will learn about design and architecture patterns for building Data Lakes on the Azure cloud. We will discover ways to design zones or layers that give different groups access to data curated at different levels. You will leave this session with a clear understanding of how to implement and bring together the best of your data warehouse and data lake into one unified platform.

Cutting Through the Buzz: LLM, RAG, and GPT - Making Sense of Generative AI

Join us for an introductory session about Generative AI! In this session, we will "Cut Through the Buzz" and demystify LLM, RAG, and GPT. We will discover how these technologies work behind the scenes with live demos showcasing their practical applications. Whether you're new to AI or looking to expand your understanding, we'll dive into details and show you how to leverage these tools effectively. Join us to explore, ask questions, and leave with a clearer perspective on how Generative AI can transform your projects.

Is Data Mesh right for your Organization?

This session explores the concept of Data Mesh and its potential applicability to your organization. Based on theoretical insights and real-world case studies, attendees will gain a deep understanding of the principles behind Data Mesh and its implications for data management, scalability, and agility. In addition, we will discuss key considerations when determining whether Data Mesh aligns with your organization's goals, culture, and technical capabilities.
Attendees will walk away from this session with a better understanding of:
- What Data Mesh is
- The pros and cons of Data Mesh
- A walkthrough of a successful Data Mesh implementation

How to get started with MLOps in Azure Databricks

The first portion of this entry-level session will introduce you to the machine learning life cycle, including data preparation, model training, selection, and deployment. The second portion will cover how to use Azure Databricks for your machine learning project. We will try different algorithms and parameters, and track this information so that we can reproduce our work. We will learn how to operationalize your model with Databricks Machine Learning, and gain a deeper understanding of how to scale deployments of models across your company.

Data Governance and Management with Databricks Unity Catalog

Adopting Unity Catalog has been crucial to meeting compliance and auditing requirements in the Lakehouse. In this session, we'll discover the intricacies of Unity Catalog's integration with the Databricks Platform, providing you with the insights and tools necessary to harness the full potential of your data assets. Gain invaluable knowledge on accessing Unity Catalog through Clusters and SQL Warehouses, enabling seamless data management across your organization. Finally, we will share how to integrate Unity Catalog external Delta tables with OneLake using shortcuts.

Discover best practices for creating and governing data assets within Unity Catalog, empowering your team to unlock actionable insights and drive informed decision-making.

Data Engineering with Databricks on Google Cloud

Databricks, based on Apache Spark, is a fast, easy-to-use, and scalable data platform.
With Databricks on Google Cloud, you can build open, flexible data lakes integrated with BigQuery and Looker.
In this session, you will learn about Spark and Databricks, what problems they solve, and how Databricks works on Google Cloud.
This isn't another theoretical presentation; this session will include hands-on demonstrations of loading, transforming, and visually presenting data with Databricks.

Getting Started with Azure Databricks: Data Engineering

Azure Databricks, based on Apache Spark, is a fast, easy-to-use, and scalable data platform.
In this session, you will learn what Apache Spark and Azure Databricks are, what problems Azure Databricks solves, and how Azure Databricks works.
This session will include hands-on demonstrations of loading, transforming, and presenting data with Azure Databricks.

PASS Data Community Summit 2024 Upcoming

Is Data Mesh Right for Your Organization?

November 2024 Seattle, Washington, United States

SQLSaturday Orlando 2024 Sessionize Event Upcoming

October 2024 Sanford, Florida, United States

DevFest Florida Orlando 2024 Sessionize Event Upcoming

September 2024 Sanford, Florida, United States

Atlanta Developers' Conference 2024 Sessionize Event Upcoming

September 2024 Alpharetta, Georgia, United States

SQL Saturday Baton Rouge 2024 Sessionize Event

July 2024 Baton Rouge, Louisiana, United States

SQL Saturday South FL 2024 Sessionize Event

June 2024 Fort Lauderdale, Florida, United States

SQLBits 2024 - General Sessions Sessionize Event

March 2024 Farnborough, United Kingdom

DISTRIBUTECH International 2024

Unlocking the Power of Insights with Data Mesh and Data Lakehouse

February 2024 Orlando, Florida, United States

Orlando Code Camp 2024 Sessionize Event

February 2024 Sanford, Florida, United States

SQL Saturday Atlanta 2024 - BI & Data Analytics Sessionize Event

February 2024 Alpharetta, Georgia, United States

PASS Data Community Summit 2023

Getting Started with Azure Databricks: Data Engineering

November 2023 Seattle, Washington, United States

DevFest Florida Orlando Sessionize Event

October 2023 Sanford, Florida, United States

SQL Saturday Baton Rouge 2023 Sessionize Event

July 2023 Baton Rouge, Louisiana, United States

SQL Saturday South FL 2023 Sessionize Event

June 2023 Davie, Florida, United States

SQL Saturday Jacksonville #1041 Sessionize Event

May 2023 Jacksonville, Florida, United States

SQL Saturday Atlanta 2023 - BI & Data Analytics Sessionize Event

February 2023 Alpharetta, Georgia, United States

SQLSaturday Orlando 2022 Sessionize Event

October 2022 Sanford, Florida, United States

SQL Saturday Orlando 2022 #1030

Seminar for College Students @ Seminole State College.

October 2022 Orlando, Florida, United States

PASS Data Community Summit 2021 Sessionize Event

November 2021

DISTRIBUTECH International 2019

Leveraging Utility Data Analytics for Insights

February 2019 New Orleans, Louisiana, United States
