Most Active Speaker

Erwin de Kreuk

Erwin de Kreuk

Lead Data and AI | Data Platform MVP | Public Speaker | InSpark | Innovate to Accelerate

Rotterdam, Netherlands

Erwin de Kreuk is a passionate and very experienced Microsoft Solution Architect.
Working as a Principal Consultant/ Lead Data and AI for InSpark in the Netherlands. Speaking at different national and international data community events. He is been awarded as Data Platform MVP.

He is working in the world of data on the Microsoft Platform for last 14 years and the last 6 years he has shifted his focus to the Azure Platform.
Answering complex customer cases and technical issues are part of his day-to-day work. In addition to this work, he is a member of the Technology Board within InSpark and leads a team of highly experienced Data Expert in the field of Microsoft Data Platform.

He is eager in helping out customers in getting the most added value out of their complex Analytics environment with a strong focus on solutions in the Azure Cloud (Platform as a Service).
As a Technology Board member, he is always investigating the latest (vs newest) possibilities/opportunities and sharing his enthusiasm among his colleagues, the community and customers. He is one of the main Stakeholders for the InSpark Solution (Managed) Oxygen, a Modern Data platform Estate as-a-service.

Awards

Area of Expertise

  • Travel & Tourism
  • Manufacturing & Industrial Materials
  • Transports & Logistics
  • Business & Management

Topics

  • Azure
  • Azure Synapse
  • Azure SQL Database
  • Azure Data Factory
  • SSIS
  • SSAS
  • Azure Data Platform
  • Azure PaaS
  • Azure Analysis Services
  • Azure CosmosDB
  • Azure Synapse Analytics
  • IoT
  • Microsoft Purview

Designing and managing a cost-effective data platform in Azure Synapse Analytics

Azure Synapse Analytics is a powerful data platform, but it can also be expensive if you don't know what you're doing. In this session, we will go through the different components of Azure Synapse Analytics and discuss how to design a cost-effective data platform.

We will cover topics such as choosing the right pricing tier, optimizing data storage and processing, and leveraging built-in cost management features. We will also discuss how to optimize your data platform for cost efficiency by using features such as serverless compute, pre purchase compute and reserved capacity.

Attendees will leave with a better understanding of how to design and manage their Azure Synapse Analytics platform for cost efficiency and how to design a cost-effective data platform that meets your organization's needs.
Topics we will cover:

- Pricing Models for compute Resources in Azure Synapse Analytics
- Storage Types and Tiers
- Pricing Models
- Optimizing Cost with Resource-Scaling
- Using serverless compute and reserved capacity options for cost savings

Building a Metadata driven ELT Framework in Azure Synapse Analytics

A metadata-driven ELT framework in Azure Synapse Analytics is a way of organizing and managing data pipelines that involves using metadata to define and control the flow of data from source to destination. This can be useful for organizations that have a large number of data pipelines and want to have more control over how data is processed and moved between systems.

In this session, we will discuss the benefits of using a metadata-driven approach to managing data pipelines in Azure Synapse Analytics. We will cover the key components of a metadata-driven ELT framework, including the metadata repository and the processes for managing and maintaining metadata. We will also provide practical examples and best practices for implementing a metadata-driven ELT framework in your organization.

The session is intended for data engineers and other technical professionals who are interested in using Azure Synapse Analytics to manage and optimize data pipelines.

Make your Azure Synapse Analytics a stronghold and win the game!

Want to learn how to build a secure by design Azure Campaign? Join us for an action-packed game!

The goal of today’s game is to provide guidance on building a secure and cost-effective data adventure and on making the technologies work together seamlessly and securely.
The game is led by the **Dungeon Master**, so gather your troops to play the campaign.

Building a secure by design Azure Synapse Analytics adventure is not something what we play by default, the troops have to make their strategy well in advance.
In the morning we will start with the first adventure “Greyhawk”. This is the design adventure, the heroes will use the different security design principles from the Well Architecture Framework (WAF).

The next adventure “Eberron” is the deployment, the knights should work carefully together, some configuration matters and can only be set from the first moment and are irreversible, so making the right decision are very important here. During the first part of this adventure, we will learn how to configure, build and to secure an Azure Synapse Analytics campaign.
Data exfiltration Protection, (Managed) Private Endpoints and securing connections are settings from this adventure.

In the afternoon we will finalize the “Eberron” adventure, with a strong focus on how to manage access control before we start the last adventure “Forgotten Realms”.
The campaign is now built and the Synapse Workspace is ready for use. The troops will look, how they can build and transform secure Pipelines in Azure Synapse Analytics in a safe way with the help of Azure Key Vault and by applying policies. Policies ensure that we can enforce certain configuration settings.
At the end of the day, the troops exactly know how to build their Azure Synapse Analytics campaign, completely accurately and what building secure by design adventures will do with their costs.
With the final quiz we will see which heroes have won the GAME.

This game is suitable for mix of characters, a hacker " Rain Rage" as a rogue, a data scientist "Donilor the Great" as a Wizard, a data engineer "Tony" as an artificer, a cloud engineer "Danielle de Brave" as a paladin, each with their own strong and security weaknesses.

Data Governance with Microsoft Purview - Ask the Experts

In this open session, it's your chance to ask our panel of 3 MVPs about data governance for your business using Azure Purview

Managing access to data sources in your data estate with Microsoft Purview

One of the new applications/apps within Microsoft Purview Governance Portal are Access Policies.
Access policies in Microsoft Purview allow you to manage access to different data systems across your entire data estate.
The big advantage of data policies is that you do not have to apply RBAC roles and you have an overview of all applied policies.

Are you a data consumer? Then the self-service access policy is an easy way to request access to data while browsing or searching for data.

Are you a data producer? Then access policies will help you to easily create and publish access to data sources.

DevOps policies are a simple, central, cloud-based experience that allows you to provision access at scale to DBAs and other DevOps users

In this session, I will explain and show you how to create and publish data policies, devops policies and how to set up a self-service access workflow.

Create an Azure Synapse Lake Database without writing code

So, I don't have to write any code to build up my facts and dimensions, yes you have read that correctly.

Within Azure Synapse Analytics, a new functionality/tool is available, the map data tool. The map data tool allows you to easily map your Data from a source into the target tables in the Synapse Lake Database.

Map Data is a guided experience where you can generate a mapping data flow without having to start from a blank canvas. Once you have created the mappings then you can easily generate a scalable mapping data flow in a Synapse Pipeline.

After you have published the Synapse Pipelines, you can run these Pipelines and then visualize your generated data model in Power BI? Sounds great or not?

I will show you how the map data tool works and how to visualize the data in Power BI afterwards in a step-by-step demo-based session. After this session you will have the knowledge to build and visualize your first Synapse Lake Database.

Lifecycle Management for Azure Synapse Analytics

Building a secure data platform by design is very important these days. How do we ensure that we keep our InfoSec happy and that our policies do not fail?
Connection string, username and passwords needs to be stored as secrets in de Azure Key Vault.

• How can we apply the secrets in Azure Synapse
• How do we deploy Synapse Pipelines or code in Azure DevOps to Test, Acceptance and Production environments?
• Can this be setup dynamically?

During this session I will walk you through some design decisions and give answer on above questions.

You will learn how to build and validate your Synapse Workspace in Azure DevOps, how to secure your connection strings and finally deploy your code and pipelines (CI/CD).
A basic knowledge of Azure Synapse and Azure DevOps can be useful to understand this session well
By the end of the session, you're ready to implement the deployment in your projects and to make your InfoSec happy.

Solve your Data Governance challenges with Microsoft Purview

What data do I have? Where did the data come from? Can I trust it? How do I manage access and control?
These are questions that a Chief Data Officer wants to have answers on when analyzing an organization's Data Estate.

Data consumer, data producers and the security administrator all have their own challenges. Microsoft Purview is designed to address these challenges.

Microsoft Purview will help to understand assets across the entire data estate and provide easy access to all data, security and risk solutions.

In this session, we'll take a closer look at Unified Data Governance, one of Microsoft Purview's solutions and see if we have answers on the followings questions:

· What challenges do organizations and user groups face with Data Governance?
· How can Microsoft Purview contribute to this?
· How can we easily create a holistic, up-to-date map of our data landscape?
· How can we find valuable and reliable data?
· What are the costs for Microsoft Purview?
· What are the latest/new features available in Microsoft Purview

So if you're a CDO, a data consumer, a data producer, or a security administrator, these sessions are definitely worth following.

Microsoft Purview what does this mean to me as an organization?

Microsoft Purview brings together data governance from Microsoft Data and AI, along with compliance and risk management from Microsoft Security and is now complemented with many other solutions
But what's in for me as an organization?
• Which solutions does Microsoft Purview actually include?
• Which solutions can be easily deployed in my organization?
• Which portals can I use now and for which solution?
An agreeable series of questions that we will answer during this session. At the end of the session, you will have an answer on what Microsoft Purview could mean in your organization.

How to use and create Data Lineage in Microsoft Purview?

The use of data Lineage is a hot topic for many organizations.
Many organizations struggle with answers to the following questions:
• I want to adjust a measure, but where do I have to adjust it and where does the data come from?
• What will be the effect on my data if I rename this column in the source?
• Can I visually overview my Data Estate including how the data has been transformed?

As you can see, data lineage is used for different kinds of backward-looking scenarios, such as troubleshooting, root cause discovery in data pipelines, and debugging. Lineage is also used for data quality analysis, compliance and 'what if' scenarios, often referred to as impact analysis.

How can Microsoft Purview help us to create these visual overviews to better understand our Data Estate.
During this session I will take you through guidelines how you can enable Data Lineage with Microsoft Purview, Azure Synapse Analytics and how to use Custom Lineage components for unsupported data sources with Apache Atlas.

Lake Database with Database Template and Mapping Data with Azure Synapse Analytics

Database templates in Azure Synapse Analytics are blueprints which can be used by organizations to plan, architect and design solutions.

How can we use these Database Templates in a day-to-day business, in order to speed up to automate this process? Map data tool can help us with that. The map data tool can generate a mapping data flow without having to start from a blank canvas. In this presentation, you will see how this all works in a step-by-step demo-based session.

After this session you will have the knowledge to build your first Lake Database.

Get control of your Azure Synapse environment, define your access control the right way today!

Azure Synapse Analytics is Microsoft's analytical engine that brings together data integration, enterprise data warehousing, and big data analytics.
As we now take a more holistic approach, more different types of user groups will use the platform. The more important the setup of an authorization matrix in advance will be. The following topics will be covered during this session:

• What Azure AD roles do we need to deploy an Azure Synapse Workspace?
• How can we simplify access control by using security groups that are aligned with people's job roles.
• How do we handle different user personas in Azure Synapse Analytics? For example, what is a Data Scientist or Data Engineer allowed to do and what not?
What access control settings do we need to have to:
• Store code in Azure Devops
• Debug a pipeline
• Run a Notebook
During this session I would like to take you through some practical examples on how you can set up these roles for your Azure Synapse in order to get in to control of your environment.

Automate the deployment and governance of your Azure Synapse Solutions

The first treatment for severe cases of "Click-Ops".

In this full day session we will go hands-on with a lab to start drafting a repeatable data infrastructure without using the azure portal. We will dive into the different parts that go into a well managed/operated and secured Azure Synapse environment.

Learn how you can make incremental changes to your infrastructure with Terraform without breaking the flow. Get a grasp on possible increase in costs and monitor overall performance without breaking a sweat.

The final result is a Data Platform with Azure DataLake, Azure Keyvault, Azure Synapse Analytics where we use the Managed Vnets, Logging, monitoring and of course we will ensure that the public endpoints are not accessible from outside. We will certainly not forget you to teach how to setup alerts, security and policies.

At the end of the day, you know exactly what it is like to be a “Data-Ops” and what automating and securing a data platform can do for you.

Scale your SQL Pool dynamically in Azure Synapse

In this Lightning talk I explain how can you scale up or down your SQL Pool in Azure Synapse Analytics using an Synapse Pipeline. An easy way so save some cost in your Analytics Environment

Create a Secure Azure Synapse Analytics environment, step by step.

In this session, we will explore the capabilities and features of Azure Synapse Analytics, a fully managed cloud-based data integration, analytics, and visualization platform.

We will start by creating a new Azure Synapse Analytics environment from scratch, including setting up the environment, discussing the different settings and options for the deployment.
Creating secure connections to data sources and building data pipelines.

By the end of this demo rich session, you will have a solid understanding of how to create and use a secure Azure Synapse Analytics environment to analyze and visualize data.

Techorama 2023 Belgium Upcoming

May 2023 Antwerpen, Belgium

Data Saturday Stockholm 2023 Upcoming

May 2023 Stockholm, Sweden

SQLDay 2023 Upcoming

May 2023 Wrocław, Poland

Iberian Technology Summit Upcoming

April 2023 Olhão, Portugal

SQLBits 2023 - General Sessions Upcoming

March 2023 Newport, United Kingdom

Power BI Gebruikersdag 2023 Upcoming

March 2023 Utrecht, Netherlands

Experts Live Netherlands 2022

September 2022 's-Hertogenbosch, Netherlands

DATA:Scotland 2022

September 2022 Glasgow, United Kingdom

Scottish Summit 2022

June 2022 Glasgow, United Kingdom

DataGrillen 2022

June 2022 Lingen, Germany

Data Saturday Stockholm 2022

May 2022 Stockholm, Sweden

SQLBits 2022

March 2022 London, United Kingdom

Data.Toboggan 2022

January 2022

DataMinutes #2

January 2022

#DataWeekender v4.2

November 2021

dataMinds Connect 2019

October 2019 Mechelen, Belgium

Data Saturday Holland

October 2019 Utrecht, Netherlands

Techorama Netherlands 2019

October 2019 Ede, Netherlands

Intelligent Cloud Conference 2018

May 2018 Copenhagen, Denmark

Erwin de Kreuk

Lead Data and AI | Data Platform MVP | Public Speaker | InSpark | Innovate to Accelerate

Rotterdam, Netherlands