Barney Lawrence

Consultant focusing on data analytics and engineering on the Microsoft platform

Chesterfield, United Kingdom

Actions

Barney Lawrence has over a decade's experience helping people make the most of their data on the Microsoft data platform.
Having moved from the guy who knows Excel, to Data Analyst, to Accidental BI Developer and finally to Actual BI Developer, he now works as a consultant.
He currently works as a Senior Consultant for Simpson Associates an award-winning Microsoft partner where he specializes in local government and health data.

Area of Expertise

Government, Social Sector & Education
Health & Medical
Information & Communications Technology
Travel & Tourism

Topics

Databases
Business Intelligence
data engineering
Analytics
SQL

(Almost) Everything You Wanted To Know About Purview Data Governance

Microsoft Purview Data Governance delivers a broad set of tools to allow an organisation to ensure its data assets are discoverable, well understood, and trusted. This session will take anyone looking to understand or to get started using Purview Data Governance on a tour of Purview's Data Map and Unified Catalog. We'll visit all the key features while providing lessons learned from real world projects to help you to get the best from them.

We will also look at how to successfully deliver Purview within your organisation, how to sequence your roll out, who to involve and when, how to effectively deliver a pilot project and how much it will cost you.

Finally, we'll take a peek at the Purview road map focusing on future features you need to be aware of and prepare for.

By the end of the session, you will be ready to take what you have learned and begin preparing for a successful rollout of Purview Data Governance within your organisation.

Transfer Your T-SQL Skills to Spark in Fabric and Beyond

Are you an old school query writer looking to break into the brave new world of Lakehouses and Notebooks in Microsoft Fabric? Perhaps you're eying up Databricks or hoping to get more out of Synapse Analytics? Getting started with a whole new query engine and language might feel intimidating, but this session is here to help.
We'll introduce you to Spark with a focus on starting where you are; someone who knows how to write a decent SQL query looking to pick up a new skill and move to a new platform.
We will cover:
What are the similarities and differences are between spark and a classic SQL Server engine.
How to decide between SparkSQL and PySpark or when to use a combination of both.
How you can leverage the T-SQL you already know and even re-use existing queries to hit the ground running.
Neat new tricks that take you beyond SQL allowing you to do more or equally important, type less.

All this plus demos and example notebooks to take home delivered by two old school data people who learned it the hard way, so you don't have to.

Data Linkage Options: MDM vs Splink (PySpark)

The problem of linking multiple disconnected datasets is a challenging one. It arises when there is no common ID field available to join the different datasets. Instead, other less precise methods must be used such as fuzzy matching across multiple fields.

In this session we will provide an overview comparing options for linking data, including out-of-the-box options (e.g. MDM), code-heavy options (e.g. Spark) and other solutions in between. We will then introduce some of the theory used in probabilistic data linkage models and their differences from deterministic models. For our examples we will use Splink, an open-source Python package developed by the Ministry of Justice in the UK to implement fast probabilistic record linkage and deduplication at scale.

You might want to check out our in-depth session on splink, which you can find on youtube here:
https://www.youtube.com/watch?v=1ijNR3V4v3w

Splinking in a Winter Wonderland

Everyone knows, in his home full of ice. Santa lists children both naughty and nice.
But the lists aren't exclusive, most children are both. And the scripts made to pair them were as slow as a sloth.
So Santa decided a plot he would hatch. He'd ask for some new code to make records match.
He instructed his elves "Make it fast, make it cheap". "Fix it by Christmas" he said with a leap.
So the wisest of elves, he had a big think, he decided on Python and a package named splink!
It pairs up the records, with methods statistic, to match them all up through routes probabilistic!
The process is faster than the one of Saint Nicks'. We can run it on Synapse or perhaps Databricks.
And so in this session we'll tell you their tale. How the elves learnt to link records, not fail.
They know what is what, they can tell who is who, and after our story perhaps you can too!

What's What and Who's Who? Cross System Record Linkage with Fabric

Finding records representing the same real-world entity either in multiple sources or duplicated within a single data set is a common pain point when seeking to build a single definitive view of the truth in a data warehouse.
Variances in format, changes over time, and data entry errors mean connecting our Tom, Dick, and Harries to our Thom, Dirk, and Harriets can a big challenge.

This session introduces one approach to solving this problem using Fabric notebooks to run Splink, an open-source Python library developed by the Ministry of Justice in the UK to implement probabilistic record linkage.
You will be will introduced to the basics of deterministic, fuzzy, and probabilistic methods of matching records, and then we'll dive in with an end to end example using notebooks to create, apply and, analyse the results of a matching model.

By the end of the session you'll be ready to take what we've learned back home with you and start applying it to your own data.

How Do I Transform Thee? Let Me Count the Ways (in Fabric).

There are more tools and approaches available to transform your data in Microsoft Fabric than you can shake a spear at. From low and no code through to multiple full code-based options, sometimes themselves with multiple syntaxes. Choosing the wrong one could lead to a comedy of errors, so how do you make sure you end up with your data as you like it?
In this session we will take the common task of building a table of person data for a data model and implement it multiple times with a different approach each time including using SQL, Data Flows, Data Wrangler, PySpark and, simply asking Copilot to do it for us.
By the end of the session, you will have an overview of the data transformation tools Fabric has in its belt, which ones suit which tasks and audiences best along with the strengths and weaknesses of each. This will help you to choose the right tool for the right job without causing a tempest.

The Metadata Driven Data Warehouse

Tired of writing the same old Insert/Update/Delete statements over and over? Is your pipelines folder overflowing? Need to get your new data warehouse up and running as fast as possible? Simply need to figure out what the contents of a table looks like?

Metadata (data about data) comes to the rescue and in this session we’ll show you how.

We’ll walk through the process of building a data warehouse using meta data every step of the way. From analysing what’s in source systems through driving your pipelines and merging new data into the data warehouse all the way up to creating measures in Power BI.

By the end of the session you’ll be ready to start finding new ways to work with your data through the magic of metadata.

An Introduction to Window Functions

For anyone with a basic understanding of SQL or beyond, window functions are a key feature to open your querying options and take your code to the next level.
Most functions are like a holiday in Vegas, what goes on in a row stays in a row, but window functions throw the curtains wide open, making the rows in your queries aware of what happens around them.
In this session we'll start with the basics of syntax and simple examples such as sequencing rows and aggregating over different levels, then move on to more advanced uses including running totals, common query patterns and tricks Finally we'll consider performance impacts for better and worse.
We'll use SQL Server for our examples, but content will be adaptable to other dialects of SQL.

A DISTINCT Disadvantage

DISTINCT - The keyword that sweeps away your duplicate data worries or a performance time bomb waiting to happen?
This short session will look at what problems DISTINCT may cause, when to use it, when not to use it and the alternative approaches you can take that will perform better.

An Introduction to Querying XML With T-SQL

XML is everywhere, it's probably already in your database and sooner or later you're going to need to get at the data inside it.
This session will, in as short a time as possible, show you the key methods needed to start taking data out of a piece of XML and into your queries.

DAX Explained Through Dance, Memes and Dad Jokes

Moving beyond the basics of DAX can as tough as learning to tap-dance without falling in the sink. Several key concepts such as row and filter contexts that you will need to understand must be grasped before more complex expressions can be confidently written.
This session will help anyone who is looking to move their formulae up to the next level (or looking to support others in doing the same) by providing novel approaches to understanding these important concepts.
Together we will explore filter contexts with punchlines, learn how row contexts are like a dance, and discover what CALCULATE has in common with Oprah Winfrey.

T-SQL - Beyond the Basics

This session is aimed at anyone with basic level T-SQL experience or above looking to improve their code; especially when writing long, complex queries.
We will cover topics including getting the most out of Management Studio, Common Table Expressions, the APPLY operator, Window Functions, methods of pivoting and unpivoting data and more.
Through this you will expand your tool-belt of techniques to help you produce T-SQL code that's not just faster to run but also easier to read and faster to write.

European Microsoft Fabric Community Conference 2025 Sessionize Event

September 2025 Vienna, Austria

SQLBits 2025 - General Sessions Sessionize Event

June 2025 London, United Kingdom

Power BI & Fabric Summit 2025 Sessionize Event

February 2025

Data Toboggan - Winter Edition 2025 Sessionize Event

January 2025

New Stars of Data #8 Sessionize Event

October 2024

FabFest Sessionize Event

October 2024 Birmingham, United Kingdom

New Stars of Data #7 Sessionize Event

April 2024

SQLBits 2024 - General Sessions Sessionize Event

March 2024 Farnborough, United Kingdom

Festive Tech Calendar 2023 Sessionize Event

December 2023

New Stars of Data #6 Sessionize Event

October 2023

Data Relay 2023 Sessionize Event

October 2023

SQLBits 2023 - General Sessions Sessionize Event

March 2023 Newport, United Kingdom

Data Relay 2022 Sessionize Event

October 2022

PASS Data Community Summit 2021 Sessionize Event

November 2021

DataMinutes #1 Sessionize Event

June 2021

Data Céilí 2021 Sessionize Event

May 2021

Data Relay 2019 Sessionize Event

October 2019

Barney Lawrence

Consultant focusing on data analytics and engineering on the Microsoft platform

Chesterfield, United Kingdom

Actions

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Speaker

Barney Lawrence

Actions

Links

Area of Expertise

Topics

Sessions

(Almost) Everything You Wanted To Know About Purview Data Governance

Transfer Your T-SQL Skills to Spark in Fabric and Beyond

Data Linkage Options: MDM vs Splink (PySpark)

Splinking in a Winter Wonderland

What's What and Who's Who? Cross System Record Linkage with Fabric

How Do I Transform Thee? Let Me Count the Ways (in Fabric).

The Metadata Driven Data Warehouse

An Introduction to Window Functions

A DISTINCT Disadvantage

An Introduction to Querying XML With T-SQL

DAX Explained Through Dance, Memes and Dad Jokes

T-SQL - Beyond the Basics

Events

European Microsoft Fabric Community Conference 2025 Sessionize Event

SQLBits 2025 - General Sessions Sessionize Event

Power BI & Fabric Summit 2025 Sessionize Event

Data Toboggan - Winter Edition 2025 Sessionize Event

New Stars of Data #8 Sessionize Event

FabFest Sessionize Event

New Stars of Data #7 Sessionize Event

SQLBits 2024 - General Sessions Sessionize Event

Festive Tech Calendar 2023 Sessionize Event

New Stars of Data #6 Sessionize Event

Data Relay 2023 Sessionize Event

SQLBits 2023 - General Sessions Sessionize Event

Data Relay 2022 Sessionize Event

PASS Data Community Summit 2021 Sessionize Event

DataMinutes #1 Sessionize Event

Data Céilí 2021 Sessionize Event

Data Relay 2019 Sessionize Event

Barney Lawrence

Links

Actions