Paul Andrew

Co-Founder & CTO of Cloud Formations | Microsoft MVP

Derby, United Kingdom

Paul (AKA @mrpaulandrew) is the Co-Founder & CTO of Cloud Formations, a specialist data consultancy based in the UK. With nearly 20 years’ experience designing and delivering Microsoft data architectures, Paul leads a passionate team of engineers, supporting businesses small and large with scalable cloud platforms that deliver business value through data insights. Over the years, Paul has covered the breadth and depth of design patterns and industry-leading concepts, including Lambda, Kappa, Delta Lake, Data Mesh and Data Fabric.

Paul is also a Microsoft Data Platform MVP, director for the Data Relay community conference, East Midlands user group leader, book author and mentor. In addition to the day job(s), Paul is a father of three, husband, foodie, runner, blood donor, geek, and Lego and Star Wars fan! Lastly, Paul confesses to enjoying a Rammstein playlist when given half a chance to do some coding for a customer project.

Area of Expertise

Information & Communications Technology

Topics

Azure Data Platform
data mesh
Big Data
Analytics and Big Data
Azure Data Factory
Azure SQL Database
data engineering
Azure Data & AI
Azure Data Lake
Data Platform
Data Warehousing
Microsoft Data Platform
Data Analytics
All things data
Data Visualization
Databases

Simplified File Ingestion With Microsoft Fabric Open‑Mirroring - Design & Build

In this full-day, hands‑on workshop we’ll explore how you can simplify file ingestion in Microsoft Fabric by applying the concepts of a landing‑zone architecture with the implementation of Open‑Mirroring items, designing and building end‑to‑end patterns to simplify structured and unstructured uploads. We all encounter those businesses still relying on Excel as a data source! This is how we tackle it, with Open‑Mirroring as an easy-to-implement ingestion pattern for your analytics solutions.

We’ll start from the high‑level design and implement a landing zone workspace with constrained access and data retention processes. Then we’ll integrate file source handlers, including a local file watcher service to push datasets into Fabric, alongside standard data gateway ingestion processes, each driving files into Fabric in a controlled, repeatable way.

You’ll configure an Open‑Mirroring endpoint (an open mirrored database) as the ingestion destination, then use Lakehouse storage to stage data, validating it with data contracts before promoting it into a data curation pattern with bronze, silver and gold layers.

Along the way, we’ll compare theory vs practice, exploring schema drift, data contract enforcement, push vs pull ingestion patterns and scaling ingestion throughput for high volume, high velocity file changes.

We’ll finish by packaging changes through deployment pipelines and reviewing operational guardrails with logging and alerting that help teams move fast without sacrificing governance.

If you want a pragmatic blueprint to move Excel/CSV/Access and similar sources into Fabric with less friction, this is the session for you. You’ll leave with working Fabric assets, design patterns and the knowledge to reuse this simplified approach for those businesses still running on Excel!

Fabric SQL Databases - Use Cases In Your Analytics Platform

Join me for a session exploring practical use cases for integrating Microsoft Fabric SQL Databases into your analytics platform.

We will showcase how Fabric SQL Databases can serve as a supplementary component in the delivery of wider data analytics outputs, as part of the unified product capabilities.

Considering:
• Privacy Storage
• Metadata Management
• Logging and Alerting
• Reference Data Management (MDM)
• Backends for OLTP Systems (PowerApps)

Whether you're building a new analytics platform or migrating an existing one into Microsoft Fabric, this session will provide actionable insights and architectural patterns to help you get the most out of Microsoft Fabric SQL Database to supplement your solution architecture.

Evolving Azure Real-Time Data Feeds Into Fabric Event-Streams

Many businesses have built real-time data feeds on Azure to capture events at scale and drive timely decisions, but those solutions often relied on a patchwork of resources to ingest and process data, making the overall component architecture harder to govern. As the velocity and volume of event data continue to rise, teams want simpler, more unified patterns for near real-time analytics without sacrificing throughput, reliability, or cost control.

In this session, we’ll show how to evolve established Azure real-time feed patterns into Microsoft Fabric Real-Time. We’ll walk through reference architectures for high-throughput event workloads, highlight what changes (and what doesn’t) when you move to Fabric-native streaming, and demonstrate how to blend streaming and batch data for downstream reporting and operational decision-making using both SQL and KQL.

We’ll focus on the Fabric items you’ll use to deliver near real-time outcomes, covering Eventstreams to ingest, filter, enrich, and route data, before handing off to Eventhouse to store and query event data at speed with KQL. Finally, we’ll explore Real-Time dashboards to turn streaming information into shared, actionable views.

You’ll leave with a practical mapping from Azure real-time feeds to a unified Fabric design, so you can modernise confidently, reduce architectural sprawl, and standardise how real-time data is delivered across your platform.

Fabric Data Activator: Real-Time Data Feeds, Automated Alerts & Stock Intelligence

In today’s fast-paced retail environment, the ability to respond instantly to sales trends and stock fluctuations is critical for maintaining customer satisfaction and maximising revenue. This session explores how Microsoft Fabric’s Real-Time Intelligence (RTI) data capabilities empower retailers, amongst others, to monitor product sales and inventory levels, triggering automated alerts and actions through Data Activator.

We’ll dive into a practical retail scenario: imagine a chain of stores where every transaction and stock movement is streamed into Fabric’s Real-Time hub. Using Eventstreams and Eventhouse databases, data is ingested and processed at scale, blending both live and historical feeds with SQL and KQL. Data Activator then monitors these feeds for critical thresholds, such as sudden spikes in sales or low stock levels, and automatically alerts store managers and triggers replenishment workflows.

Designing & Delivering Data Products: From Mesh Principles to Data Fabric Automation

Modern data teams are asked to ship reliable, reusable data products, not just pipelines, across both operational and analytical domains. In this session we will explore how to define, build, and govern data products using cloud‑native patterns, blending data mesh principles (domain ownership, product thinking, federated computational governance) with data fabric concepts (active metadata, automation, and intelligent integration).

We’ll walk through a pragmatic blueprint for productizing data: how to establish clear product contracts for schemas and lineage, and how to make operational and analytical flows first‑class citizens when building event streaming, CDC, Lakehouse and warehouse solutions. We’ll embed data governance by design, enriched with business metadata, so teams move fast without breaking compliance.

Expect actionable patterns and architecture examples. Whether you’re an architect defining domains and standards or an engineer delivering pipelines and notebooks, you’ll leave with a reusable checklist and reference architecture to accelerate your data product portfolio at scale, with Microsoft data platform friendly examples throughout.

Data Modelling: The Lost Art of Turning Inputs into Insights

In today’s data-driven world, businesses pour vast resources into data engineering pipelines, platforms, and processing, often overlooking the true value driver in the solution: data modelling. Without well-defined models, data remains a raw asset rather than a business intelligence enabler. In this session, we will remind ourselves of the foundational principles of dimensional modelling and agile data warehouse development, drawing on the timeless work of Ralph Kimball, Lawrence Corr, and Bill Inmon.

From a data architect’s perspective, we’ll explore why modelling is not just a technical step but a business-critical discipline that transforms inputs into actionable outputs. You’ll learn how robust data entities bridge the gap between engineering and analytics, enabling clarity, consistency, and scalability in delivering valuable insights.

We’ll also discuss practical approaches for embedding modelling into modern data strategies, whether you’re building a data warehouse, a Lakehouse, or a semantic layer.

Build a Lakehouse in a Day with Metadata & Open-Source Tools

Unlock the power and speed of a metadata-driven Lakehouse architecture.

In the fast-paced, data-driven world, the ability to swiftly and efficiently deliver a robust data platform is key to maintaining a competitive edge. Join us for an immersive, full-day hands-on workshop, where we will guide you through the process of building a metadata-driven Lakehouse using the open-source product framework known as CF.Cumulus, which leverages and abstracts Microsoft cloud native technologies to ease delivery challenges.

During this workshop, participants will get an in-depth understanding of how CF.Cumulus can integrate Azure Data Factory, Azure Databricks, Azure Synapse Analytics, Microsoft Fabric and other resources to streamline data insight deliveries.

Our expert instructors will provide practical insights on overcoming common data challenges, including fragmented data ingestion, change data capture, and orchestration scalability, using our proven best practices.

Attendees will learn how to utilise metadata, open standards, and seamless cloud integration to accelerate time-to-insight with minimal technical debt, ensuring cost control and operational resilience. This workshop is ideal for both data engineers and data leaders who are looking to enhance their cloud data platform delivery and unlock the potential to build a Lakehouse in a day using a metadata-driven approach.

Data & Community: An Amazing Network Of Peers Supporting Innovation & Growth

When I look back on my career in data, one thing stands out above all else: the incredible network of peers who have inspired, challenged, and collaborated with me along the way. In this autobiographical session, I’ll share my story as a Microsoft Data Platform MVP and CTO of Cloud Formations, talking about how curiosity led me into the world of data, and how the technology community became the catalyst for every major milestone in my career. From late-night problem-solving with fellow experts to building solutions that push boundaries, sharing knowledge, building open-source code projects and more, I’ll explore how community relationships have driven innovation and my personal growth. If you’ve ever wondered how community can transform your career, this is a conversation you won’t want to miss. I’ll close by sharing my passion and thanks for the community we have created together.

Build a Lakehouse in an Hour with CF.Cumulus

In today’s data-driven world, fast and efficient data platform delivery is crucial for staying ahead of the competition. Join us for an exclusive session hosted by Cloud Formations, where we will demonstrate how our CF.Cumulus product can help you build a metadata-driven Lakehouse using Microsoft cloud native technologies. With CF.Cumulus, leverage Azure Data Factory, Azure Databricks, Azure Synapse Analytics, or Microsoft Fabric to streamline and optimize your data infrastructure.

Discover how Cloud Formations can help you simplify and overcome common obstacles such as fragmented data ingestion, change data capture, and orchestration scalability using our proven best practices. Learn how to leverage automation, open-standards, and seamless cloud integration to accelerate time-to-insight with minimal technical debt. This session is perfect for techies and data leaders seeking to streamline their cloud data platform delivery while maintaining cost control and operational resilience. In summary, unlock the potential to build a Lakehouse in a day with Cloud Formations’ CF.Cumulus product accelerator.

How Can Microsoft Fabric Have An Impact On Your Business

All this talk about Data-Ware-Lake-Delta-Beach-House-Lakes (or some combination of that) and Data, Yarn, Fabric integration, everything has got a bit… Meshy! Yes, my friends, the beat of the technology drum is certainly relentless, with no-limits cloud scale and huge innovations from the biggest brains. Two years, it seems, has become the benchmark for tools to live and die by. Reach three years and you almost have a mature product. That said, Microsoft Fabric, the latest offering from the global software giant, is no exception. But what does this mean for the real world, for the data analysts, engineers and scientists that need to continue answering everyday problems to inform business decisions? In this session we will firmly ignore the hype and focus on the reality, with the pragmatic view of an experienced architect. The problem of gaining insights from our data hasn’t changed. So, what does this mean if implemented using Microsoft Fabric? What, why and how is the tooling going to change our daily deliverables in the short, medium and long term? Join me for these answers and more as we explore the impact of Microsoft Fabric-Server, erm, Power. Resource. Thing!

Building Near Real-time Data Solutions in Microsoft Azure & Fabric

The velocity of data is getting faster across many industries, fuelled by the business demand to gain insights and value from sources in near real-time. This necessity is then allowing decision makers to pivot and ultimately stay ahead of the competition. Furthermore, the growth of the internet of things and ‘smart’ devices now means the volume of that high velocity data has exploded. Meeting this demand requires new concepts and new designs for data/solution architects, with high throughput ingestion endpoints and query stream tools that can perform aggregations ‘on the fly’.

In this session, we will address the above head on, discussing and designing architectures that can scale and burst for high throughput events, and querying using both SQL and KQL to blend stream and batch data feeds for downstream reporting.

As a platform, in Azure we’ll explore Event Hub and Stream Analytics to ingest and handle that initial data stream, before applying the same patterns to other resources in Microsoft Fabric with an Event Handler and Real-time Dashboards through the Event House. Throughout, we’ll separate the patterns to apply as an architect from the tooling available for delivery.

Microsoft Fabric Platform Governance - Where To Start

In this session, we will explore the administration of Microsoft Fabric, with a focus on the organisation and management of data storage/compute through the configuration of capacities, domains, data products, environments and workspaces. We will discuss the application of data mesh and data fabric concepts in the context of Microsoft Fabric capabilities, including organising data products for effective delivery to business users. We will also consider how Microsoft Fabric compares to the practices and technical standards we’ve applied in Microsoft Azure over the years, making that shift from PaaS to SaaS, in theory and maybe in practice.

Additionally, we will explore how to use domains and separate workspaces to serve reports to business users, providing them with the information they need to make informed decisions, while aligning industry governance standards to Microsoft Fabric features and access controls. Join us to learn how to effectively administer Microsoft Fabric beyond the simple Workspaces inherited from Power BI.

Fast-Track Your Lakehouse Build with a Metadata Framework

In today’s data-driven world, fast and efficient data platform delivery is crucial for staying ahead of the competition. Join me for a dynamic session that demonstrates how to build a metadata-driven Lakehouse with Microsoft cloud native technologies, using your preferred compute and storage resources: Azure Data Factory, Azure Databricks, Azure Synapse Analytics or Microsoft Fabric.

Discover how to simplify and overcome common obstacles such as fragmented data ingestion, change data capture, and orchestration scalability using proven best practices. Learn how to leverage automation, open standards, and seamless cloud integration to accelerate time-to-insight with minimal technical debt. This session is perfect for techies and data leaders alike, seeking to streamline their cloud data platform delivery while maintaining cost control and operational resilience. In summary, unlock the potential to build a Lakehouse in a day by using an open-source, metadata-driven product accelerator.

Harnessing the Power of Apache Spark & Delta Lake in the Microsoft Data Ecosystem

Apache Spark is a powerful distributed compute engine that has become an industry-leading solution for data processing. In this session we’ll firstly introduce Apache Spark, exploring its core concepts and capabilities. Then we will discuss how Apache Spark can be implemented using various Microsoft products, including Azure Databricks, Azure Data Factory, Azure Synapse Analytics, and Microsoft Fabric, to build robust, scalable data processes that perform advanced analytics and drive business insights.

We’ll then explore combining Apache Spark with the open standard Delta Lake to offer a comprehensive solution that addresses both compute and storage aspects, allowing us to create a complete cloud-native data platform. Delta Lake enhances Spark's capabilities by providing ACID transactions and scalable metadata handling for both streaming and batch workloads. This integration ensures data reliability and consistency while enabling efficient large-scale data operations. Together, Apache Spark and Delta Lake facilitate the construction of resilient data pipelines, allowing businesses to leverage their data assets fully and achieve seamless data integration, transformation, and analytics within the Microsoft data ecosystem.

An Introduction to SQL – ANSI, Transactional and Spark Flavoured

This session will provide a foundational introduction to SQL, tailored specifically for new data engineers working within the Microsoft ecosystem of data platform tools. We will begin with the very basics of the ANSI standard, ensuring that participants gain a solid understanding of SQL as a language. We will explore the essential concepts of SQL, including data retrieval, modification, and manipulation, all while becoming familiar with the standard syntax and commands that form the basis of SQL across various platforms.

As we progress, we will delve into the specifics of T-SQL and Spark SQL, exploring how these dialects have evolved within the Microsoft SQL family of products and Apache Spark environments. Attendees will gain insights into the unique features and capabilities of each variant, learning how to effectively leverage T-SQL for powerful data management and analysis within SQL Server, as well as how to harness the scalability and performance of Spark SQL for big data processing in Spark implementations. By the end of this session, participants will have the skills and knowledge to confidently structure and query data with the language.

Building a Microsoft Cloud Analytics Platform End-to-End

In today's rapidly evolving data landscape, staying ahead of the curve is essential for data professionals. This talk will explore the latest advancements in building an end-to-end Microsoft focused cloud analytics platform, leveraging current technologies and industry best practices, and delving into the patterns and capabilities of both Microsoft Fabric and Microsoft Azure.

No longer can we rely on products matured over a decade to deliver all our solution requirements. Today, data platform architectures designed with best intentions and known design patterns can go out of date within months. That said, is there now a set of core components we can utilise in the Microsoft cloud to ingest, curate and deliver insights from our data? When does ETL become ELT? When is IaaS better than PaaS, and what SaaS products can help? Do we need to consider scaling up or scaling out? And should we start making cost the primary factor for choosing certain technologies?

In this session we'll explore the answers to all these questions and more from an architect’s viewpoint. Based on real-world experience, let’s think about just how far the breadth of our knowledge now needs to reach when starting from nothing and building a complete Microsoft cloud analytics platform.

Data Integration Pipelines – The Fundamentals

Join us for this fundamentals session where we will cover the essentials of data platform orchestration using Microsoft's suite of tools, including Azure Data Factory, Synapse Analytics and Microsoft Fabric. We will learn to construct control flow and data flow components, developing end-to-end processing pipelines. Starting with the basics, we will explore how these cloud-native resources have evolved and build pipelines that ingest data from various sources, transform this data, and make it available to consumers.

Throughout the session, we will delve into the integration pipeline tools within a highly scalable cloud-native architecture, addressing key aspects such as triggering, monitoring, dynamic pipeline content, and CI/CD practices. This session will provide a clear understanding of data ingestion, integration, and orchestration, equipping attendees with the knowledge to implement these practices in their roles as data professionals. By the end of the session, participants will have gained the foundational skills required to apply their new expertise in Azure Data Factory and Microsoft Fabric to real-world scenarios.

An Introduction to Delta Lake and The Lakehouse

Once upon a time, there was a data warehouse, and it lived happily as a set of tables within our relational database management system (RDBMS) called Microsoft SQL Server. The data warehouse had three children known as extract, transform, and load. One day a blue/azure coloured cloud appeared overhead, and it started to rain. The data warehouse got wet and was never the same again! Or was it? Spoiler alert, the data warehouse is the same, still happy, and well, it just evolved and moved from its RDBMS home to a new home in the cloud. The end!

In this session, we'll look at the evolution of the data warehouse and understand how we can now deliver the same data engineering concepts for our solutions on the Microsoft cloud platform using the open-source Delta.io standard, in Azure and Fabric. We'll introduce the standard (originally developed by Databricks) and then explore the implications it has for our next-generation cloud data warehouse.

The original data warehouse set of tables remains, but now they are delivered using the cloud-native Delta Lake technology with distributed storage/compute as standard. Delta.io gives us those much-needed ACID properties over our data lakes, meaning our data warehouse understanding can move to the cloud and is made easier within Azure. The data warehouse just grew up and became a Delta Lake-House.

An Evolution of Cloud Data Architectures - Lambda, Kappa, Delta, Mesh & Fabric

How have advancements in highly scalable cloud native technology influenced the design principles we apply when building data platform solutions? Are we designing for just speed and batch layers or do we want more from our platforms, and who says these patterns must be delivered exclusively?

Let’s disrupt the theory and consider the practical application of all things Microsoft now has to offer, where concepts, patterns, and best practice meet/clash with technology. Can we now utilise cloud technology to build architectures that cater for lambda, kappa, and mesh concepts in a complete stack of services? And should we be considering a solution that offers all these principles in a nirvana of data insight, highly scalable, decoupled perfection? Lastly, how does Data Fabric as a concept fit with Microsoft Fabric as a product and should we decentralise everything as suggested by the data mesh!?

In this session we’ll explore the answer to all these questions and more in a thought-provoking, argument-generating look at the challenges every data platform engineer/architect faces.

Data Saturday Gothenburg 2023 Sessionize Event

August 2023 Göteborg, Sweden

Data Platform Next Step 2023 Sessionize Event

June 2023 Billund, Denmark

Techorama 2023 Belgium Sessionize Event

May 2023 Antwerpen, Belgium

SQLDay 2023 Sessionize Event

May 2023 Wrocław, Poland

dataMinds Connect 2022 Sessionize Event

October 2022 Mechelen, Belgium

Data Relay 2022 Sessionize Event

October 2022

Future Data Driven Summit 2022 Sessionize Event

September 2022

DATA BASH '22 Sessionize Event

September 2022

DATA:Scotland 2022 Sessionize Event

September 2022 Glasgow, United Kingdom

SQLBits 2022 Sessionize Event

March 2022 London, United Kingdom

Virtual Scottish Summit 2021 Sessionize Event

February 2021

dataMinds Connect 2020 (Virtual Edition) Sessionize Event

October 2020 Mechelen, Belgium

Data Platform Discovery Day Europe Sessionize Event

April 2020

Data Relay 2019 Sessionize Event

October 2019

DATA:Scotland 2019 Sessionize Event

September 2019 Glasgow, United Kingdom

DataGrillen 2019 Sessionize Event

June 2019 Lingen, Germany

Global Azure Bootcamp 2019 Sessionize Event

April 2019 Birmingham, United Kingdom

Intelligent Cloud Conference 2019 Sessionize Event

April 2019 Copenhagen, Denmark

Global Azure Boot camp - Birmingham UK Sessionize Event

April 2018 Birmingham, United Kingdom
