Speaker

Michael Victor

Michael Victor

Consultant - data engineering and data science at Cobalt Analytics

Pretoria, South Africa

Actions

I am a Consultant in Data Engineering and Data Science, passionate about solving complex data challenges and enabling businesses to leverage the power of analytics. With over a decade of professional experience, I specialise in designing scalable data pipelines, optimizing work flows, and building predictive models that drive strategic decision-making.

Area of Expertise

  • Business & Management
  • Information & Communications Technology

Topics

  • Data Science & AI
  • Data Science
  • Data Engineering

Clustering data in 20 min

Clustering is an often overlooked yet powerful tool for uncovering hidden patterns in data. This session highlights practical use-cases for clustering, from customer segmentation to anomaly detection, and provides a concise overview of key clustering techniques. Whether you're new to clustering or looking to enhance your data exploration tool kit, this session will equip you with the knowledge to apply clustering effectively in your projects.

Great expectations for your data quality

Poor data quality is a pervasive issue, with studies estimating its cost at up to 25% of an organization's operating profit. Despite these staggering numbers, businesses often accept data quality issues as an inevitable reality.

While initial testing efforts during deployment is common, data quality tends to degrade over time and therefore requires sustained attention. This session introduces a practical framework to combat this challenge, leveraging ‘Great Expectations’, an open-source Python library designed for data quality testing.

The talk begins with an accessible discussion on the importance of maintaining high data quality and the foundational principles of the proposed framework. In the second part, we delve into a technical walkthrough of implementing Great Expectations in real-world scenarios. This session is ideal for anyone who uses or works with data, and attendees with a basic understanding of Python will gain the most from the hands-on examples.

Discover how you can turn the tide on data quality and drive better outcomes for your organization

Quick Guide to Data Quality with Great Expectations

Discover the essentials of data testing and quality evaluation in this concise overview of Great Expectations. Learn how to implement basic data validation tests and establish data quality benchmarks quickly and effectively. This session is perfect for data professionals looking to enhance their workflows with reliable data quality practices

The Power of Naming: Setting up a Naming Convention for Success

Naming things is a challenge, but everything requires a name. Naming is one of the most basic actions you must take when creating something. A name is more than just what something is called, it’s a circular dependency, things should be named according to what they are, but what they are called affects what they become. Naming things is in essence abstracting or encapsulating everything that that name refers to into a single word or phrase, therefore, good names can bring clarity to complex things and incorrect names can confuse even the simplest.

This session discusses the purpose of naming and unpacks current naming conventions for tables, columns, variables and artifacts, adapting where applicable and disregarding established conventions. The focus will on be improving readability and clarity.

SQLBits 2025 - General Sessions Sessionize Event

June 2025 London, United Kingdom

Data and AI Community Day August 2024 Sessionize Event

August 2024 Johannesburg, South Africa

Michael Victor

Consultant - data engineering and data science at Cobalt Analytics

Pretoria, South Africa

Actions

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Jump to top