The Ins & Outs of Data Lakehouse Versioning at the File, Table, and Catalog Level

Data lakehouse versioning is a critical technique for ensuring the accuracy and reliability of data in a data lakehouse. It allows you to track changes to data over time, which can be helpful for troubleshooting problems, auditing data, and reproducing experiments.

This presentation will explore the ins and outs of data lakehouse versioning. We will discuss the different levels of versioning, including catalog, file, and table-level versioning. We will also discuss the benefits of data lakehouse versioning and the pros and cons of each type of versioning.

By the end of this presentation, you will have a better understanding of data lakehouse versioning and how it can be used to improve the accuracy and reliability of your data.

Key takeaways:

- Data lakehouse versioning is a critical technique for ensuring the accuracy and reliability of data in a data lakehouse.

- There are three levels of data lakehouse versioning: catalog, file, and table level versioning.

- Each type of versioning has its own benefits and drawbacks.

- Data lakehouse versioning can be used to troubleshoot problems, audit data, and reproduce experiments.

Alex Merced

Developer Advocate for Dremio

Orlando, Florida, United States

Actions

View Speaker Profile

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Session

The Ins & Outs of Data Lakehouse Versioning at the File, Table, and Catalog Level

Alex Merced

Links

Actions