Session

CI/CD on the Data Lakehouse with Project Nessie

Continuous Integration and Continuous Delivery (CI/CD) is a software development practice that aims to improve the quality and speed of software delivery. In a data lakehouse environment, CI/CD can be used to automate ingesting, transforming, and loading data.

Project Nessie is an open-source project that provides a Git-like approach to version control for data lakehouse tables. Project Nessie can be used to implement CI/CD for data lakehouse environments by providing a way to track changes to data over time and to automate the process of deploying changes to production.

In this presentation, we will discuss the benefits of implementing CI/CD in a data lakehouse environment and how Project Nessie can achieve this. We will also discuss some of the challenges of implementing CI/CD in a data lakehouse environment and how to overcome them.

Key takeaways:

- CI/CD can be used to improve the quality and speed of software delivery in a data lakehouse environment.

- Project Nessie is an open-source project that can be used to implement CI/CD for data lakehouse tables.

- There are a number of challenges to implementing CI/CD in a data lakehouse environment, but these challenges can be overcome.

Alex Merced

Developer Advocate for Dremio

Orlando, Florida, United States

Actions

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Jump to top