Session

Apache Arrow and Go: A match made in Data

Apache Arrow is fast becoming a standard for working with data, but most people know it primarily through the Python, C++ and Java libraries. This talk instead focuses on the Go implementations of Apache Arrow and Parquet. Go's concurrency primitives make it well suited to building efficient pipelines that process large amounts of data in parallel.

This talk covers getting started with the Go Arrow and Parquet libraries and building a simple data pipeline. It touches on reading and writing CSV and Parquet data with the Go Arrow modules, and on why you'd want to use Go in the first place rather than other languages or implementations.

Matt Topol

Author of "In-Memory Analytics with Apache Arrow" | Staff Software Engineer at Voltron Data

Norwalk, Connecticut, United States
