Google TPU-First, GPU-Second: Understanding Data Encoding,Arrow & Parquet’s Impact on Training

Most AI companies shout about GPUs. But here’s the truth: Google TPUs are the real cheat code. It’s not just about raw FLOPs or benchmark chasing, it’s about pipelines. When you design your machine learning stack TPU-first, GPU-second, you unlock massive gains in price and performance. This talk starts at the roots, by building an understanding of machine mentality. We’ll explore foundational concepts like Instructions Per Cycle (IPC) and vector processing, not just as theoretical metrics but as tools to reshape how you prepare, encode, and move data through your pipeline. 

Even if you're new to data encoding, Arrow, Parquet, JAX, or TPUs, you’ll walk away with clear mental models and practical insights on how these pieces connect to build performant ML systems.

Full Title - Google TPU-First, GPU-Second: Understanding Data Encoding—Arrow and Parquet’s Impact on Training Pipelines

Rif Kiamil

I teach and educate about Data, SQL & Google BigQuery

London, United Kingdom

Actions

View Speaker Profile

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Session

Google TPU-First, GPU-Second: Understanding Data Encoding,Arrow & Parquet’s Impact on Training

Rif Kiamil

Links

Actions