Session

Mastering Pandas: An Intermediate Course in Data Analysis and Manipulation

This two-day course is designed to provide intermediate-level training in data science with a focus on the pandas library. Pandas is a powerful open-source data analysis and manipulation tool built on top of the Python programming language.

Throughout the course, attendees will learn advanced techniques for manipulating and visualizing data with pandas, including how to work with large and complex datasets, optimize performance, and handle text and time series data.

By the end of this course, attendees will have a strong foundation in data science with pandas and will be able to use this powerful tool to perform a wide range of real-world data analysis tasks.

Agenda:

Day 1:
- Introduction to Pandas
- Series and DataFrames
- Indexing and selecting data
- Reading from files and other input sources
- Handling missing values
- Basic operations and statistical methods
- Manipulating Data
- Merging, joining, and concatenating data
- Grouping and pivot tables
- Reshaping and pivoting data
- Working with time series data

Day 2:
- Visualizing Data with Pandas
- Line plots, scatter plots, and bar plots
- Histograms and density plots
- Box plots and violin plots
- Heatmaps and pair plots
- Advanced Pandas Techniques
- Handling large and complex datasets
- Performance and optimization tips
- Working with text data and categorical data
- Advanced indexing and reshaping techniques
- Advanced groupby and aggregation operations

This course is designed to be accessible to those with no previous knowledge of the Python programming language. However, having some familiarity with programming concepts will be helpful.

Attendees should bring a laptop with python installed (3.7 or above).

Sebastian Roll

Co-founder ByteBarista

Trondheim, Norway

Actions

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Jump to top