Session
Mastering Pandas: An Intermediate Course in Data Analysis and Manipulation
This two-day course is designed to provide intermediate-level training in data science with a focus on the pandas library. Pandas is a powerful open-source data analysis and manipulation tool built on top of the Python programming language.
Throughout the course, attendees will learn advanced techniques for manipulating and visualizing data with pandas, including how to work with large and complex datasets, optimize performance, and handle text and time series data.
By the end of this course, attendees will have a strong foundation in data science with pandas and will be able to use this powerful tool to perform a wide range of real-world data analysis tasks.
Agenda:
Day 1:
- Introduction to Pandas
- Series and DataFrames
- Indexing and selecting data
- Reading from files and other input sources
- Handling missing values
- Basic operations and statistical methods
- Manipulating Data
- Merging, joining, and concatenating data
- Grouping and pivot tables
- Reshaping and pivoting data
- Working with time series data
Day 2:
- Visualizing Data with Pandas
- Line plots, scatter plots, and bar plots
- Histograms and density plots
- Box plots and violin plots
- Heatmaps and pair plots
- Advanced Pandas Techniques
- Handling large and complex datasets
- Performance and optimization tips
- Working with text data and categorical data
- Advanced indexing and reshaping techniques
- Advanced groupby and aggregation operations
This course is designed to be accessible to those with no previous knowledge of the Python programming language. However, having some familiarity with programming concepts will be helpful.
Attendees should bring a laptop with python installed (3.7 or above).
Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.
Jump to top