PyData Global 2023

An Introduction to Pandas 2, Polars, and DuckDB
12-07, 13:00–14:30 (UTC), Data Track

Pandas, Polars, and DuckDB can influence outcomes like productivity, integration, and velocity. This tutorial offers an introduction to three Python libraries: Pandas 2, Polars, and DuckDB. Attendees will be provided with an opportunity not only to comprehend the functionalities of these libraries but also to engage in hands-on experimentation.


1. Overview (10 minutes)

2. Getting to Know Pandas 2.0 (20 minutes)
- Intro (5 minutes): What's new in Pandas 2.0?
- Interactive Exploration (10 minutes): Guided hands-on session to try out basic financial operations with Pandas 2.0.
- Q&A (5 minutes)

3. A Glimpse of Polars (20 minutes)
- Intro (5 minutes): Highlighting Polars and its potential.
- Interactive Exploration (10 minutes): Dive into Polars comparing some basic tasks with Pandas.
- Q&A (5 minutes):

4. Dipping our Toes into DuckDB (20 minutes)
- Intro (5 minutes): An introduction to DuckDB.
- Interactive Exploration (10 minutes): Participants get to try manipulating financial datasets with DuckDB.
- Q&A (5 minutes)

5. Group Exercise (15 minutes)
- Participants can play around with the introduced tools, experiment, and try out guided exercises.

6. Wrap-Up (5 minutes)


Prior Knowledge Expected

Previous knowledge expected

Matt Harrison spends most of his time teaching Python and Data Science. He has a CS degree from Stanford University. He is a best-selling author on Python and Data subjects. His books Effective XGBoost, Effective Pandas, Illustrated Guide to Learning Python 3, Intermediate Python, Learning the Pandas Library, and Effective PyCharm have all been best-selling books on Amazon. He has taught courses at large companies (Netflix, NASA, Verizon, Adobe, HP, Exxon, and more), Universities (Stanford, University of Utah, BYU), and small companies. He has been using Python since 2000 and has taught thousands through live training, both online and in person.