Data Exploration with Python and Jupyter¶
Jupyter notebooks are a great tool for exploring and interacting with data using the Python programming language and its rich ecosystem of libraries.
In this course we will cover basic usage of the Pandas library to download a dataset, explore its contents, clean up missing or invalid data, filter the data according to different criteria, and plot visualizations of the data.
These slides are actually also generated from Jupyter notebooks!
Press Spacebar
to go to the next slide (or ?
to see all navigation shortcuts)
During the course¶
The course will be hands on - we'll create Jupyter notebooks together on our laptops.
The notebooks I edit during the course will be automatically updated as they are edited at:
It may be helpful to keep the above link open in a browser tab during the course.
Course materials¶
The completed notebooks can later be viewed as slides, used in an interactive colab session online, or downloaded:
- Part 1: Python and Jupyter - online slides, colab interactive notebook, read-only notebook
- Part 2: Pandas with toy data - online slides, colab interactive notebook, read-only notebook
- Part 3: Pandas with real data - online slides, colab interactive notebook, read-only notebook