M6: Data Cleaning & Visualization

This module introduces data cleaning techniques and data visualization.

Pre-recorded Lectures

The pre-recorded lectures are available here. You can also find the videos under the “Panopto” tab on the CAPP 30122 Canvas site.

  • 6.1 - Data Cleaning (Part 1)

  • 6.2 - Data Cleaning (Part 2)

  • 6.3 - Data Visualization with Seaborn (Part 1)

  • 6.4 - Data Visualization with Seaborn (Part 2). This video is long, but Panopto breaks it down by timestamps, so you can watch only the parts that are relevant to your project.

Resources

– Home Mortgage Disclosure Act example from class (check your repositories under modules/m6)

– John Canny’s slides on Data Cleaning and Integration

Zoom Sessions

You will find the links to the Zoom sessions on Canvas.

  • Week 6
    • Wednesday, February 17th: Data Cleaning (Lab)

    • Friday, February 19th: Data Cleaning & Visualization (extended example, Q&A)

Programming Assignment

No PA will be assigned during Week 6. This gives you the opportunity to spend more time on your group projects. The next PA will be assigned at the end of Week 7.