Important Documents
- Syllabus (This is currently about as drafty as a zero-sided box.)
Slide Decks and Extra Materials
- Getting Started
- Intro Slides
- Setting Up Tools – Complete all of this ASAP, def by Monday.
- Data Wrangling and EDA
- Data Wrangling High Level Overview
- Pandas
- Part 01 Slides
- Dataset – Baseball Databank (clone this repo to your computer)
- Notebook for today can be found > here <.
- My Fav Pandas Cheat Sheet can be found > here <.
- Part 02 Slides
- Notebook for today can be found > here <.
- Part 01 Slides
- Visualization
- Matplotlib
- Watch this fantastic Intro to Matplotlib.
- You can find the associated Github repo > here <.
- Matplotlib
- The Tidy Data Movement
- EDA Putting It All Together
- Modeling
- Regression
- E2E ML Project Example: See Introduction to the Machine Learning Process for details on what to do for class Wednesday Feb 23, 2022.
- You’ll submit your Jupyter Notebook (described in the above link) on Canvas for a Homework grade by Monday class time.
- More Regression Practice
- Bike Sharing Data Set – Can you forecast the bike rental demand for the Washington DC Bike Sharing program?
- ML Flash Cards
- Regression
- Deployment
Projects
- Project 1 – Predict the Population
- Project 2 – Data Wrangling and EDA
- Project 3 – Choose Your Own Adventure
- Matrix is > here <.
- Project 4 – The Finale
- Handout is > here <.
Homeworks
- TBA