CS109 Data Science
Class Material
Lectures: You can obtain all the lecture slides at any point by cloning 2015, and using git pull as the weeks go on.
Videos: You can see the entire list of videos here. Below we list them by class/section along with a link to the slides.
We’ve also started a YouTube channel for cs109. This channel has smaller videos dealing with nitty gritty stuff on the course.
Labs/Sections: Note that the Lab video and notebook is actually recorded and produced on the Thursday and Friday of the previous week, but is listed under the week that sections pertaining to the material on the Lab are given.
Week 1 (Mon Aug 31 - Fri Sep 4)
Lecture 1: Course Overview  
Slides: On Github, Download   
Video: SD, Mobile
Week 2 (Mon Sep 7 - Fri Sep 11)
Lab 1: Pandas, Python, and Github  
Repository: 2015Lab1  
Video: HD (recommended), SD, Captions, Mobile  
Lecture 2:  
Repository: 2015. The specific notebook  
Video: Web Scraping. Regular Expressions. Data Reshaping. Data Cleanup. Pandas. HD, SD, Mobile  
Lecture 3: Exploratory Data Analysis  
Slides: On Github, Download  
Video: HD HD-captions
Week 3 (Mon Sep 14 - Fri Sep 18)
Lab 2: Scraping, Pandas, Python, and viz  
Repository: 2015Lab2  
Video: HD, HD-captions  
Lecture 4: Pandas, SQL, and the Grammar of Data  
Repository: 2015. The specific notebook  
Slides: On Github,Download  
Video: HD, HD-captions  
Lecture 5: Statistical Models  
Slides: On Github, Download  
Video: HD, HD-captions
Week 4 (Mon Sep 21 - Fri Sep 25)
Lab 3: Probability, Distributions, and Frequentist Statistics  
Repository: 2015Lab3  
Video: HD, HD-captions  
Lecture 6: Story Telling and Effective Communication  
Slides: On Github, Download  
Video: HD, HD-captions  
Lecture 7: Bias and Regression  
Slides: On Github, Download  
Video: HD, HD-captions
Week 5 (Mon Sep 28 - Fri 2 Oct)
Lab 4: Regression, Logistic Regression: in sklearn and statsmodels  
Repository: 2015Lab4  
Video: HD, HD-captions  
Lecture 8: More Regression  
Slides: On Github, Download  
Video: HD, HD-captions  
Lecture 9: Classification. kNN. Cross Validation. Dimensionality Reduction. PCA. MDS.  
Slides: On Github, Download  
Video: HD, HD-captions
Week 6 (Mon Oct 5 - Fri 9 Oct)
Lab 5: Machine Learning  
Repository: 2015Lab5  
Video: HD: Learning a Model, Learning a Model-CAPTIONS HD-Classification,Classification-CAPTIONS  
Lecture 10: SVM, Evaluation.  
Slides: On Github, Download  
Video: HD, HD-Captions  
Lecture 11: Decision Trees and Random Forests  
Slides: On Github, Download  
Video: HD,HD-Captions
Week 7 (Mon Oct 12 - Fri 16 Oct)
Lab 6: Machine Learning 2
Repository: 2015Lab6  
Video: HD: Classification Redux, Classification Redux-CAPTIONS, HD: Comparing Models,Comparing Models-CAPTIONS  
Lecture 12: Ensemble Methods.  
Slides: On Github, Download  
Video: HD, HD-CAPTIONS  
Lecture 13: Best Practices      
Slides: On Github, Download  
Video: HD,HD-Captions
Week 8 (Mon Oct 19 - Fri 23 Oct)
Lab 7: Ensembles  
Repository: 2015Lab7  
Video: HD, HD-Captions  
Lecture 14: Best Practices, Recommendations and MapReduce.  
Slides: On Github, Download  
Video: HD, HD-CAPTIONS  
Lecture 15: MapReduce Combiners and Spark  
Slides for MapReduce: On Github, Download  
Slides and Notebook for Spark: On Github, Download,Notebook  
Video: HD, HD-CAPTIONS
Week 9 (Mon Oct 26 - Fri 30 Oct)
Lab 8: Vagrant and VirtualBox, AWS, and Spark  
Repository: 2015Lab8  
Video: HD, HD-CAPTIONS  
Lecture 16: Bayes Theorem and Bayesian Methods  
Slides: On Github, Download 
Video: HD, HD-CAPTIONS  
Lecture 17: Bayesian Methods Continued  
Slides: On Github, Download  
Video: HD, HD-Captions
Week 10 (Mon Nov 2 - Fri 6 Nov)
Lab 9: Bayes  
Repository: 2015Lab9  
Video: HD-Bayes1,HD-Bayes1-Captions HD-Bayes2,HD-Bayes2-Captions  
Lecture 18: Bayesian Methods Continued,Text Data  
Slides: On Github, Download  
Video: HD, HD-CAPTIONS  
Lecture BONUS: Interactive Visualization  
Video: HD,HD-CAPTIONS
Week 11 (Mon Nov 9 - Fri 13 Nov)
Lab 10: Text and Clustering  
Repository: 2015Lab10  
Video: HD,HD-CAPTIONS  
Lecture 19: Clustering  
Slides: On Github, Download  
Video: HD, HD-CAPTIONS  
Lecture 20: Effective Presentations  
Slides: On Github, Download  
Video: HD,HD-CAPTIONS
Week 12 (Mon Nov 16 - Fri 20 Nov)
Lab 10: Projects, and an example  
Repository: 2015Lab11  
Video: HD,HD-CAPTIONS  
Lecture 21: Experimental Design  
Slides: On Github, Download  
Video: HD, HD-CAPTIONS  
Lecture 22: Deep Networks  
Slides: On Github, Download  
Video: HD, HD-CAPTIONS
Week 13 (Mon Nov 30 - Fri 4 Dec)
Lecture 23: Guest Lecture: Building Data Science  
Video: HD  
Lecture 24: Wrapup, and where to go from here.  
Slides: On Github, Download  
Video: HD
all materials and videos: http://cs109.github.io/2015/pages/videos.html  
github: https://github.com/cs109
