Course webpage for Math 251 Statistical and Machine Learning Classification

Course webpage for Math 251 Statistical and Machine Learning Classification

Back to my homepage

MATH 251 Statistical and Machine Learning Classification 

Fall 2022, San Jose State University

 

image
 

Course information [syllabus]

This is a graduate-level course on the machine learning branch of classification, covering the following topics: 

all based on the benchmark dataset of MNIST Handwritten Digits. Such a teaching strategy was partly inspired by Michael Nielsen's free online book - Neural Networks and Deep Learning, which notes explicitly that this dataset hits a ``sweet spot'' - it is challenging, but ``not so difficult as to require an extremely complicated solution, or tremendous computational power''. In addition, the digit recognition problem is very easy to understand, yet practically important.

Prerequisites: Math 164 Mathematical Statistics and Math 250 Mathematical Data Visualization

Technology requirements: 

Recommended readings:

  1. James, Witten, Hastie and Tibshirani (2017), “An Introduction to Statistical Learning with Applications in R”, Springer 
  2. Hastie, Tibshirani, and Friedman (2009), “The Elements of Statistical Learning: Data Mining, Inference, and Prediction”, Springer-Verlag 
  3. Nielson (2015), “Neural Networks and Deep Learning”, Determination Press
  4. Goodfellow, Bengio, and Courville (2016), “Deep Learning”, MIT Press

Course progress

Updated slides will be continuously posted below (those in light color are from Fall 2020  and will be updated). Please download a new copy of the new slides before each class (remember to refresh your browser).

Dates  Lecture Slides  Further Reading
8/23  Course introduction [slides]

 Chapters 1 and 2 of recommended reading 2

8/25  Instance-based classifiers [slides]

 Sections 2.2.3 and 5.1 of recommended reading 1 

9/1  Dimension reduction for classification [slides]

 2DLDA paper

9/6  Evaluation criteria [slides]  Yining Chen's slides
9/8  Bayes classifiers [slides]  Section 4.4 of recommended reading 1 
9/20  Logistic regression [slides]  Section 4.3 of recommended reading 1 
10/11  Support vector machine [slides]  [Chapter 9 of recommended reading 1] [Lagrange Dual]
11/1  Ensemble learning [slides]

 [Trevor Hastie's slides] [Adele Cutler's lecture]

 [Chapter 8 of recommended reading 1]

 [Cornell CS4780 notes on boosting]

11/15  Neural networks [slides] [Michael Nielsen’s book]
12/1  Introduction to deep learning resources [slides] [MIT 6.S191 course page]  [Standford CS 231n course page]
 

Additional learning resources

Useful course websites


Data sets


Instructor feedback

Feedback at any time of the semester is encouraged and greatly appreciated, and will be seriously considered by the instructor for improving the course experience for both you and your classmates. Please submit your anonymous feedback through this page.