CS 534 - Machine Learning
Course Overview
Machine learning impacts many applications, including the sciences (e.g., predicting genome-protein interactions, detecting tumors, personalized medicine) and consumer products (e.g., Amazon's Alexa, Microsoft Kinect, Netflix). In this course, students will learn the fundamental theory and algorithms of machine learning. Students will also gain practical experience applying standard machine learning methods to a variety of problems.
Prerequisites:
 Undergraduate-level linear algebra
 Undergraduate-level probability
 Exposure to statistics
 Programming ability in Python, Matlab, Julia, R, or C++
Course Logistics
 Piazza: All announcements, assignment clarifications, and slide corrections will be posted here. Make sure you check it regularly.
 Office Hours:
 Joyce Ho: M 1:30 PM - 3:30 PM, W 9:30 AM - 12:00 PM @ MSC W414
 Textbook
 Required: The Elements of Statistical Learning: Data Mining, Inference, and Prediction, by Trevor Hastie, Robert Tibshirani & Jerome Friedman
 Supplemental: Machine Learning: A Probabilistic Perspective, by Kevin Murphy
 Supplemental: Pattern Recognition and Machine Learning, by Christopher Bishop
(Tentative) Course Schedule
The reading material listed below is optional and the lecture plan may deviate over the course of the semester.
#  Date  Topic  References  Assignments

Introduction
1  8/23  Overview & Course Logistics
2  8/29  Crash Course in Optimization and Statistics

Supervised Learning I
3  8/31  Linear Regression
4  9/5
5  9/7  Naive Bayes & Linear Classification
6  9/12

Model Assessment & Selection
7  9/14  Bias & Variance Tradeoff
8  9/19  Model Assessment
9  9/21  Bootstrap & Model Selection
10  9/26

Supervised Learning II
11  9/28  Boosting, Trees & Additive Models
12  10/3
13  10/5  Ensembles & Random Forests
14  10/12  Support Vector Machines
15  10/17
16  10/19  Neural Networks
17  10/24
18  10/26  K Nearest Neighbors

Unsupervised Learning
19  10/31  Dimensionality Reduction
20  11/2  Clustering & Mixture Models

Midterm
21  11/7  Midterm
22  11/9  Project Madness & TBD

Other Topics
23  11/14  Hidden Markov Models
24  11/16  Deep Learning
25  11/21
26  11/28  Topic Models
27  11/30  Recommendation Systems

Project Presentations
28  12/5  Presentations
29  12/7
Course Grading
Component  Weight
Homeworks  35.0%
Midterm  17.5%
Project  40.0%
Participation  7.5%
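As a rough illustration of how the weights above combine, the following Python sketch computes a weighted course score. It assumes each component is scored on a 0-100 scale, which the syllabus does not state; the function name `course_score` and the example scores are hypothetical.

```python
# Weights taken from the Course Grading table (percent of the final grade).
WEIGHTS = {
    "Homeworks": 35.0,
    "Midterm": 17.5,
    "Project": 40.0,
    "Participation": 7.5,
}


def course_score(scores):
    """Weighted average of component scores.

    Assumption: every component score is on a 0-100 scale.
    """
    # Sanity check: the weights in the table add up to 100%.
    assert abs(sum(WEIGHTS.values()) - 100.0) < 1e-9
    return sum(WEIGHTS[name] * scores[name] for name in WEIGHTS) / 100.0


# Hypothetical scores, for illustration only.
example = {"Homeworks": 90.0, "Midterm": 80.0, "Project": 85.0, "Participation": 100.0}
print(course_score(example))
```

Any curving or letter-grade cutoffs are outside what this sketch covers.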
Project
You are encouraged to work in groups of 2-3 for the final project. The goal is either to develop a novel algorithm (novelty bonus points will be given depending on the level of difficulty) or to try various existing ML algorithms on a dataset. The project is a critical part of the course and a significant factor in determining your grade. Teams are required to hand in a project proposal and a final project report, and to prepare two presentations on their work. By default, all team members will receive the same score for their project. If a team feels that this is unfair, perhaps due to HIGHLY imbalanced contributions, then every team member needs to provide feedback on the contribution of each of the other team members via email before the final report is submitted. After that, I will meet with the entire group to mediate. More details on projects are posted on Piazza under the projects folder.
Component  Due Date  Weight
Proposal  10/25  15%
Madness  11/8  10%
Presentation  12/4-12/11  25%
Report  12/12  50%
Assignment and Exam Policy
 Assignments
 Assignments (homeworks 1-5, final projects) are due electronically on Canvas at 11:59 PM.
 Each student receives 6 late days that can be used across the 5 homeworks throughout the semester. Each late day extends the deadline by 24 hours.
 A maximum of 3 late days can be used on a given homework.
 Late days apply to the entire homework, so handing in one problem late counts as a late day towards the whole homework.
 No credit will be given if you submit the homework late and have no remaining late days.
 Exam
 The midterm (open-book, open-notes, no electronic devices) must be taken at the required time.
 Requests for rescheduling the midterm exam will only be considered if the request is made at least a week prior to the exam date.
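The late-day rules for homeworks listed above can be sketched in Python as follows. This is one reading of the policy, not an official calculator: it assumes that any fraction of a 24-hour period counts as a full late day, that a homework needing more days than allowed receives no credit, and that a rejected submission does not consume late days. The function name `apply_late_days` is hypothetical.

```python
import math

TOTAL_LATE_DAYS = 6   # per student, shared across the 5 homeworks
MAX_PER_HOMEWORK = 3  # most late days usable on a single homework


def apply_late_days(hours_late_per_hw):
    """Return (credit_flags, days_remaining) for homeworks in due-date order.

    hours_late_per_hw: hours past the deadline for each homework (0 if on time).
    """
    remaining = TOTAL_LATE_DAYS
    credit = []
    for hours in hours_late_per_hw:
        # Assumption: a partial day late is charged as a whole late day.
        days = math.ceil(hours / 24) if hours > 0 else 0
        if days <= min(MAX_PER_HOMEWORK, remaining):
            remaining -= days
            credit.append(True)
        else:
            # Late with too few late days left (or over the per-homework cap):
            # no credit. Assumption: no late days are charged in this case.
            credit.append(False)
    return credit, remaining


# Hypothetical semester: on time, 30 h late, 72 h late, 10 h late, 1 h late.
print(apply_late_days([0, 30, 72, 10, 1]))
# → ([True, True, True, True, False], 0)
```

In the example, the first four homeworks use 0, 2, 3, and 1 late days, so the fifth homework is late with no days remaining and earns no credit.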
Honor Code
All class work is governed by the College Honor Code and Departmental Policy. It is acceptable and encouraged to discuss homeworks with other students. However, any such discussion should be noted on your submitted homework, and you must write all code and write-ups yourself. Any code or write-up that is found to be similar to another student's is grounds for an honor code investigation by the Director of Graduate Studies, Laney Graduate School, and the honor council.