Due to COVID-19, this course will be delivered over Zoom: https://ucsd.zoom.us/j/93540989128
This course mainly focuses on introducing machine learning methods and models that are useful in analyzing real-world data. It will cover classical regression & classification models, clustering methods, and deep neural networks. No previous background in machine learning is required, but all participants should be comfortable with programming, and with basic optimization and linear algebra.
No textbook is required, but here are some recommended readings:
- The Elements of Statistical Learning by Trevor Hastie, Robert Tibshirani, and Jerome Friedman.
- Data Mining: Concepts and Techniques by Jiawei Han et al.
- Pattern Recognition and Machine Learning by Christopher M. Bishop.
- Dive into Deep Learning by Aston Zhang et al.
Ability to code in Python: functions, control structures, string handling, arrays and dictionaries.
Familiarity with basic probability, at the level of CSE 21 or CSE 103.
Familiarity with basic linear algebra, at the level of Math 18 or Math 20F.
TAs and Tutors
- Teaching Assistants:
- Dheeraj Mekala (dmekala AT ucsd.edu)
- Xinghan Wang (x2wang AT ucsd.edu)
- Weijian Xu (wex041 AT ucsd.edu)
- Zhenyu Bi (z1bi AT ucsd.edu)
- Yilun Hao (yih301 AT ucsd.edu)
- Joey Hou (z9hou AT ucsd.edu)
- Colin Wang (ziw029 AT ucsd.edu)
- Jingbo Shang
- Office Hour: Wednesdays, 10 to 11 AM
- Zoom link: https://ucsd.zoom.us/my/jshang
- Dheeraj Mekala
- Office Hour: Thursdays, 11 AM to 12 PM
- Zoom link: https://ucsd.zoom.us/j/97330988404
- Xinghan Wang
- Office Hour: Thursdays, 4 to 5 PM
- Zoom link: https://ucsd.zoom.us/j/9287819575
- Weijian Xu
- Office Hour: Fridays, 2 to 3 PM
- Zoom link: https://ucsd.zoom.us/j/97304192865
- Zhenyu Bi
- Office Hour: Mondays, 8 to 9 AM
- Zoom link: https://ucsd.zoom.us/j/98625876507
- Yilun Hao
- Office Hour: Fridays, 7 to 8 PM
- Zoom link: https://ucsd.zoom.us/j/8864309997
- Joey Hou
- Office Hour: Mondays, 8 to 9 PM
- Zoom link: https://ucsd.zoom.us/j/98613657347
- Colin Wang
- Office Hour: Tuesdays, 8 to 9 PM
- Zoom link: https://ucsd.zoom.us/my/zwcolin
Note: all times are in Pacific Time.
- Homework: 15% each. The lowest of your five homework grades is dropped (equivalently, you may skip one homework), so homework counts for 60% in total.
- Midterm: 40%.
- You should complete all work individually.
- Late submissions are NOT accepted.
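The weighting above can be sketched as a small calculation. This is only an illustration, not the official grading script: the function name `course_grade` is hypothetical, scores are assumed to be on a 0 to 100 scale, and a skipped homework is assumed to count as 0 before the lowest score is dropped.

```python
def course_grade(homework_scores, midterm_score):
    """Estimate the final course percentage.

    Assumptions (not from the syllabus): scores are on a 0-100 scale,
    and a skipped homework is treated as a 0 that then gets dropped.
    """
    if len(homework_scores) > 5:
        raise ValueError("at most five homework scores are expected")

    # Pad missing (skipped) homework with zeros so there are always five.
    scores = list(homework_scores) + [0] * (5 - len(homework_scores))

    # Drop the single lowest score and keep the remaining four.
    kept = sorted(scores)[1:]

    # Four homeworks at 15% each (60% total) plus the midterm at 40%.
    return (15 * sum(kept) + 40 * midterm_score) / 100


if __name__ == "__main__":
    # One homework skipped entirely: the skipped one is the dropped one.
    print(course_grade([100, 90, 80, 70], 85))  # 85.0
```

Under these assumptions, skipping one homework and scoring 0 on it give the same result, which matches the "dropped or skipped" wording above.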
Recording Note: Please download each recording to watch it at full length. The Dropbox website only previews the first hour.
HW Note: All homework is due before lecture, at 9:30 AM PT.
(the schedule is tentative)
|Week||Date||Topic & Slides||Events|
|1||03/30 (Tue)||Introduction: Concepts and Evaluations||HW1 out|
|1||04/01 (Thu)||A Geometric View of Linear Algebra|
|2||04/06 (Tue)||Nearest Neighbor Classification||HW1 due, HW2 out|
|2||04/08 (Thu)||Gradients and Optimization|
|3||04/13 (Tue)||Least-Squares Regression, Logistic Regression, and Perceptron|
|3||04/15 (Thu)||Overfitting and Regularization|
|4||04/20 (Tue)||Support Vector Machine (SVM)||HW2 due, HW3 out|
|4||04/22 (Thu)||SVM: Duality and Kernel|
|5||04/27 (Tue)||K-Means Clustering & its Variants|
|5||04/29 (Thu)||“Soft” Clustering: Gaussian Mixture|
|6||05/04 (Tue)||Principal Component Analysis|
|6||05/06 (Thu)||Midterm (no class, take-home, 24-hour)|
|7||05/11 (Tue)||Naive Bayes and Decision Tree||HW3 due, HW4 out|
|7||05/13 (Thu)||Ensemble Learning: Bagging and Boosting|
|8||05/18 (Tue)||Multi-class Classification|
|8||05/20 (Thu)||Feed-forward Neural Networks|
|9||05/25 (Tue)||Convolutional Neural Networks||HW4 due, HW5 out|
|9||05/27 (Thu)||Semi-supervised Learning|
|10||06/01 (Tue)||Weakly-supervised Learning|
|10||06/03 (Thu)||Bias-Variance in Deep Neural Networks||HW5 due|
The lowest of your five homework grades is dropped (equivalently, you may skip one homework).
- HW1: Concepts and Evaluations (15%). This homework mainly focuses on the machine learning concepts and how to evaluate different tasks.
- HW2: KNN and Linear Models (15%). This homework mainly focuses on nearest neighbor, least-squares regression, logistic regression, and regularization.
- HW3: SVM and Clustering (15%). This homework mainly focuses on support vector machines, k-means, Gaussian mixtures, and PCA.
- HW4: Ensemble Learning (15%). This homework mainly focuses on decision tree, random forest, and AdaBoost.
- HW5: Neural Networks (15%). This homework mainly focuses on implementation of some simple neural networks.
The midterm is an open-book, take-home exam that covers all lectures given before it. Most of the questions will be open-ended, and some may be slightly more difficult than the homework. You will have 24 hours to complete the midterm, which is expected to take about 2 hours.
- Start: May 6, 9:30 AM PT
- End: May 7, 9:30 AM PT
- Midterm problems download: TBD
- Please make your submissions on Gradescope.