CSE 5523: �Machine Learning (ML)
Course information
https://sites.google.com/view/osu-cse-5523-au24-chao/
(for course information, weekly schedule, and reading update)
Dr. Wei-Lun (Harry) Chao (chao.209@osu.edu), Office: DL 587
Assistant professor in CSE (PhD: USC; Postdoc: Cornell)
Tai-Yu Pan (pan.667)
2
A bit about me
Machine learning and its applications to
3
Pancreatic
cancer
A bit about me
Learning with imperfect data sources
4
KITTI
(Germany)
Argoverse
(USA)
nuScenes
(USA, Singapore)
Lyft
(USA)
Waymo
(USA)
A bit about me
Course information
Course information
7
Communications
8
Questions?
Grading and homework
Grading (tentative)
Guidelines
Tentative schedule
Homework
Exams
Policy
Academic integrity
(Re-)grading
Pre-requisites & what to expect?
13
SEI comments
14
Textbook
15
Pattern Recognition and Machine Learning
Machine Learning: A Probabilistic Perspective
Textbook
16
Deep Learning
Understanding Machine Learning
Other great textbooks
17
Learning from Data
Foundations of Machine Learning
Machine Learning:
A Bayesian and Optimization Perspective
The Elements of Statistical Learning
Machine Learning Refined
Introduction to Machine Learning
Other great textbooks
18
Deep Learning:
Foundations and Concepts
Understanding Deep Learning
Dive into Deep Learning
Other excellent ML courses
19
Important for this week
20
How to do well? How to learn ML well?
21
Important dates
22
Questions?
Today
Introduction
Machine learning setup
Course overview and math showtime
24
What is machine learning?
What is machine learning?
A set of methods that can automatically detect patterns in data, and then use the uncovered patterns to predict future data, or to perform other kinds of decision making under uncertainty
Kevin Murphy. Machine learning: a probabilistic perspective.
26
The term machine learning refers to the automated detection of meaningful patterns in data
S. Shalev-Shwartz and S. Ben-David. Understanding machine learning.
What is machine learning?
A set of methods that can automatically detect patterns in data, and then use the uncovered patterns to predict future data, or to perform other kinds of decision making under uncertainty
Kevin Murphy. Machine learning: a probabilistic perspective.
27
The term machine learning refers to the automated detection of meaningful patterns in data
S. Shalev-Shwartz and S. Ben-David. Understanding machine learning.
What is machine learning?
This book is about learning from data.
Sergios Theodoridis. Machine learning: a Bayesian and optimization perspective.
28
We choose the title “learning from data” that faithfully describes what the subject is about.
Y. Abu-Mostafa, M. Magdon-Ismail, H-T Lin. Learning from data.
Example: coin classifier
29
Patterns in the space of weight and size
[Figure credit: Y. Abu-Mostafa, M. Magdon-Ismail, H-T Lin. Learning from data.]
Other examples
30
Key ingredients
31
Memorization vs. generalization
32
cat
Model
Input
Label
Model
Memorization
?
Test data
cat
cat
dog
Training data
Memorization vs. generalization
33
cat
cat
cat
dog
Training data
Agent function
Input
Label
Model
Generalization
cat
Test data
The detected patterns should be able to generalize to future test instances.
Learning algorithms
34
Learning algorithms
Training data
(experience,
past observations)
Learned models & patterns
(knowledge, expertise)
ML is everywhere!
35
Speech Processing & Natural Language Processing
36
Computer Vision
37
[Source: Detectron2]
[Source: Graham Murdoch/Popular Science]
Autonomous Driving
38
Perception
Prediction & Planning
Action & decision
Radar
Sonar
Camera
Others
LiDAR
Questions?
Machine learning setup
Training/testing
Supervised/unsupervised
Classification/regression
Today
Introduction
Machine learning setup
41
“Supervised” ML pipeline
42
Learning algorithms
Training data
Learned models & patterns
[Figure credit: Y. Abu-Mostafa, M. Magdon-Ismail, H-T Lin. Learning from data.]
“Supervised” ML pipeline
43
Learning algorithms
Training data
Learned models & patterns
Test data
[Figure credit: Y. Abu-Mostafa, M. Magdon-Ismail, H-T Lin. Learning from data.]
How to know if the model works well or not?
44
Learning algorithms
Training data
Learned models & patterns
Test data
oracle
25
Ground truth
Difference
(loss)
[Figure credit: Y. Abu-Mostafa, M. Magdon-Ismail, H-T Lin. Learning from data.]
Caution! Must be step-by-step!
45
Learning algorithms
Training data
Learned models & patterns
Test data
oracle
25
Ground truth
Difference
(loss)
1
2
3
[Figure credit: Y. Abu-Mostafa, M. Magdon-Ismail, H-T Lin. Learning from data.]
Caution! Must be step-by-step!
46
Learning algorithms
Training data
Learned models & patterns
Test data
oracle
25
Ground truth
Difference
(loss)
Pretend you don’t know!
[Figure credit: Y. Abu-Mostafa, M. Magdon-Ismail, H-T Lin. Learning from data.]
Test data
47
Today
Introduction
Machine learning setup
Course overview and math showtime
48
Different flavors of ML problems
49
Data and features
50
Time index
Machine learning (ML) is the study of computer algorithms that improve automatically through experience. ..
Supervised learning
51
laptop
laptop
camera
camera
bike
bike
Unsupervised learning
52
Unsupervised learning
53
Unsupervised learning
54
Unsupervised learning
55
Unsupervised learning
56
Evaluation 1:
Visualization
Evaluation 2:
Hide
Compare
Unsupervised learning
57
Training data
Test data
Questions?�(We will focus mainly on supervised learning for the first half of the semester!)
Today
Introduction
Machine learning setup
Course overview and math showtime
59
Classification vs. regression
60
Machine learning: capture patterns from training data
that can be generalized to future unseen data
Classification vs. regression: training data
61
x: distance
y: price
x[1]: year in use
x[2]: miles
Regression (bus ticket):
From x (distance), predict y (price)
Classification (car buying company):
From x (year, miles), predict y (buy or not)
y:
Buy
Not
DO NOT mean the input data are always 1-D or 2-D!
Classification vs. regression: find patterns
62
x: distance
y: price
x[1]: year in use
x[2]: miles
Regression (bus ticket):
From x (distance), predict y (price)
Classification (car buying company):
From x (year, miles), predict y (buy or not)
Linear relationship
Linear boundary
y:
Buy
Not
Classification vs. regression: generalization
63
x: distance
y: price
x[1]: year in use
x[2]: miles
Regression (bus ticket):
From x (distance), predict y (price)
Classification (car buying company):
From x (year, miles), predict y (buy or not)
Linear boundary
Linear relationship
y:
Buy
Not
Classification vs. regression: generalization?
64
x: distance
y: price
x[1]: year in use
x[2]: miles
Regression (bus ticket):
From x (distance), predict y (price)
Classification (car buying company):
From x (year, miles), predict y (buy or not)
Linear boundary
Linear relationship
y:
Buy
Not
?
?
Questions to ML
65
Today
Introduction
Machine learning setup
Course overview and math showtime
66
Course overview
Topics
68
Topics
69
Math showtime
What you will see … (ELBO)
71
Probability, inequality, …
What you will see … (logistic loss)
72
Loss function, log, …
What you will see … (covariance matrix)
73
Linear algebra, …
Math in ML
74
Math quiz: linear algebra-1
75
Math quiz: linear algebra-2
76
Math quiz: linear algebra-2
77
Math quiz: probability
78
Math quiz: probability
79
Math quiz: multivariate calculus
80
Math quiz: multivariate calculus
81
Summary
82