Heart Data: logistic estimation
1
July 17, 2023
Data Science for All 2023
Lecture 1: Classification
Heart Data: logistic estimation
2
July 17, 2023
Data Science for All 2023
Lecture 1: Classification
Using Logistic Regression for Classification
3
Data Science for All 2023
Lecture 1: Classification
4
Multiple Logistic Regression
Data Science for All 2023
Lecture 1: Classification
Multiple Logistic Regression
5
Data Science for All 2023
Lecture 1: Classification
Classifier with two predictors
6
Data Science for All 2023
Lecture 1: Classification
Multiple Logistic Regression
7
Data Science for All 2023
Lecture 1: Classification
Multiple Logistic Regression
8
Data Science for All 2023
Lecture 1: Classification
Fitting Multiple Logistic Regression
9
Data Science for All 2023
Lecture 1: Classification
Interpreting Multiple Logistic Regression: an Example
10
Data Science for All 2023
Lecture 1: Classification
Interpreting Multiple Logistic Regression: an Example
11
Data Science for All 2023
Lecture 1: Classification
Classification boundaries
12
Data Science for All 2023
Lecture 1: Classification
Geometry of Data
13
Data Science for All 2023
Lecture 1: Classification
Geometry of Data
14
Data Science for All 2023
Lecture 1: Classification
Geometry of Data
15
Data Science for All 2023
Lecture 1: Classification
Geometry of Data
16
Data Science for All 2023
Lecture 1: Classification
Interpretable Models
17
Data Science for All 2023
Lecture 1: Classification
18
Decision Tree
Data Science for All 2023
Lecture 1: Classification
Decision Trees
19
Data Science for All 2023
Lecture 1: Classification
Tree-based Models
20
age?
overcast
student?
credit rating?
<=30
>40
no
yes
yes
yes
31..40
no
fair
excellent
yes
no
Root node
Internal nodes
Leaf nodes
Data Science for All 2023
Lecture 1: Classification
The Geometry of Flow Charts
21
Data Science for All 2023
Lecture 1: Classification
The Geometry of Flow Charts
then target value=no
22
age?
student?
credit rating?
<=30
>40
no
yes
yes
yes
31..40
no
fair
excellent
yes
no
Data Science for All 2023
Lecture 1: Classification
The Geometry of Flow Charts
23
Data Science for All 2023
Lecture 1: Classification
The Geometry of Flow Charts
24
Data Science for All 2023
Lecture 1: Classification
The Geometry of Flow Charts
25
Data Science for All 2023
Lecture 1: Classification
Learning the Model
26
Data Science for All 2023
Lecture 1: Classification
Learning the Model
27
Data Science for All 2023
Lecture 1: Classification
The key is to partition data into smaller regions
28
Data Science for All 2023
Lecture 1: Classification
Guideline of Splitting
29
Data Science for All 2023
Lecture 1: Classification
Example
30
Data Science for All 2023
Lecture 1: Classification
Which attribute to choose?
31
Ages
Yes
Yes
No
No
No
Yes
Yes
Yes
Yes
Yes
Yes
Yes
No
No
<=30
31…40
>40
VS.
Credit_Rating
Yes
Yes
Yes
No
No
No
Yes
Yes
Yes
Yes
Yes
YesNo
No
Excellent
Fair
Q: Which attribute is better for the classification task?
Data Science for All 2023
Lecture 1: Classification
How to define purity of each region?
32
Data Science for All 2023
Lecture 1: Classification
Classification Error
33
Data Science for All 2023
Lecture 1: Classification
Classification Error
34
Data Science for All 2023
Lecture 1: Classification
Stopping Conditions & Pruning
35
Data Science for All 2023
Lecture 1: Classification
Variance vs Bias
36
Data Science for All 2023
Lecture 1: Classification
Stopping Conditions
37
Data Science for All 2023
Lecture 1: Classification
Alternative to Using Stopping Conditions
38
Data Science for All 2023
Lecture 1: Classification
The Scalable Analytics Institute (ScAI)�Department of Computer Science�University of California, Los Angeles (UCLA)
Instructor: Jeehyun Hwang
Lecture 2: Clustering
Data Science for All 2023
Data Science for All 2023
Lecture 1: Classification
Image Segmentation
40
Data Science for All 2023
Lecture 1: Classification
Unsupervised Learning
41
Data Science for All 2023
Lecture 1: Classification
Unsupervised Learning
42
Data Science for All 2023
Lecture 1: Classification
Unsupervised Learning
43
Data Science for All 2023
Lecture 1: Classification
WHAT IS CLUSTERING ANALYSIS?
44
July 17, 2023
Section Divider
Section Divider
Section Divider
Data Science for All 2023
Lecture 1: Classification
What is Cluster Analysis?
45
Data Science for All 2023
Lecture 1: Classification
What is Cluster Analysis?
46
Data Science for All 2023
Lecture 1: Classification
What is Cluster Analysis?
47
Data Science for All 2023
Lecture 1: Classification
What is Cluster Analysis? - KNN
48
Data Science for All 2023
Lecture 1: Classification
What is Cluster Analysis? - KNN
49
Data Science for All 2023
Lecture 1: Classification
Applications of Cluster Analysis
50
Data Science for All 2023
Lecture 1: Classification
Clustering: Application Examples - Species
51
Data Science for All 2023
Lecture 1: Classification
Clustering: Application Examples - Marketing
52
Data Science for All 2023
Lecture 1: Classification
Clustering: Application Examples
53
Data Science for All 2023
Lecture 1: Classification
K-means objective
54
Data Science for All 2023
Lecture 1: Classification
K-means objective
55
Data Science for All 2023
Lecture 1: Classification
K-means objective – Iterative Process
56
Data Science for All 2023
Lecture 1: Classification
K-means Example
57
Data Science for All 2023
Lecture 1: Classification
K-means Example
58
Data Science for All 2023
Lecture 1: Classification
K-means Example on Image Segmentation
59
Data Science for All 2023
Lecture 1: Classification
K-means Example
60
Data Science for All 2023
Lecture 1: Classification
Hierarchical Clustering
61
Step 0
Step 1
Step 2
Step 3
Step 4
b
d
c
e
a
a b
d e
c d e
a b c d e
Step 4
Step 3
Step 2
Step 1
Step 0
agglomerative
(AGNES)
divisive
(DIANA)
Data Science for All 2023
Lecture 1: Classification
Pseudo Code
62
Data Science for All 2023
Lecture 1: Classification
Hierchical Clustering
63
Data Science for All 2023
Lecture 1: Classification
Hierchical Clustering
64
Data Science for All 2023
Lecture 1: Classification
Hierchical Clustering
65
Data Science for All 2023
Lecture 1: Classification
Hierchical Clustering
66
Data Science for All 2023
Lecture 1: Classification
Hierchical Clustering
67
Data Science for All 2023
Lecture 1: Classification
Hierchical Clustering
68
Data Science for All 2023
Lecture 1: Classification
Hierchical Clustering
69
Data Science for All 2023
Lecture 1: Classification
Hierchical Clustering
70
Data Science for All 2023
Lecture 1: Classification
Hierchical Clustering
71
Data Science for All 2023
Lecture 1: Classification
Density-Based Clustering Methods
72
Data Science for All 2023
Lecture 1: Classification
DBSCAN: Basic Concepts
|NEps (q)| ≥ MinPts
73
MinPts = 5
Eps = 1 cm
p
q
Data Science for All 2023
Lecture 1: Classification
DBSCAN: Basic Concepts
74
Core
Border
Eps = 1cm
MinPts = 5
Data Science for All 2023
Lecture 1: Classification
DBSCAN: Basic Concepts
75
Core
Border
Noise
Eps = 1cm
MinPts = 5
Data Science for All 2023
Lecture 1: Classification
DBSCAN: Density-Based Spatial Clustering of Applications with Noise
76
Core
Border
Noise
Eps = 1cm
MinPts = 5
Data Science for All 2023
Lecture 1: Classification
DBSCAN: The Algorithm
77
Data Science for All 2023
Lecture 1: Classification
DBSCAN: can detect clusters with arbitrary shapes
78
Data Science for All 2023
Lecture 1: Classification
DBSCAN: sensitive to minimum num of points
79
Data Science for All 2023
Lecture 1: Classification
DBSCAN: sensitive to eps
80
Data Science for All 2023
Lecture 1: Classification
K-means vs DBSCAN
81
Data Science for All 2023
Lecture 1: Classification
K-means vs DBSCAN
82
K-means
DBSCAN
Data Science for All 2023
Lecture 1: Classification
K-means vs DBSCAN
83
K-means
DBSCAN
Data Science for All 2023
Lecture 1: Classification
K-means vs DBSCAN
84
K-means
DBSCAN
Data Science for All 2023
Lecture 1: Classification
K-means vs DBSCAN
85
K-means
DBSCAN
Data Science for All 2023
Lecture 1: Classification
K-means vs DBSCAN
86
Data Science for All 2023
Lecture 1: Classification
Summary
87
Data Science for All 2023
Lecture 1: Classification