�������A Scalable mixed-integer programming based framework for optimal decision tree��Haoran Zhu�University of Wisconsin-Madison�August 16, 2019��
IBM Research
© 2019 IBM Corporation
Introduction to optimal decision tree
© 2019 IBM Corporation
IBM Research
2
IBM Research
Main considerations
Interpretability
Scalability
Training accuracy
Testing accuracy
© 2019 IBM Corporation
IBM Research
3
IBM Research
Why “ODT”
Interpretability
Scalability
Training accuracy
Testing accuracy
© 2019 IBM Corporation
IBM Research
4
IBM Research
Common methods for learning “ODT”
© 2019 IBM Corporation
IBM Research
5
IBM Research
Common methods for learning “ODT”
© 2019 IBM Corporation
IBM Research
6
IBM Research
Common methods for learning “ODT”
Do data-selection before training
A different formulation
© 2019 IBM Corporation
IBM Research
7
IBM Research
Our motivation
© 2019 IBM Corporation
IBM Research
8
IBM Research
Our contribution
© 2019 IBM Corporation
IBM Research
9
IBM Research
Related state-of-art
© 2019 IBM Corporation
IBM Research
10
IBM Research
Tree structure
© 2019 IBM Corporation
IBM Research
11
IBM Research
MIP-ODT formulation
© 2019 IBM Corporation
IBM Research
12
IBM Research
MIP-ODT formulation
© 2019 IBM Corporation
IBM Research
13
IBM Research
Heuristic of the formulation
© 2019 IBM Corporation
IBM Research
14
IBM Research
Further strengthening the formulation
© 2019 IBM Corporation
IBM Research
15
IBM Research
Speedup the computation
© 2019 IBM Corporation
IBM Research
16
IBM Research
Modification for categorical feature
© 2019 IBM Corporation
IBM Research
17
IBM Research
LP-based data-selection
© 2019 IBM Corporation
IBM Research
18
IBM Research
LP-based data-selection
© 2019 IBM Corporation
IBM Research
19
IBM Research
LP-based data-selection
© 2019 IBM Corporation
IBM Research
20
IBM Research
Selecting extreme points
© 2019 IBM Corporation
IBM Research
21
IBM Research
Soft Convex Combination Constraint (CCC)
© 2019 IBM Corporation
IBM Research
22
IBM Research
Balanced data-selection
© 2019 IBM Corporation
IBM Research
23
IBM Research
Initial clustering matters
© 2019 IBM Corporation
IBM Research
24
IBM Research
Initial clustering matters
© 2019 IBM Corporation
IBM Research
25
IBM Research
Iterative ODT training algorithm
© 2019 IBM Corporation
IBM Research
26
IBM Research
Numerical results: medium-sized dataset
© 2019 IBM Corporation
IBM Research
27
IBM Research
Numerical results: medium-sized dataset (contd.)
© 2019 IBM Corporation
IBM Research
28
IBM Research
Numerical results: medium-sized dataset (bad instances)
© 2019 IBM Corporation
IBM Research
29
IBM Research
Further comparison: one more formulation in (Verwer and Zhang, 2019)
© 2019 IBM Corporation
IBM Research
30
IBM Research
Numerical result: large-sized dataset
© 2019 IBM Corporation
IBM Research
31
IBM Research
Numerical result: even larger……
© 2019 IBM Corporation
IBM Research
32
IBM Research
Tree depth D=3
© 2019 IBM Corporation
IBM Research
33
IBM Research
Numerical reports from (Rhuggenaath et al.: IEEE, 2018)
© 2019 IBM Corporation
IBM Research
34
IBM Research
Comparison of different data-selections
© 2019 IBM Corporation
IBM Research
35
IBM Research
IBM data (timeseries_rca)
© 2019 IBM Corporation
IBM Research
36
IBM Research
IBM data (FPAtable__oil_well__as__15D__60D)
1.6 : 1, 892, 6h | MIP-ODT-DS | CART | CART(entire) |
Training acc. | 0.76 | 0.70 | 0.97 |
Testing acc. | 0.88 | 0.89 | 0.97 |
Recall | 0.39 | 0.31 | 0 |
Precision | 0.10 | 0.09 | 0 |
© 2019 IBM Corporation
IBM Research
37
IBM Research
IBM data (FPAtable__oil_well__as__15D__60D)
3.2 : 1, 405, 1h | MIP-ODT-DS | CART | CART(entire) |
Training acc. | 0.82 | 0.76 | 0.97 |
Testing acc. | 0.75 | 0.75 | 0.97 |
Recall | 0.42 | 0.39 | 0 |
Precision | 0.05 | 0.05 | 0 |
© 2019 IBM Corporation
IBM Research
38
IBM Research
Thank you!
© 2019 IBM Corporation
IBM Research
39
IBM Research
Appendix
© 2019 IBM Corporation
IBM Research
40
IBM Research
Appendix
© 2019 IBM Corporation
IBM Research
41
IBM Research
Appendix
© 2019 IBM Corporation
IBM Research
42
IBM Research
Appendix
© 2019 IBM Corporation
IBM Research
43
IBM Research
Appendix
© 2019 IBM Corporation
IBM Research
44
IBM Research
Appendix (Segmentation; D=3)
© 2019 IBM Corporation
IBM Research
45
IBM Research
Appendix (Dermatology; D=3)
© 2019 IBM Corporation
IBM Research
46
IBM Research