Interpretable machine learning:

definitions, methods, and applications

Jamie Murdoch, Chandan Singh, Karl Kumbier, Reza Abbasi Asl, Bin Yu

interpretable machine learning: the use of machine-learning models for the extraction of relevant knowledge about domain relationships contained in data

Predictive accuracy

Descriptive accuracy

The degree to which an interpretation method objectively captures the relationships learned by ML models.

Relevant

An interpretation is relevant if it provides insight for a particular audience into a chosen domain problem.

Relevancy

Model-based

sparsity
simulatability
modularity
feature engineering
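
To make the sparsity item concrete, here is a minimal sketch of one common way to obtain a sparse model: L1-penalized least squares (the lasso), fitted by cyclic coordinate descent. The slides do not name a specific method, so the choice of the lasso, the synthetic data, the penalty value, and the function names `soft_threshold` and `lasso_coordinate_descent` are all assumptions made for illustration.

```python
import random

def soft_threshold(value, threshold):
    """Shrink value toward zero; anything within [-threshold, threshold] becomes 0."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0

def lasso_coordinate_descent(X, y, penalty, n_sweeps=200):
    """Minimize (1/2n) * ||y - Xw||^2 + penalty * ||w||_1 by cyclic coordinate descent."""
    n, d = len(X), len(X[0])
    w = [0.0] * d
    residual = list(y)  # equals y - Xw, and w starts at zero
    for _ in range(n_sweeps):
        for j in range(d):
            col = [X[i][j] for i in range(n)]
            col_sq = sum(c * c for c in col) / n
            if col_sq == 0.0:
                continue
            # Correlation of feature j with the residual that excludes feature j itself.
            rho = sum(col[i] * (residual[i] + w[j] * col[i]) for i in range(n)) / n
            new_wj = soft_threshold(rho, penalty) / col_sq
            delta = new_wj - w[j]
            if delta != 0.0:
                for i in range(n):
                    residual[i] -= delta * col[i]
                w[j] = new_wj
    return w

# Synthetic data (an assumption for this sketch): only features 0 and 3 affect y.
rng = random.Random(0)
n_samples, n_features = 200, 6
X = [[rng.gauss(0, 1) for _ in range(n_features)] for _ in range(n_samples)]
y = [3.0 * row[0] - 2.0 * row[3] + rng.gauss(0, 0.1) for row in X]

weights = lasso_coordinate_descent(X, y, penalty=0.3)
selected = [j for j, wj in enumerate(weights) if wj != 0.0]
print("weights:", [round(wj, 3) for wj in weights])
print("selected features:", selected)
```

The fitted model can be read directly: features with a zero weight play no role in the prediction, and the few nonzero weights are small enough in number to inspect by hand. The penalty also shrinks the surviving weights toward zero, so their magnitudes understate the true effects.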

Post hoc

dataset-level

feature importances
visualization
trends + outliers

prediction-level
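
As an illustration of a dataset-level feature importance, the sketch below uses permutation importance: shuffle one feature's column, re-score the model, and record how much the error grows. This is one common post hoc technique, not necessarily the one the authors have in mind. The `black_box` function is a hypothetical stand-in for any already-fitted predictor, and the data are synthetic.

```python
import random

def mean_squared_error(model, X, y):
    """Average squared difference between model predictions and targets."""
    return sum((model(row) - target) ** 2 for row, target in zip(X, y)) / len(y)

def permutation_importance(model, X, y, n_repeats=10, seed=0):
    """Average increase in error when a single feature's column is shuffled across rows."""
    rng = random.Random(seed)
    baseline = mean_squared_error(model, X, y)
    importances = []
    for j in range(len(X[0])):
        increases = []
        for _ in range(n_repeats):
            column = [row[j] for row in X]
            rng.shuffle(column)
            # Rebuild the dataset with only column j permuted.
            X_perm = [row[:j] + [value] + row[j + 1:] for row, value in zip(X, column)]
            increases.append(mean_squared_error(model, X_perm, y) - baseline)
        importances.append(sum(increases) / n_repeats)
    return importances

# Hypothetical fitted model: relies heavily on feature 0, weakly on feature 1, ignores feature 2.
def black_box(row):
    return 4.0 * row[0] + 1.0 * row[1]

data_rng = random.Random(1)
X = [[data_rng.gauss(0, 1) for _ in range(3)] for _ in range(300)]
y = [4.0 * row[0] + 1.0 * row[1] + data_rng.gauss(0, 0.1) for row in X]

scores = permutation_importance(black_box, X, y)
ranking = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
print("importance scores:", [round(s, 3) for s in scores])
print("features ranked by importance:", ranking)
```

Because the procedure only calls the model for predictions, it applies to any predictor without access to its internals. A feature the model never uses gets a score of zero, since shuffling it cannot change any prediction.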

Future work

Evaluating the PDR desiderata (predictive accuracy, descriptive accuracy, relevancy) of different methods

measuring descriptive accuracy
demonstrating relevancy to real-world problems

new model-based methods
new post hoc methods

what should interpretations look like?
improving predictive accuracy with interpretations