Interpretable machine learning:
definitions, methods, and applications
Jamie Murdoch, Chandan Singh, Karl Kumbier, Reza Abbasi Asl, Bin Yu
interpretable machine learning: the use of machine-learning models for the extraction of relevant knowledge about domain relationships contained in data
Predictive accuracy
Descriptive accuracy
�Relevant
An interpretation is relevant if it provides insight for a particular audience into a chosen domain problem.
the degree to which an interpretation method objectively captures the relationships learned by ML models.
Relevancy
Model-based
Post hoc
Future work