Why should I trust you?
Explaining the Predictions of Any Classifier
Trusting a model
Widespread adoption of machine learning requires more trust in a model:
LIME
Local
Interpretable
Model-Agnostic
Explanations
Explanations make model’s output more reliable
Why not just the test set?
Two models that for specific input give the same output, may do so for very different reasons.
It’s easy to see the difference and pick better model when given explanations (due to prior knowledge).
Intuition
General framework
To use LIME for a specific model, we need to specify:
Concrete example - sparse linear explanations
Explaining Inception prediction
Explaining model globally
Explaining model globally
Experiments
Are explanations faithful to the model?
Each model (sparse LR and decision tree) is trained to use at most 10 features.
We test how many of these features are recovered by:
Should I trust this prediction?
25% of the features are randomly selected as untrustworthy. Prediction of a model is considered untrustworthy if the prediction changes when untrustworthy features are removed. We test precision and recall of finding untrustworthy predictions.
Can I trust this model?
We add 10 “noisy” features and train pair of random forests with:
We test if user can identify the better classifier based on explanations from the validation set. Simulated user picks the model with fewer untrustworthy predictions.
MTurk experiments
Can users select the best classifier?
Users (with no knowledge of machine learning) need to decide between two models:
Second model is better but it’s not evident from the test score.
Can non-experts improve a classifier?
Users (who are unfamiliar with feature engineering) need to identify which words from the explanations are unimportant and should be removed. Then new classifiers are trained.
Do explanations lead to insights?
We train classifier distinguishing between wolves and huskies. In the training set each wolf had snow in the background while huskies did not. Users were asked three questions:
tinyurl.com/WhyShouldITrustYou