Information-Theoretic Tools for Responsible Machine Learning
Shahab Asoodeh (McMaster University)
Flavio P. Calmon (Harvard University)
Mario Diaz (Universidad Nacional Autónoma de México)
Haewon Jeong (UC Santa Barbara)
2022 IEEE International Symposium on Information Theory (ISIT)
Part I: Overview
Shahab Asoodeh (McMaster University)
Flavio P. Calmon (Harvard University)
Mario Diaz (Universidad Nacional Autónoma de México)
Haewon Jeong (UC Santa Barbara)
2022 IEEE International Symposium on Information Theory (ISIT)
Shannon established
information theory
1948
Shannon established
information theory
Reliable data communication, processing, and storage
1948
1950~2000s
Shannon established
information theory
Reliable data communication, processing, and storage
Fast development of ML
1948
1950~2000s
2000s~2010s
Shannon established
information theory
Reliable data communication, processing, and storage
Fast development of ML
Emerging challenges in ML
1948
1950~2000s
2000s~2010s
Today
Challenges in Responsible Machine Learning
Data-driven algorithms are increasingly applied to individual-level data to support decision-making in applications of individual-level consequence.
Challenges in Responsible Machine Learning
Protests against discriminatory A-levels grading algorithm
Data-driven algorithms are increasingly applied to individual-level data to support decision-making in applications of individual-level consequence.
Challenges in Responsible Machine Learning
Protests against discriminatory A-levels grading algorithm
Data-driven algorithms are increasingly applied to individual-level data to support decision-making in applications of individual-level consequence.
Challenges in Responsible Machine Learning
Protests against discriminatory A-levels grading algorithm
Microsoft “AI” deployed to predict teenage pregnancy in Salta, Argentina
Data-driven algorithms are increasingly applied to individual-level data to support decision-making in applications of individual-level consequence.
Challenges in Responsible Machine Learning
Protests against discriminatory A-levels grading algorithm
Microsoft “AI” deployed to predict teenage pregnancy in Salta, Argentina
Data-driven algorithms are increasingly applied to individual-level data to support decision-making in applications of individual-level consequence.
Privacy
Fairness
Several companies and Governments are investing in fair and private ML…
Privacy
Fairness
Several companies and Governments are investing in fair and private ML…
Privacy
Fairness
Several companies and Governments are investing in fair and private ML…but responsible ML is hard.
Alphabet (Google’s) 10-K filing (Feb 2019):
Microsoft 10-K filing (Aug 2018):
Several companies and Governments are investing in fair and private ML…but responsible ML is hard.
Alphabet (Google’s) 10-K filing (Feb 2019):
Microsoft 10-K filing (Aug 2018):
Several companies and Governments are investing in fair and private ML…but responsible ML is hard.
Alphabet (Google’s) 10-K filing (Feb 2019):
Microsoft 10-K filing (Aug 2018):
Several companies and Governments are investing in fair and private ML…but responsible ML is hard.
Alphabet (Google’s) 10-K filing (Feb 2019):
Microsoft 10-K filing (Aug 2018):
Several companies and Governments are investing in fair and private ML…but responsible ML is hard.
Alphabet (Google’s) 10-K filing (Feb 2019):
Microsoft 10-K filing (Aug 2018):
Several companies and Governments are investing in fair and private ML…but responsible ML is hard.
Alphabet (Google’s) 10-K filing (Feb 2019):
Microsoft 10-K filing (Aug 2018):
How can Information Theory help?
Shannon established
information theory
Reliable data communication, processing, and storage
Fast development of ML
Emerging challenges in ML
1948
1950~2000s
2000s~2010s
Today
Shannon established
information theory
Reliable data communication, processing, and storage
Fast development of ML
Emerging challenges in ML
Can we apply information theory to address these new challenges?
The information-theoretic blueprint
Problem
Model
Mathematical analysis
Practice
The information-theoretic blueprint
Problem
Model
Mathematical analysis
Practice
Usually abstracts away computational considerations and focuses on underlying “information” modeled by probability measures
The information-theoretic blueprint
Problem
Model
Mathematical analysis
Practice
Usually abstracts away computational considerations and focuses on underlying “information” modeled by probability measures
Tools from probability theory, statistics, optimization, functional analysis, discrete math, etc.
The information-theoretic blueprint
Problem
Model
Mathematical analysis
Practice
Usually abstracts away computational considerations and focuses on underlying “information” modeled by probability measures
Tools from probability theory, statistics, optimization, functional analysis, discrete math, etc.
Operational limits, new algorithms, coding techniques, etc.
The information-theoretic blueprint
Problem
Model
Mathematical analysis
Practice
Today’s goal:
Demonstrate that metrics and methods widely used in private and fair ML can be understood using familiar information-theoretic tools and analyzed using the IT blueprint
Imagine that you are a data scientist…
Imagine that you are a data scientist…
Model
(e.g. past grades, questionnaire answers)
(e.g. academic performance)
Classifier
Model
(e.g. past grades, questionnaire answers)
(e.g. academic performance)
Classifier
Examples:
Model
Classifier
(e.g. past grades, questionnaire answers)
(e.g. academic performance)
“Training” the model
Model
Classifier
Model parameters
“Training” the model
Model
Classifier
Model parameters
Training dataset
“Training” the model
Model
Classifier
vs.
“Training” the model
Model
Classifier
vs.
Empirical loss minimization:
Challenge 1: Privacy
Model
Classifier
vs.
Empirical loss minimization:
Model parameters may reveal private information about the training dataset
Challenge 1: Privacy
Training
Challenge 1: Privacy
Training
Challenge 1: Privacy
Training
Differential privacy: neighboring datasets cannot be (statistically) distinguished
Challenge 2: Fairness
Model
Classifier
(e.g. past grades, questionnaire answers)
(e.g. academic performance)
Challenge 2: Fairness
Model
Classifier
(e.g. past grades, questionnaire answers)
(e.g. academic performance)
(e.g. sex, age, race)
Should I use the group attribute as an input to the model?
Challenge 2: Fairness
Does the model performance change conditioned on a group attribute?
Model
Classifier
(e.g. past grades, questionnaire answers)
(e.g. academic performance)
(e.g. sex, age, race)
Tutorial Outline
Tutorial Outline
Part II
Part III
Part IV
Part V
Challenge
Focus
Privacy
Fairness
Metrics
Methods
Tutorial Outline
Part II
Part III
Part IV
Part V
Challenge
Focus
Privacy
Fairness
Metrics
Methods
Tutorial Outline
Part II
Part III
Part IV
Part V
Challenge
Focus
Privacy
Fairness
Metrics
Methods
Tutorial Outline
Metrics
Tutorial Outline
Metrics + Methods
By the end of the tutorial, you will be able to…
Admistrivia
Website: https://sites.google.com/view/isit2022tutorial/home
Slack Channel: see website
Code will be available soon!