Data Ethics: �Choices and Values
Veronica A. Rivera, Ph.D.
McCoy Family Center for Ethics in Society - HAI
Original slides and content by Kathleen Creel, Diana Acosta-Navas, and Benjamin Xie
We use data to inform our decisions
2
What can we learn from a data set?
3
4
Data is intrinsically values-laden
5
VALUES IN DESIGN
6
PROBLEM FORMULATION
DATA INTERPRETATION
DATA
COLLECTION
What are values?
7
Intentionality of values
8
PROBLEM FORMULATION
9
Problem Formulation Statements
10
Example: Unsubscribe feature
11
Example: Course evaluations
12
DATA COLLECTION
13
What is data bias: Two definitions
14
Examples of social biases in data
Confirmation bias
When we favor information that confirms or strengthens our beliefs or values
15
Selection bias
When the selection of study participants or data is not randomized, so the sample is not representative of the entire population intended to be analyzed
Survivorship bias
A form of selection bias in which ”winners” are overly focused on in a sample
Example: confirmation & selection bias
Confirmation bias
Selection bias
16
DATA INTERPRETATION
17
Descriptive and normative terms
18
Descriptive vs. Normative Language
Descriptive language
19
20
Attendance:
Not
Mandatory
Textbook
Required
Descriptive vs. Normative Language
Normative language:
21
22
AWESOME
GREAT
TEACHER
Normative language:
Descriptive language
23
24
CLEAR
EASY TO
LISTEN TO
24
Distinction between normative & descriptive language is not always clean-cut
Thick Normative Terms
Descriptive AND normative:
25
Example: toxic speech classification & context
26
Bears suck!!!
We should not get rid of normative terms altogether
27
I want to collect & analyze data to solve a problem. What should I do?
28
Check for social biases in your experiment/research setup
Look for embedding of values in data during analysis
Work with multiple stakeholders/people to identify interesting problems
Explicit Values
29
Health
Safety
Efficiency
Public interest
Contact-tracing
Collateral Values
30