1 of 30

Data Ethics: �Choices and Values

Veronica A. Rivera, Ph.D.

McCoy Family Center for Ethics in Society - HAI

Original slides and content by Kathleen Creel, Diana Acosta-Navas, and Benjamin Xie

2 of 30

We use data to inform our decisions

    • Evidence-based
    • Impartial
    • Reliable

2

3 of 30

What can we learn from a data set?

  • Patterns
  • Correlations
  • Distributions

  • Values

3

4 of 30

4

5 of 30

Data is intrinsically values-laden

5

6 of 30

VALUES IN DESIGN

6

PROBLEM FORMULATION

DATA INTERPRETATION

DATA

COLLECTION

7 of 30

What are values?

  • value (n): an individual or community’s belief about what matters
  • Values express what we care about
    • Efficiency
    • Privacy
    • Truth
    • Security
    • Beauty
    • Fairness
  • Values reveal our assumptions about the world, people interacting with our designs, and how our choices affect them

7

8 of 30

Intentionality of values

  • Explicit values: Values that designers intend for their products to embody

  • Collateral values: Values that crop up as side effects of design decisions and the way users interact with those designs. These values are not intentionally designed into the system.

8

9 of 30

PROBLEM FORMULATION

9

10 of 30

Problem Formulation Statements

  • Formulating a problem means treating the desired solution as good or worthy of being done.
  • Why should we care about solving this problem?
  • Who can agree that this is a problem worth solving?
  • Who would benefit from its solution?

10

11 of 30

Example: Unsubscribe feature

  1. People don’t like our app/service & it should be easy for them to unsubscribe
    • Potential design: Unsubscribe button on app homepage. Single click and user is unsubscribed.

  • The user doesn’t like our app/service, we want to understand why so we can make it better for future users
    • Potential design: Unsubscribe button on app homepage 🡪 10 minute questionnaire 🡪 talk with a representative

11

12 of 30

Example: Course evaluations

  1. Students have different learning needs. We want to help students decide which courses best fit their learning style�
  2. We want to promote and hire good teachers

  • We want teachers to have the info they need to improve their classes

12

13 of 30

DATA COLLECTION

13

14 of 30

What is data bias: Two definitions

  • Statistical: Difference between measured and “true” value
  • Social: Human-created biases, such as stereotypes, that arise through embedding of values.

14

15 of 30

Examples of social biases in data

Confirmation bias

When we favor information that confirms or strengthens our beliefs or values

15

Selection bias

When the selection of study participants or data is not randomized, so the sample is not representative of the entire population intended to be analyzed

Survivorship bias

A form of selection bias in which ”winners” are overly focused on in a sample

16 of 30

Example: confirmation & selection bias

Confirmation bias

Selection bias

16

17 of 30

DATA INTERPRETATION

17

18 of 30

Descriptive and normative terms

18

19 of 30

Descriptive vs. Normative Language

Descriptive language

  • Statements of fact
  • What people did
  • What happened

19

  • “Lectures are 90-minutes long”

  • “Assignments take more than two hours to finish”

  • “Sections are mandatory”

20 of 30

20

Attendance:

Not

Mandatory

Textbook

Required

21 of 30

Descriptive vs. Normative Language

Normative language:

  • Evaluative statements
  • Express the speaker’s opinions/reactions
  • How they think things should be

21

  • “right”

  • “wrong”

  • “good”

  • “bad”

  • “should”

  • “should not”

22 of 30

22

AWESOME

GREAT

TEACHER

23 of 30

Normative language:

  • Evaluative statements
  • Express the speaker’s opinions/reactions
  • How things should be

Descriptive language

  • Statements of fact
  • What people did
  • What happened
  • How things “are”

23

24 of 30

24

CLEAR

EASY TO

LISTEN TO

24

Distinction between normative & descriptive language is not always clean-cut

25 of 30

Thick Normative Terms

Descriptive AND normative:

  • Thick normative terms express morally or aesthetically “loaded” descriptions

  • Cowardly
  • Cautious
  • Polite
  • Rude
  • Chill
  • Kind
  • Caring
  • Smart
  • Knowledgeable
  • Professional

25

26 of 30

Example: toxic speech classification & context

26

Bears suck!!!

27 of 30

We should not get rid of normative terms altogether

27

28 of 30

I want to collect & analyze data to solve a problem. What should I do?

28

Check for social biases in your experiment/research setup

Look for embedding of values in data during analysis

Work with multiple stakeholders/people to identify interesting problems

29 of 30

Explicit Values

29

Health

Safety

Efficiency

Public interest

Contact-tracing

30 of 30

Collateral Values

  • Security?
    • Where is information stored?
    • Encryption?
  • Privacy?
    • Who has access to information?
    • Geolocation or bluetooth?
    • What information is accessible to health authorities/ the public?
  • Autonomy?
    • Informed consent?

30