1 of 18

API CAN CODE �Data in Learners’ Lives

Lesson 3: Using Data

This work was made possible through generous support from the National Science Foundation (Award # 2141655).

2 of 18

Warmup

  • Read this article: User data stolen from 23andme
  • What stakeholders are involved?
  • What issues of privacy are �highlighted?
  • How could those involved �have mitigated (decreased) �this risk?
    • In other words: how could�they have better protected�user data privacy?

2

3 of 18

Lesson 1.2 Recap

  • Last lesson, we talked about different kinds of data that we create and consume, and who may be interested in collecting that data�
  • We also discussed some instances where data collection may under-represent important parts of the population, and what effect that might have on the conclusions of those studies

3

4 of 18

From Raw Data to Wisdom

The DIKW Model:

4

Data

Information

Knowledge

Wisdom

i

Numbers and texts without context

Processed data with context

Information acquired by experience

Analysis of �complex knowledge structures

5 of 18

The Epidemic Outbreak

In 1845 there was an outbreak of a cholera epidemic in London.

Within 10 days, more than 500 people died in that neighborhood.

Epidemiologist John Snow realized that the water system was the cause of the outbreak.

How did he figure that out?

Cholera epidemic is a widespread outbreak of a severe diarrheal disease caused by the bacterium Vibrio cholerae.

i

5

6 of 18

The Epidemic Outbreak

John Snow marked the map with all �the deaths as a bar graph.

He found that victims increase near �the Broad Street water pump.

This discovery led to the public’s �conviction of the necessity of a �sewage system.

What’s a recent or local issue like this that you might be interested in exploring?

6

7 of 18

The Epidemic Outbreak

7

Data

Information

Knowledge

Wisdom

i

The number of victims in the area of impact

Mapping the data on the map

Identifying patterns and the source of the problem

A municipal sewer system is needed to improve sanitation

8 of 18

DIKW - COVID-19 Dashboard

Take ~10 minutes to review the DC COVID-19 Data from July 18, 2020 and answer:

What do you recognize on this page? �

What did you learn? �

What are you left wondering about?

8

9 of 18

DIKW - COVID-19 Dashboard

9

10 of 18

DIKW - COVID-19 Dashboard

10

11 of 18

DIKW - COVID-19 Dashboard

11

12 of 18

Heat Sensitivity Exposure Index

Heat Sensitivity Exposure is a health issue in many cities worldwide.

i

12

13 of 18

DIKW - Heat Sensitivity Exposure Index

Review the Heat Sensitivity Exposure Index dataset on OpenDataDC. Answer:�

  • What type of variable is the ”Total 2020 census population?
  • What type of variable is the ”Percent of population below 5 years of age?

13

14 of 18

DIKW - Heat Sensitivity Exposure Index

Review the Heat Sensitivity Exposure Index dataset on OpenDataDC.

�What variables are stored in this dataset? Are they quantitative or qualitative variables?�

At what stage of analysis are we in the DIKW model?�

What would we need to do to progress through the later stages?

What kinds of questions do you think we could answer with this dataset?

14

15 of 18

Local Issue for Investigation

  • Talk with students around you.

  • What are some local issues you might want to investigate with data? �You could use data that already exists, or �collect new data. �
  • Write down a few ideas in your notes so that �you can return to the ideas in later lessons.

15

16 of 18

Conclusion

  • What does each stage of the DIKW model look like?

  • Why do we need to analyze data instead of using it in its raw form?

  • Does the process of analysis always produce useful or correct wisdom? Why or why not?

16

17 of 18

Exit Ticket

  • What stage of the DIKW model does this graph represent?�
  • What do you think the cases or data �in this investigation were?

17

18 of 18

Thanks!

apicancode@umd.edu

18

This work was made possible through generous support from the National Science Foundation (Award # 2141655).

API Can Code is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike

4.0 International (CC BY-NC-SA 4.0) License