1 of 13

API CAN CODE �Computational Foundations of �Data Science

Lesson 2.1: What is Data Science?

This work was made possible through generous support from the National Science Foundation (Award # 2141655).

2 of 13

Class Discussion - Data Science

What do you think of when you think of Data Science?

2

3 of 13

Food Deserts in DC

  • Read the article Food Deserts in DC to understand the problem and consider further information to explore.�
  • What question do you want to answer? Some ideas:
    • Where is the “worst” food desert?
    • Do food deserts have patterns? (income, car, access)
    • Are there solutions?

3

A food desert is a geographical area where it is difficult to buy healthy, nutritious food at an affordable price.

i

4 of 13

Food Deserts in DC

  • Use the OpenDataDC dataset Grocery Store Locations to identify the number of grocery stores and locations by ward.

4

5 of 13

Food Deserts in DC

  • Using locations of grocery stores from the OpenDataDC dataset, students identify the closest Metro stop to each grocery store, and calculate distances. �
  • Enter your data into this Google Sheet, and create any graphs or analysis you want to present a finding you find interesting

5

6 of 13

Food Deserts in DC

  • Present your findings to the class. What answer do you have for your original question? �
  • What further information would you want to continue your investigation?

6

7 of 13

TikTok Viral Video Tracking

Watch the first half (0:00 - 9:30) of What happens after TikTok songs go viral?and discuss:

  • What data do you expect them to collect?
  • Who are the stakeholders?
  • Why are record companies trying to �sign artists with viral TikTok hits?
  • How do you think they know who to �sign?
  • What artist is going viral this week?

7

8 of 13

TikTok Viral Video Tracking

Watch the second half (9:30 - end) of What happens after TikTok songs go viral?and discuss (+ next slide):�

  • Spotify releases “editorial playlists” put out by�the company. What effect do you think this has on�record companies’ control on the music market?�
  • Does this create a better or worse�situation for independent artists?

8

9 of 13

TikTok Viral Video Tracking

Watch the second half (9:30 - end) of What happens after TikTok songs go viral?and discuss (continued):

9

  • What are the sources of Primary Data mentioned in the video?
  • What are the sources of Secondary Data mentioned in the video?

10 of 13

Data Science as Intersection

  • Data science is the overlap of programming, statistics, machine learning/AI, and applications to different fields�
  • It includes data processing, data visualization, working with probability and other skills

10

11 of 13

Data Science Investigations

  • Data Science investigations are ongoing processes from Problem or Question, to Data Collection, to Data Analysis, to Communicating findings�
  • The process typically creates more questions!

  • In this course you will �conduct such investigations

11

Problem

Interpret and �Communicate

Analyze

Data

Collect

Data

Data Science Investigation

12 of 13

Exit Ticket

  1. Find an interesting dataset within OpenDataDC. Paste the link to the dataset in the first response.�
  2. What does this dataset include? (variable types? what kind of source is it? etc.)�
  3. Why do you think this dataset is important/interesting?�
  4. What is one question you have about this data?

12

13 of 13

Thanks!

apicancode@umd.edu

13

This work was made possible through generous support from the National Science Foundation (Award # 2141655).

API Can Code is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike

4.0 International (CC BY-NC-SA 4.0) License