1 of 21

Introduction to Statistics

2 of 21

Objectives

  • How Can We Work With Data?
  • Finding Meaning in Data
  • Your Turn

2

3 of 21

How Can We Work With Data?

We are inundated by data. Every. Single. Day.

1

4 of 21

Consider the Datasets

  • For each dataset:
  • Describe it
  • Be as precise as possible
  • The gold standard is for someone else to recreate the data from your description
  • If that’s not possible, how close can you get to the standard?

4

5 of 21

Dataset 1

5

6 of 21

Dataset 2

6

7 of 21

Dataset 3

7

8 of 21

Differences in Datasets

  • All three datasets have different characteristics
  • What makes them similar?
  • What makes them different?
  • They were all randomly generated
    • Using what rules?
  • How can this be quantified?
  • Made describable?
  • These were random - there is no meaning 😢

8

9 of 21

Discrete vs Continuous Data

  • Each of the datasets was discrete
    • Values can only take on specific values
    • Gaps exist in the data
  • Continuous data smoothly varies

  • Examples
    • Stock market prices vs height of a baby

9

10 of 21

Sampling

  • Continuous data can be sampled
    • Leads to discrete datasets
  • Sampling loses information from the underlying scenario
  • There are different types of sampling
    • Scales of samples
    • Biases introduced
    • More later

10

11 of 21

Datasets

  • Dataset 1
  • Dataset 2
  • Dataset 3

11

12 of 21

Alway Visualize!

  • Software tools will help!
  • TI-83
    • Used for IB exams
    • Practice with this even if you use other tools
    • Stats on the Graphing Calculator (also on class webpage)
  • Google Sheets
    • The most useful mathematical tool in your future!
  • Desmos and their statistics functions and tutorials
    • Limited to 1000 datapoints
  • GNU Octave (requires programming)
  • R at r-project.org for more sophisticated requirements (requires programming)

12

13 of 21

Finding Meaning in Data

What do we see with these numbers?

2

14 of 21

Some Real Data - World Water Stress Levels

  • What do you notice?
  • What do you wonder?
  • Capture the graph and main ideas
  • Discuss with neighbour
  • Full-sized picture

14

15 of 21

World Water Stress Levels

  • Many high population cities might run out of water
  • Increased stress in part due to climate change
  • Water management can be improved with conservation, distribution systems, recycling, wetlands revitalization, less water-intensive crops
  • How can you use data like this to influence social policy?
  • Interactives including projections
  • Source

15

16 of 21

Crime and Undocumented Immigrants

  • What do you notice?
  • What do you wonder?
  • Capture the graph and main ideas
  • Discuss with neighbour
  • Full-sized picture

16

17 of 21

Crime and Undocumented Immigrants

  • Is there a correlation between crime and undocumented immigrants?
  • How accurate can the horizontal axis be?
    • The number of undocumented immigrants, by definition, will be hard to count
  • What does it mean that the data has outliers removed?
  • What narratives can be generated about this data
    • Are these narratives in conflict with each other?
  • Original NYTimes Article
  • Source

17

18 of 21

Your Turn

What is important to you?

3

19 of 21

Your Turn

  • Create a Google Doc
  • Title it “social phenomena”
  • Write down three social phenomena that are important to you or affecting your life
  • Take a stand!
    • Write an assertion about each phenomenon

19

20 of 21

Your Turn

  • For each phenomenon:
    • Search for related data
    • What do you notice?
    • What do you wonder?
    • What is the purpose of the data?
    • How reliable is the data?
    • Does it increase or decrease support for your assertion?
  • Share your assertions and supporting data for the next class

20

21 of 21

Credits

Special thanks to all the people who made and released these awesome resources for free:

  • Presentation template by SlidesCarnival
  • Photographs by Unsplash

21