W03: Teaching the Investigation Process to Improve Statistical Reasoning
Todd Swanson
Hope College
swansont@hope.edu
Allan Rossman
Cal Poly – San Luis Obispo
arossman@calpoly.edu
Acknowledgement
Workshop goals
GAISE
GAISE recommendations
Our Schedule
The six steps of the statistical investigation process
Comparing Two Means
Do dung beetles navigate by the stars?
Dung Beetles
Step 1: Ask a research question
On a dark, moonless night, are dung beetles able to navigate using the stars?
Step 2: Design a study and collect data
Hypotheses
Parameters
Step 3: Explore the data
Here are all 18 times (in seconds), regardless of cap type
Mean = 84.66 sec
SD = 46.93 sec
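As a quick check, the summary statistics can be recomputed from the 18 recorded times. This is a minimal sketch using only Python's standard library; the variable names are illustrative, and the values are the times listed in the study data (black-cap beetles first, then clear-cap beetles).

```python
import statistics

# Times in seconds for all 18 beetles, as listed in the study data
# (first nine: black caps; last nine: clear caps).
times = [
    152.21, 123.61, 112.78, 123.56, 156.99, 114.29, 131.54, 84.18, 139.77,
    38.46, 34.20, 58.13, 43.77, 16.17, 70.70, 37.23, 49.50, 36.86,
]

mean_time = statistics.mean(times)
sd_time = statistics.stdev(times)  # sample SD (divides by n - 1)

print(f"n = {len(times)}")
print(f"mean = {mean_time:.2f} sec")
print(f"SD = {sd_time:.2f} sec")
```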
Step 3: Explore the data
times are in seconds
Does cap type help explain variability in times?
R²
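One way to quantify how much cap type explains is sketched below. It assumes R² here means the proportion of total variation in times accounted for by the group means, that is, 1 − SSE/SST; the slide itself gives only the label, so treat this as an illustration rather than the presenters' exact calculation. Variable names are hypothetical.

```python
import statistics

# Times in seconds, by cap type, as listed in the study data.
black = [152.21, 123.61, 112.78, 123.56, 156.99, 114.29, 131.54, 84.18, 139.77]
clear = [38.46, 34.20, 58.13, 43.77, 16.17, 70.70, 37.23, 49.50, 36.86]
all_times = black + clear

# Total variation: squared deviations from the overall mean.
grand_mean = statistics.mean(all_times)
sst = sum((t - grand_mean) ** 2 for t in all_times)

# Unexplained variation: squared deviations from each group's own mean.
sse = 0.0
for group in (black, clear):
    group_mean = statistics.mean(group)
    sse += sum((t - group_mean) ** 2 for t in group)

# Assumed definition: proportion of variation explained by cap type.
r_squared = 1 - sse / sst
print(f"R^2 = {r_squared:.3f}")
```

A value near 1 would indicate that knowing the cap type removes most of the variability in times; a value near 0 would indicate it removes almost none.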
Step 4: Make inferences beyond the data
The Need for Inference
The 3-S Strategy
18 dung beetles were used in the study.
They were randomly assigned to two groups: black caps were placed on 9 of them and clear caps were placed on the other 9.
Each beetle was placed on top of a dung ball at the center of a circular arena and timed to see how many seconds it took to reach the edge of the arena.
| Black Caps (sec) | Clear Caps (sec) |
| 152.21 | 38.46 |
| 123.61 | 34.20 |
| 112.78 | 58.13 |
| 123.56 | 43.77 |
| 156.99 | 16.17 |
| 114.29 | 70.70 |
| 131.54 | 37.23 |
| 84.18 | 49.50 |
| 139.77 | 36.86 |
Simulate
Shuffled Differences in Means (shuffles 1 to 3 of the 18 times above)
Strength of Evidence
20.1
-18.6
-5.6
-15.2
30.0
0.5
4.6
-2.3
-2.0
-4.7
-6.9
-6.7
-10.2
-6.7
-1.2
-9.9
5.6
-1.9
12.9
1.6
1.3
4.3
2.0
10.0
0.2
3.3
6.9
None of the 30 simulated statistics is as large as or larger than our observed difference in means of 83.77, so the p-value from this null distribution is 0/30 = 0.
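The shuffling procedure can be sketched in code as follows. This is a minimal illustration, not the applet's implementation: the function names, the seed, and the choice of 1,000 shuffles (rather than 30) are assumptions, and the p-value is one-sided, counting shuffled differences at least as large as the observed one.

```python
import random
import statistics

# Times in seconds, by cap type, as listed in the study data.
black = [152.21, 123.61, 112.78, 123.56, 156.99, 114.29, 131.54, 84.18, 139.77]
clear = [38.46, 34.20, 58.13, 43.77, 16.17, 70.70, 37.23, 49.50, 36.86]


def diff_in_means(group_a, group_b):
    """Statistic: mean of group_a minus mean of group_b."""
    return statistics.mean(group_a) - statistics.mean(group_b)


def shuffle_differences(group_a, group_b, reps, seed=0):
    """Re-deal the pooled times into two groups `reps` times (hypothetical helper)."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    diffs = []
    for _ in range(reps):
        rng.shuffle(pooled)  # mimics re-randomizing caps under the null
        diffs.append(diff_in_means(pooled[:n_a], pooled[n_a:]))
    return diffs


observed = diff_in_means(black, clear)
null_diffs = shuffle_differences(black, clear, reps=1000)

# Strength of evidence: proportion of shuffles at least as extreme as observed.
p_value = sum(d >= observed for d in null_diffs) / len(null_diffs)

print(f"observed difference = {observed:.2f} sec")
print(f"approximate p-value = {p_value:.3f}")
```

Because every black-cap time exceeds every clear-cap time, the observed difference is the largest one any shuffle can produce, which is why shuffled differences this extreme are so rare.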
Shuffled Differences in Means
Multiple Means Applet
Strength of Evidence
Step 5: Formulate Conclusions
Generalization: Was the sample randomly selected from a larger population?
Causation: Were the observational units randomly assigned to treatments?
Step 6: Look back and ahead
Comparing Two Proportions
Are metal bands used for tagging harmful to penguins?
Banding Penguins
Research Question
Hypotheses
Partial Results
Results
Why might a smaller proportion of banded penguins survive?
Simulate statistics
| | Banded | Unbanded | Total |
| Survived | ? | ? | 47 |
| Died | ? | ? | 53 |
| Total | 50 | 50 | 100 |
Banded Unbanded
66.7% Survived
33.3% Survived
(30 cards: 15 marked Survived, 15 marked Died)
60.0% Survived
40.0% Survived
0.600 – 0.400 = 0.200
Difference in Simulated Proportions
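The card-shuffling idea can be sketched for the full study using only the table margins (47 survived, 53 died, 50 banded, 50 unbanded). This is an illustrative sketch, not the applet's code: the function name, seed, and number of repetitions are assumptions, and no observed cell counts are used because the table leaves them as question marks.

```python
import random


def shuffled_diff_in_proportions(n_survived, n_died, n_banded, rng):
    """One shuffle: deal outcome cards at random into banded and unbanded groups.

    Returns (proportion survived, banded) minus (proportion survived, unbanded).
    Hypothetical helper written for this sketch.
    """
    cards = [1] * n_survived + [0] * n_died  # 1 = survived, 0 = died
    rng.shuffle(cards)
    banded, unbanded = cards[:n_banded], cards[n_banded:]
    return sum(banded) / len(banded) - sum(unbanded) / len(unbanded)


rng = random.Random(1)  # fixed seed so the sketch is reproducible
sim_diffs = [
    shuffled_diff_in_proportions(n_survived=47, n_died=53, n_banded=50, rng=rng)
    for _ in range(1000)
]

# Under the null hypothesis of no banding effect, the simulated
# differences should be centered near zero.
center = sum(sim_diffs) / len(sim_diffs)
print(f"mean of simulated differences = {center:.3f}")
print(f"smallest = {min(sim_diffs):.2f}, largest = {max(sim_diffs):.2f}")
```

The p-value would then be the proportion of simulated differences at least as extreme as the observed difference in survival proportions, in the direction stated by the alternative hypothesis.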
Applet
https://www.rossmanchance.com/applets/2021/chisqshuffle/ChiSqShuffle.htm?penguins=1
Conclusion
Generalization and Causation
Multivariable Thinking in Intro Stats
2016 GAISE Guidelines
Where do we include multivariable thinking in our intro course?
Comparing two groups (quantitative response)
Results for their GPAs
Comparing GPAs (Breakfast and no Breakfast)
Let’s add a third variable
GPA Results for Female Students (Breakfast or not)
GPA Results for Male Students (Breakfast or not)
Comparing Two Groups: Binary Response
Results from National Occupant Protection Use Survey
Do State Laws Have an Impact?
Adding a Third Variable (enforcement) to our Plot
COVID and Vaccination Status
| | Unvaccinated | Vaccinated | Total |
| Died | 253 (0.167%) | 481 (0.411%) | 734 |
| Survived | 150,799 | 116,633 | 267,432 |
| Total | 151,052 | 117,114 | 268,166 |
These cases involve the Delta variant of SARS-CoV-2 in England from Feb 1, 2021 to Aug 2, 2021.
| | Unvaccinated | Vaccinated | Total |
| Died | 48 (0.033%) | 21 (0.023%) | 69 |
| Survived | 147,564 | 89,786 | 237,350 |
| Total | 147,612 | 89,807 | 237,419 |
Younger than 50 years old
| | Unvaccinated | Vaccinated | Total |
| Died | 205 (5.96%) | 460 (1.62%) | 665 |
| Survived | 3,235 | 27,870 | 31,105 |
| Total | 3,440 | 28,330 | 31,770 |
50 years old or older
Simpson’s Paradox