1 of 40

Jill VanderStoep�Hope College – Holland, MI�

EAPOST Workshop:

Test for a Single Mean

2 of 40

Overview

Information on our curriculum
How I run my class
Test for a Single Mean:

Exploring Quantitative Data
Simulation-Based Approach Exploration

Questions

3 of 40

Information on our Curriculum

Created to follow GAISE
Begins with 4 pillars of inference:

Significance
Estimation
Generalization
Causation

Features a spiral approach using the 6 steps of statistical investigation
Utilizes simulation to actively understand the investigation process

4 of 40

Information on our Curriculum

The 6 steps of statistical investigation

Ask a research question
Design a study and collect data
Explore the data
Draw inferences beyond the data (Logic of inference: Significance and estimation)
Formulate conclusions (Scope of inference: Generalization and causation)
Look back and ahead

5 of 40

How I Run my Class

15 weeks instruction/1 week final exams
First 5 weeks, cover the 4 pillars: significance, estimation, generalization, causation.
Second 5 weeks, explore inference comparing two groups (including matched pairs)
Third 5 weeks, explore inference comparing multiple groups as well as regression

6 of 40

How I Run my Class

Students read or watch videos on the upcoming class content through an example.
They take a reading/video quiz before class on that content
Ideally they start the HW
In class we recap the content from the example
In class we go through an exploration of a different content
After class they complete HW

7 of 40

How I Run my Class

Test for a single mean is typically week 3

8 of 40

Test for a Single Mean: Exploring Quantitative Data

9 of 40

Beth Chance workshop on sampling distribution of a single mean linked here

10 of 40

Test for a Single Mean: Simulation-Based Approach Exploration

11 of 40

Backpack Weights

12 of 40

Backpack Weights

Carrying a heavy backpack can be a source of chronic, low-level trauma and can cause long-lasting shoulder, neck, and back pain.
The American Academy of Orthopedic Surgeons recommends that a backpack not weigh more than 10% to 15% of the wearer’s body weight.
College students at a public university in California (about 20,000 enrollment) wondered if college students at their university were following these backpack weight guidelines.

13 of 40

Backpack Weights

STEP 1: Ask a research question
Do university students carry backpacks that weigh on average something different than the recommended 10% of their body weight?
Let’s take a closer look at this study

14 of 40

Backpack Weights

STEP 2: Design a study and collect data
Each of 4 consecutive days, student researchers set up a scale at a different location on their campus. At each location, data was gathered on students who agreed to participate until they had data on 25 students at that location (100 total students).
Each student had their backpack weight recorded to the nearest pound.
Each student filled out a short survey that asked their weight in pounds, biological sex, what college they were in, their major, their year in school, whether they were a graduate or undergraduate student, their total course units, and whether they experienced any back problems.

15 of 40

Backpack Weights

What are the observational units?
The observational units are the 100 students.
What measured variables are categorical?
Biological sex, college, major, year in school, back problems.
What measured variables are quantitative?
The backpack weight (lbs), the student weight (lbs), the number of course units.

16 of 40

Backpack Weights

Other variables calculated
Percentage of student’s body weight the backpack is calculated as a ratio
Spread sheet/data sheet

17 of 40

Hypotheses

State the null and alternative hypotheses

Null hypothesis: The unknown long run mean ratio of backpack weight to body weight is 0.10
Alternative hypothesis: The unknown long run mean ratio of backpack weight to body weight is different from 0.10

18 of 40

Parameters

We can also use a symbol for the unknown parameter we are trying to estimate.
The parameter of interest is:

µ = the unknown long-run mean ratio of backpack weight to body weight

19 of 40

Hypotheses

Mu (µ) is the parameter for population mean ratio.
Using this symbol, we can rewrite the hypotheses as follows:

H₀: µ = 0.10

H_a:µ ≠ 0.10

20 of 40

Hypotheses

Remember

The hypotheses are about the parameter value in the population, not just for these 100 students.
Hypotheses are always about populations or processes, not the sample data.
Step 3: explore the data comes next & we will use the descriptive statistics applet

21 of 40

Step 3: Explore the data

	Sample size	Mean	SD	Min.	First Quart.	Med.	Third Quart.	Max.
Ratio	100	0.078	0.037	0.016	0.052	0.071	0.096	0.181

22 of 40

Results

The sample mean was different than 0.10
What are two possible explanations for why we observed a sample mean that was different from 0.10?
#1 The population mean ratio really is 0.10 and our sample mean of 0.078 from 100 students happened by random chance.
#2 The population mean ratio is different from 0.10 and that is why our sample mean ratio was different from 0.10.

23 of 40

Step 4: Draw inferences beyond the data

Is it possible to get a mean ratio of 0.078 even if in the population the ratio is 0.10?
Yes, it’s possible, how likely though?
A p-value will tell how likely it is to get a mean ratio that different from 0.10.
Let’s use our 3S strategy

25 of 40

Simulation

We can simulate samples of size 100 from a population where the mean ratio of backpack weight to body weight is 0.10
Create a hypothetical population of ratios
Assume the variability seen in our sample is the variability in the population
Make many, many copies of our sample
Slide this distribution to center on 0.10.

26 of 40

Simulation

Take repeated samples of size 100 from this hypothetical population each time calculating the mean ratio
Do this repeated sampling many times to develop a null distribution of mean ratios
Let’s see what this looks like

31 of 40

One mean applet

Let’s see how this process is done with the One Mean applet.
We will use the backpack/bodyweight ratio as the variable we calculated.

32 of 40

P-value

We should have obtained a null distribution like the one shown.
We can see that our observed statistic of 0.078 (or smaller) or the comparable value (0.122 or larger)didn’t even occur once in 1,000 samples.
Therefore, our p-value is less than 1/1,000 or approximately 0.

33 of 40

P-value

What does this p-value mean?
Assuming that the population mean ratio of backpack weight to body weight truly was 0.10, if we were to repeatedly take samples of size 100 and in each sample average the ratios, we would find the study sample mean ratio of 0.078 or one more extreme in about 0 of the repeated samples.
Therefore, we have very strong evidence that the backpack weight to body weight ratio is significantly different from 0.10

34 of 40

Standardized statistic

35 of 40

Theory-based analysis

36 of 40

Confidence Interval

37 of 40

Step 5: Formulate �Conclusions

Was the sample randomly selected from a larger population?

No, the sample was not randomly sampled from a larger population of university students. However, if you believe that university students in California are not different in their body composition or the amount of weight they carry in their backpacks compared to other university/college students across the US, then we can generalize to all college/university students in the US.

Were the observational units randomly assigned to treatments?

No, there was no random assignment in this study, so we cannot make any causal conclusions.

38 of 40

Conclusions

While we have strong evidence that the mean ratio is different from 0.10 (10%), is the difference impressive?
Recall that for children in elementary school and secondary school, pediatric surgeons recommended that the ratio of backpack weight to body weight not exceed 10% to 15%.
The average ratio of 0.08 (7.8%) is well below this recommendation.
It does seem that college/university students are carrying more reasonable weights in their backpacks.

39 of 40

Step 6: Look back and �ahead

Looking back: Did anything about the design and conclusions of this study concern you? In particular, are there things that could have been done to give a better chance finding strong evidence of a true difference between the two groups?

Always nice to have a larger sample size. A random sample. Researchers should try to replicate their results. These replicated studies could be at different colleges/universities across the U.S.

Looking ahead: What should the researchers’ next steps be to fix the limitations or build on this knowledge?

See if these results hold when other variables are looked at. For example do these ratios hold within subpopulations like males and females, or between different majors. Could backpack weights be decreasing due to e-books (all books on one laptop).

1 of 40

2 of 40

3 of 40

4 of 40

5 of 40

6 of 40

7 of 40

8 of 40

9 of 40

10 of 40

11 of 40

12 of 40

13 of 40

14 of 40

15 of 40

16 of 40

17 of 40

18 of 40

19 of 40

20 of 40

21 of 40

22 of 40

23 of 40

24 of 40

25 of 40

26 of 40

27 of 40

28 of 40

29 of 40

30 of 40

31 of 40

32 of 40

33 of 40

34 of 40

35 of 40

36 of 40

37 of 40

38 of 40

39 of 40

40 of 40