1 of 31

STAT-155 Quiz 1 Review

9/21/2025

2 of 31

Overview

Descriptive Statistics

Plots

Simple Linear Regression

Interpretations

Model Evaluation

Transformations (Optional)

1

2

3

4

5

6

3 of 31

National Health and Nutrition Examination Survey

Data Context

4 of 31

Descriptive Statistics

Standard Deviation

Mean

Median

The average. Sensitive to outliers

The middle. Resistant to outliers

Measure of variation from the mean

5 of 31

Descriptive Statistics

Range

Maximum

Minimum

The maximum number for a variable

The minimum number for a variable

Max - Min. Measure of spread in our data

6 of 31

7 of 31

8 of 31

9 of 31

10 of 31

11 of 31

Simple Linear Regression

E[DaysMentHlthBad | SleepHrsNight] = 10.99461 - 0.98457(SleepHrsNight)

12 of 31

Intercept Interpretation

On average, we expect Y to be 𝛽₀ y-units for groups with X = 0.

(Intercept) 10.99461

SleepHrsNight -0.98457

13 of 31

Intercept Interpretation

On average, we expect Y to be 𝛽₀ y-units for groups with X = 0.

On average, we expect an individual to

report 10.99461 bad mental health days within

the last 30 days for those that get 0 hours of

sleep a night.

14 of 31

Slope Interpretation

On average, we expect a 1 x-unit increase in X to be associated with a 𝛽₁ y-unit increase in Y.

(Intercept) 10.99461

SleepHrsNight -0.98457

15 of 31

Slope Interpretation

On average, we expect a 1 x-unit increase in X to be associated with a 𝛽₁ y-unit increase in Y.

On average, we expect a 1 hour increase in

hours of sleep a night to be associated with

a 0.98457 day decrease in the amount of

reported bad mental health days.

16 of 31

Intercept Interpretation - Categorical Predictor

On average, we expect Y to be 𝛽₀ y-units for groups that are the reference category.

(Intercept) 30445 (8th grade edu. reference)

17 of 31

Intercept Interpretation - Categorical Predictor

On average, we expect Y to be 𝛽₀ y-units for groups that are the reference category.

(Intercept) 30445 (8th grade edu. reference)

On average, we expect those that have an

8th grade education to have an average HH

Income of 30445.

18 of 31

Slope Interpretation - Categorical Predictor

On average, we expect a difference between the group we’re looking at and the reference category to be associated with a 𝛽₁ y-unit increase in Y.

Education High School 17403 (8th grade edu. reference)

19 of 31

Slope Interpretation - Categorical Predictor

On average, we expect a difference between the group we’re looking at and the reference category to be associated with a 𝛽₁ y-unit increase in Y.

Education High School 17403 (8th grade edu. reference)

On average, we expect those that have a high

school education to have a HH income that is

17403 dollars higher than those that have a 8th

grade education.

20 of 31

21 of 31

22 of 31

Model Evaluation - R²

The percentage of variation in Y that can be explained by the variation in X

Multiple R-Squared: 0.02823

23 of 31

Model Evaluation - R²

The percentage of variation in Y that can be explained by the variation in X

2.8% of the variation in reported days

of bad mental health within the last 30

days can be explained by the variation

in the reported hours of sleep a night.

24 of 31

Model Evaluation - Residual/Fitted Plots

Is this model wrong? strong? fair?

Is this model wrong? strong? fair? R^2 =0.5609

25 of 31

Model Evaluation - Residual/Fitted Plots

Is this model wrong? strong? fair? R^2 =0.5609

Wrong - Answers may vary. Line is mostly centered on .resid = 0, but the predictions get crazy at about .fitted = 60.

Strong: Depends on context. R^2 is 0.56, so the model does an okay job at the very least.

Fair: Depends on data. What’s going on in the green circle? Red circle?

26 of 31

Transformations - Location

The intercept is roughly -92. If the minimum height is about 100cm, what would a logical transformation be?

27 of 31

Transformations - Location

Intercept is now positive, which in this context makes it meaningful.

28 of 31

Transformations - Scale

Notice how the scale on the X axis is from 0-1. A “one unit increase” would only be relevant for schools with a 0% and a 100% admission rate. What would a logical transformation be?

29 of 31

Transformations - Scale

Multiplying both graduation and admissions rates by 100 would make our slope easier to interpret.

30 of 31

Transformations - Log

Data is not linear, what transformation would be logical?

31 of 31

Transformations - Log

It turns out that if we log the x-axis in this case, the data becomes much more linear.