5.2 – Linear regression
www.doteducation.org +971569242777/ +971568161777/ +971525465652
1
2
Definition: Simple linear regression is used to analyze relationships between two variables.
Two Main Objectives:
3
Establish if there is a relationship between two variables.
ANALYSE AND WRITE RELATIONSHIP BETWEEN THE FOLLOWING :
Examples:
Income and spending :
wage and gender:
student height and exam scores.
We can use regression models to test this
4
Forecast new observations.
5
6
7
8
9
10
11
12
13
When is a Regression Line Suitable?
| | | | |
|
|
|
|
|
English mark
Maths mark
The smaller the total error, the better the straight line fits the data, and the more appropriate the linear model is to represent the data.
Model
When is a Regression Line unreliable?
| | | | |
|
|
|
|
|
Hours of revision
Test score
Model
“We should not use the model to make predictions about hours revised, because score depends on hours revised.”
Here the hours of revision is the explanatory/independent variable, and the test score the response/dependent variable. In general, use of the model should only be used to estimate a value of the response variable.
Comment on this statement.
I can use my regression line to predict to predict the number of hours someone has revised when they score 70% in the test.
When is a Regression Line unreliable?
|
|
|
|
|
Hours of revision
Test score
Model
“10 hours is outside the range of the data, so the estimate would be unreliable.”
Comment on this statement.
interpolation
extrapolation
extrapolation
I can use my regression line to predict to predict someone’s test score if they’ve done 10 hours of revision.
How do we interpret the gradient of 3?
How do we interpret the gradient of 3?
Regression line | Interpretation of gradient | Interpretation of y-intercept |
Hours revised h and maths mark m m=40+10h | For each extra hour revised, students achieve 10 marks better on average. | A student doing no revision would expect to score 40 marks. |
Daily average temperature t (in °C) and daily sales of coffee s (in $) s=5000-150t | Extra increase in temperature of 1°C results in an average decrease in sales of $150 | Sales would be $5000 when the temperature is at freezing point (0°C) |
a
b
Quickfire
Regression line | Interpretation of gradient | Interpretation of y-intercept |
Hours revised h and maths mark m m=40+10h | For each extra hour revised, students achieve 10 marks better on average. | A student doing no revision would expect to score 40 marks. |
Daily average temperature t (in °C) and daily sales of coffee s (in $) s=5000-150t | Extra increase in temperature of 1°C results in an average decrease in sales of $150 | Sales would be $5000 when the temperature is at freezing point (0°C) |
a
b
Quickfire
21
www.doteducation.org +971569242777/ +971568161777/ +971525465652
22
www.doteducation.org +971569242777/ +971568161777/ +971525465652
23
www.doteducation.org +971569242777/ +971568161777/ +971525465652
24
www.doteducation.org +971569242777/ +971568161777/ +971525465652
25
www.doteducation.org +971569242777/ +971568161777/ +971525465652
26
www.doteducation.org +971569242777/ +971568161777/ +971525465652
27
www.doteducation.org +971569242777/ +971568161777/ +971525465652
28
www.doteducation.org +971569242777/ +971568161777/ +971525465652
29
www.doteducation.org +971569242777/ +971568161777/ +971525465652
30
www.doteducation.org +971569242777/ +971568161777/ +971525465652
31
www.doteducation.org +971569242777/ +971568161777/ +971525465652
32
www.doteducation.org +971569242777/ +971568161777/ +971525465652
33
www.doteducation.org +971569242777/ +971568161777/ +971525465652
34
www.doteducation.org +971569242777/ +971568161777/ +971525465652
35
www.doteducation.org +971569242777/ +971568161777/ +971525465652
36
www.doteducation.org +971569242777/ +971568161777/ +971525465652
37
www.doteducation.org +971569242777/ +971568161777/ +971525465652
38