1 of 38

5.2 – Linear regression

www.doteducation.org +971569242777/ +971568161777/ +971525465652

1

2 of 38

2

Definition: Simple linear regression is used to analyze relationships between two variables.

Two Main Objectives:

  1. Identify Relationships – Determine if a statistically significant relationship exists between two variables (positive or negative correlation).
  2. Make Predictions – Use the relationship to forecast new observations.

3 of 38

3

Establish if there is a relationship between two variables.

  • More specifically, establish if there is a statistically significant relationship between the two.

ANALYSE AND WRITE RELATIONSHIP BETWEEN THE FOLLOWING :

Examples:

Income and spending :

wage and gender:

student height and exam scores.

We can use regression models to test this

4 of 38

4

Forecast new observations.

  • Can we use what we know about the relationship to forecast unobserved values?

  • Examples:
  • What will sales be over the next quarter?

  • What will the ROI of a new store opening be contingent on store attributes

5 of 38

5

6 of 38

6

7 of 38

7

8 of 38

8

9 of 38

9

10 of 38

10

11 of 38

11

12 of 38

12

13 of 38

13

14 of 38

When is a Regression Line Suitable?

 

 

 

English mark

Maths mark

 

The smaller the total error, the better the straight line fits the data, and the more appropriate the linear model is to represent the data.

Model

15 of 38

When is a Regression Line unreliable?

 

 

Hours of revision

Test score

Model

“We should not use the model to make predictions about hours revised, because score depends on hours revised.”

Here the hours of revision is the explanatory/independent variable, and the test score the response/dependent variable. In general, use of the model should only be used to estimate a value of the response variable.

Comment on this statement.

I can use my regression line to predict to predict the number of hours someone has revised when they score 70% in the test.

16 of 38

When is a Regression Line unreliable?

 

 

Hours of revision

Test score

Model

“10 hours is outside the range of the data, so the estimate would be unreliable.”

 

Comment on this statement.

interpolation

extrapolation

extrapolation

I can use my regression line to predict to predict someone’s test score if they’ve done 10 hours of revision.

17 of 38

 

 

 

How do we interpret the gradient of 3?

 

 

 

18 of 38

 

 

 

How do we interpret the gradient of 3?

 

 

 

19 of 38

Regression line

Interpretation of gradient

Interpretation of y-intercept

Hours revised h and maths mark m

m=40+10h

For each extra hour revised, students achieve 10 marks better on average.

A student doing no revision would expect to score 40 marks.

Daily average temperature t (in °C) and daily sales of coffee s (in $)

s=5000-150t

Extra increase in temperature of 1°C results in an average decrease in sales of $150

Sales would be $5000 when the temperature is at freezing point (0°C)

a

b

Quickfire

20 of 38

Regression line

Interpretation of gradient

Interpretation of y-intercept

Hours revised h and maths mark m

m=40+10h

For each extra hour revised, students achieve 10 marks better on average.

A student doing no revision would expect to score 40 marks.

Daily average temperature t (in °C) and daily sales of coffee s (in $)

s=5000-150t

Extra increase in temperature of 1°C results in an average decrease in sales of $150

Sales would be $5000 when the temperature is at freezing point (0°C)

a

b

Quickfire

21 of 38

21

22 of 38

www.doteducation.org +971569242777/ +971568161777/ +971525465652

22

23 of 38

www.doteducation.org +971569242777/ +971568161777/ +971525465652

23

24 of 38

www.doteducation.org +971569242777/ +971568161777/ +971525465652

24

25 of 38

www.doteducation.org +971569242777/ +971568161777/ +971525465652

25

26 of 38

www.doteducation.org +971569242777/ +971568161777/ +971525465652

26

27 of 38

www.doteducation.org +971569242777/ +971568161777/ +971525465652

27

28 of 38

www.doteducation.org +971569242777/ +971568161777/ +971525465652

28

29 of 38

www.doteducation.org +971569242777/ +971568161777/ +971525465652

29

30 of 38

www.doteducation.org +971569242777/ +971568161777/ +971525465652

30

31 of 38

www.doteducation.org +971569242777/ +971568161777/ +971525465652

31

32 of 38

www.doteducation.org +971569242777/ +971568161777/ +971525465652

32

33 of 38

www.doteducation.org +971569242777/ +971568161777/ +971525465652

33

34 of 38

www.doteducation.org +971569242777/ +971568161777/ +971525465652

34

35 of 38

www.doteducation.org +971569242777/ +971568161777/ +971525465652

35

36 of 38

www.doteducation.org +971569242777/ +971568161777/ +971525465652

36

37 of 38

www.doteducation.org +971569242777/ +971568161777/ +971525465652

37

38 of 38

www.doteducation.org +971569242777/ +971568161777/ +971525465652

38