1 of 74

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

(A Unit of Rajalaxmi Education Trust ®, Mangalore)

Autonomous Institute affiliated to VTU, Belgavi, Approved by AICTE, New Delhi

Accredited by NAAC with A+ Grade and ISO 9001:2015 Certified Institution

2 of 74

CONTENT

  • Correlation: Introduction
  • Types of Correlation
  • Correlation and Causation
  • Scatter Diagram Method
  • Karl Pearson’s Coefficient of Correlation
  • Regression: Introduction
  • Linear and Non-Linear Regression
  • Lines of Regression
  • Coefficient of Regression
  • Mean values from the two lines of Regression
  • Regression coefficients and the Correlation Coefficient from the lines of Regression
  • Correlation Analysis Vs. Regression Analysis.

2

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

MODULE -1:

CORRELATION & REGRESSION

3 of 74

3

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

LEARNING OBJECTIVES:

Provide strong foundation in Correlation and Regression techniques to solve engineering problems.

RBT LEVEL: L1, L2, L3

4 of 74

4

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Response and Explanatory Variables:

Response variable (Dependent Variable)

    • The outcome variable on which comparisons are made.

Example : College GPA

Explanatory variable (Independent variable)

    • When the explanatory variable is categorical, it defines the groups to be compared with respect to values on the response variable.
    • When the explanatory variable is quantitative, it defines the change in different numerical values to be compared with respect to the values for the response variable.

Example : Number of hours a week spent studying.

Correlation and Regression-scatter plot :

MODULE - 1:

Statistics

5 of 74

5

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

  • The main purpose of data analysis with two variables is to investigate whether there is an association and to describe that association.

  • An association exists between two variables if a particular value for one variable is more likely to occur with certain values of the other variable.

6 of 74

6

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Correlation:

Correlation is a statistical technique to ascertain the association or relationship between two or more variables.

variables may be:

If two quantities vary in such a way that movements in one are accompanied by movements in the other, then these quantities are said to be correlated.

7 of 74

7

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

A positive correlation indicates the extent to which those variables increase or decrease in parallel

A negative correlation indicates the extent to which one variable increases as the other decreases.

8 of 74

8

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Example:

AMOUNT OF TELEVISION CHILDREN WATCH

THE LIKELIHOOD THAT THEY WILL BECOME BULLIES

AND

The studies report a correlation, a lack of parental supervision – may be the influential factor (an unknown factor that influences both variables) .

9 of 74

9

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Statisticians have developed measures for describing the correlation between two variables

COEFFICIENT OF CORRELATION

The sample correlation coefficient (r) measures the degree of linearity in the relationship between “X” and “Y”

-1 ≤ r ≤ + 1

-1 is a strong negative relationship and +1 has a strong positive relationship

r = 0 indicates no linear relationship

10 of 74

10

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Types of Correlation

Types of Correlation

Positive and Negative Correlation

Simple, multiple and Partial Correlation

Linear and Non-Linear Correlation

11 of 74

11

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Positive, Negative and Zero Correlation:

Positive Correlation: If both the variables vary in the same direction, correlation is said to be positive. It means if one variable is increasing, the other on an average is also increasing or if one variable is decreasing, the other on an average is also deceasing, then the correlation is said to be positive correlation.

Example: The correlation between heights and weights of a group of persons is a positive correlation.

Height (cm) : X

158

160

163

166

168

171

174

176

Weight (kg) : Y

60

62

64

65

67

69

71

72

12 of 74

12

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Negative Correlation: If both the variables vary in opposite direction, the correlation is said to be negative. If means if one variable increases, but the other variable decreases or if one variable decreases, but the other variable increases, then the correlation is said to be negative correlation.

Example: the correlation between the price of a product and its demand is a negative correlation.

Price of Product (Rs. Per Unit) : X

6

5

4

3

2

1

Demand (In Units) : Y

75

120

175

250

215

400

13 of 74

13

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Zero Correlation: Actually, it is not a type of correlation but still it is called as zero or no correlation. When we don’t find any relationship between the variables then, it is said to be zero correlation. It means a change in value of one variable doesn’t influence or change the value of another variable.

Example: the correlation between weight of person and intelligence is a zero or no correlation.

Multiple Correlation: When three or more variables are studied, it is a case of multiple correlation. For example, in above example if study covers the relationship between student marks, attendance of students, effectiveness of teacher, use of teaching aids etc, it is a case of multiple correlation.

Partial Correlation: In case of partial correlation, one studies three or more variables but considers only two variables to be influencing each other and the effect of other influencing variables being held constant.

Example : In above example of relationship between student marks and attendance, the other variable influencing such as effective teaching of teacher, use of teaching aid like computer, smart board etc are assumed to be constant.

14 of 74

14

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Methods of measurement of correlation:

Quantification of the relationship between variables is very essential to take the benefit of study of correlation. For this, we find there are various methods of measurement of correlation, which can be represented as given below:

Methods of Measurement of Correlation

Graphic Method Algebraic Method

Scatter Diagram Karl Pearson’s Coefficient of Correlation

15 of 74

15

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Scatter Diagram: Graphical display of relationship between two quantitative variables:

Horizontal Axis: Explanatory variable, x

Vertical Axis: Response variable, y

MINITAB Scatterplot for Internet use and Facebook use for 33 Countries.The point for Japan is labeled and has coordinates x = 74 and y = 2 .

Question: Is there any point that you would identify as standing out in some way? Which country does it represent, and how is it unusual in the context of these variables?

16 of 74

16

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Solution:

Yes, the point representing Japan (74, 2) stands out significantly in the scatterplot. Here's why:

  • The x-coordinate (74) represents the percentage of internet users in Japan, which is relatively high.

  • The y-coordinate (2) represents Facebook use, which is extremely low compared to other countries with similar internet penetration rates.

By inspecting the scatterplot, we observe - Nations with high internet usage tend to have higher Facebook usage. For the countries with relatively low usage of Internet (below 20%) there is little variability in FB use.

Why is Japan Unusual?

However, Japan deviates from this trend by having very low Facebook engagement despite widespread internet access. This suggests that other social media platforms (such as LINE or Twitter) are more popular in Japan, making it an outlier in the dataset.

17 of 74

17

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

How to Examine a Scatterplot :

We examine a scatterplot to study association. How do values on the response variable change as values of the explanatory variable change?

You can describe the overall pattern of a scatterplot by the trend, direction, and strength of the relationship between the two variables.

    • Trend: Linear, Curved, Clusters, No pattern.

    • Direction: Positive, Negative, No direction.

    • Strength: How closely the points fit the trend.

Also look for outliers from the overall trend.

18 of 74

18

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Interpreting Scatterplots: Two quantitative variables x and y are

Positively associated : when high values of x tend to occur with high values of y.

low values of x tend to occur with low values of y.

Negatively associated : when high values of one variable tend to pair with low values of the other variable.

19 of 74

19

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

A scatter diagram reveals whether the movements in one series are associated with those in the other series.

Positive Correlation: In this case, the points will form on a straight line falling from the lower left hand corner to the upper right hand corner.

Negative Correlation: In this case, the points will form on a straight line rising from the upper lright-handcorner to the lower right hand corner.

Zero (No) Correlation: When plotted points are scattered over the graph haphazardly, then it indicate that there is no correlation or zero correlation between two variables.

20 of 74

20

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Example: Given the following pairs of values

(a)Draw a scatter diagram

(b) Do you think that there is any correlation between profits and capital employed?

Is it positive or negative? Is it high or low?

Capital Employed (Rs. In Crore)

1

2

3

4

5

7

8

9

11

12

Profit (Rs. In Lakhs)

3

5

4

7

9

8

10

11

12

14

21 of 74

21

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Capital Employed (Rs. in Crore)

14

12

10

8

6

4

2

0

16

14

12

10

8

6

4

2

0

Solution:

From the observation of scatter diagram we can say that the variables are positively correlated. In the diagram the points trend toward upward rising from the lower right-hand corner to the upper right-hand corner, hence it is positive correlation.

Plotted points are in narrow band which indicates that it is a case of high degree of positive correlation.

22 of 74

22

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Example: 100 cars on the lot of a used-car dealership. Would you expect a positive association, a negative association or no association between the age of the car and the reading on the odometer?

Solution :

A positive association between the age of the car and the reading on the odometer.

This means that as the age of a car increases, the odometer reading (mileage) generally increases as well. Older cars have typically been driven for more years, accumulating more miles.

While there are exceptions (such as classic cars with low mileage),

the overall trend in a used car lot would likely show that older cars tend to have

higher mileage.

23 of 74

23

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Sl No

Particulars

Solution

1

Price of commodity and its demand

2

Yield of crop and amount of rainfall

3

No of fruits eaten and hungry of a person

4

No of units produced and fixed cost per unit

5

No of girls in the class and marks of boys

6

Ages of Husbands and wife

7

Temperature and sale of woollen garments

8

Number of cows and milk produced

9

Weight of person and intelligence

10

Advertisement expenditure and sales volume

Practice problem

State in each case whether there is

(i)Positive Correlation

(ii)Negative Correlation

(iii)No Correlation

24 of 74

24

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Karl Pearson’s Coefficient of Correlation

Karl Pearson’s method of calculating coefficient of correlation is based on the covariance of the two variables in a series. This method is widely used in practice and the coefficient of correlation is denoted by the symbol “r”. Pearson's correlation coefficient between two variables is defined as the ”covariance of the two variables divided by the product of their standard deviations”

Direct Method:This method used when given variables are small in magnitude

25 of 74

25

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Interpretation of Correlation Coefficient (r) : 

  • The Correlation measures the strength and direction of the linear association between x and y.
  • The Sign of r denotes the nature of the association while the value of r denotes the strength of association

26 of 74

26

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

 

Firm

1

2

3

4

5

6

7

8

9

10

Advertisement Exp. (Rs. In Lakhs)

11

13

14

16

16

15

15

14

13

13

Sales Volume (Rs. In Lakhs)

50

50

55

60

65

65

65

60

60

50

27 of 74

27

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Firm

x

y

XY

1

11

50

-3

9

-8

64

24

2

13

50

-1

1

-8

64

8

3

14

55

0

0

-3

9

0

4

16

60

2

4

2

4

4

5

16

65

2

4

7

49

14

6

15

65

1

1

7

49

7

7

15

65

1

1

7

49

7

8

14

60

0

0

2

4

0

9

13

60

-1

1

2

4

-2

10

13

50

-1

1

-8

64

8

 

140

580

 

22

 

360

70

 

∑x

∑y

 

∑X2

 

∑y2

∑XY

Calculation of Karl Pearson’s coefficient of correlation

28 of 74

28

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

 

Interpretation: From the above calculation it is very clear that there is high degree of positive correlation i.e. r = 0.7866, between the two variables. i.e. Increase in advertisement expenses leads to increased sales volume.

29 of 74

29

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Example:

30 of 74

30

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

31 of 74

31

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Example: Find the correlation coefficient between age and playing habits of the following students using Karl Pearson’s coefficient of correlation method.

Solution:

To find the correlation between age and playing habits of the students, we need to compute the percentages of students who are having the playing habit.

 

Percentage of playing habits = (No. of Regular Players / Total No. of Students) * 100

Now, let us assume that ages of the students are variable X and percentages of playing habits are variable Y.

Age

15

16

17

18

19

20

Number of students

250

200

150

120

100

80

Regular Players

200

150

90

48

30

12

32 of 74

32

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

 

Age (X)

No of Students

Regular Players

Percentage of Playing

Habits (y)

 

XY

15

250

200

80

-2.5

6.25

30

900

-75

16

200

150

75

-1.5

2.25

25

625

- 37.5

17

150

90

60

-0.5

0.25

10

100

-5

18

120

48

40

0.5

0.25

-10

100

-5

19

100

30

30

1.5

2.25

-20

400

-30

20

80

12

15

2.5

6.25

-35

1225

-87.5

 

 

-240

∑x=105

 

 

∑y=300

 

 

∑XY = -240

 

33 of 74

33

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

 

Interpretation: From the above calculation it is very clear that there is

high degree of negative correlation

i.e. r = -0.9912, between the two variables of age and playing habits.

Playing habits among students decreases when their age increases.

34 of 74

34

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Practice Problems

  1. Calculate Karl Pearson’s coefficient of correlation between the age and weight of the children:

Age (in yrs

1

2

3

4

5

Weight in kg

3

4

6

7

12

2. Coefficient of correlation between X and Y is 0.3. Their covariance is 9. The variance of X is 16. Find the standard devotion of Y .

35 of 74

35

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

36 of 74

36

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Regression:

A study of measuring the relationship between associated variables, wherein one variable is dependent on another independent variable, called Regression.

Regression analysis is a statistical tool to study the nature and extent of functional relationship between two or more variables and to estimate (or predict) the unknown values of dependent variable from the known values of independent variable.

The variable that forms the basis for predicting another variable is known as the Independent Variable and the variable that is predicted is known as dependent variable.

Example: if we know that the two variables price (X) and demand (Y) are closely related we can find out the most probable value of X for a given value of Y or the most probable value of Y for a given value of X.

37 of 74

37

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

It was Sir Francis Galton who first used the term regression as a statistical concept in 1877 to measure the relationship of height between parents and their children.

  • He made a statistical study that showed that the height of children born to tall parents tends to ‘regress’ (go back) towards the mean height of population

38 of 74

� �� ��

38

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

  • He used the term regression as a statistical technique to predict one variable (the height of children) from another variable (the height of parents).

  • This is called ‘regression’ or ‘simple regression’ confined to bivariate data.

39 of 74

� �� � The Regression Technique is Primarily Used to��

39

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Estimate the relationship that exists, on the average, between the dependent variable and the explanatory variable

Determine the effect of each of the explanatory variables on the dependent variable

Controlling the effects of all other explanatory variables

Predict the value of dependent variable for a given value of the explanatory variable

40 of 74

� ��

40

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Difference between Regression and Correlation

  • Regression
  • Correlation
  • Regression is concerned with bringing out the nature of relationship and using it to know the best approximate value of one variable corresponding to a known value of another variable

  • Correlation on the other hand is concerned with quantifying the closeness of such relationship

41 of 74

41

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

If the relationship is best explained by a straight line, it is said to be Linear

On the contrary, if it is described more appropriately by a curve, the relationship is said to be Non-Linear

42 of 74

42

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

What is linear regression?

Suppose you are thinking of selling your home. Different sized homes around you have sold for different amounts:

Your home is 3000 square feet. How much should you sell it for? You have to look at the existing data and predict a price for your home. This is called linear regression.

Here's an easy way to do it. Look at the data you have so far:

43 of 74

43

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Each point represents one home. Now you can eyeball it and roughly draw a line that gets pretty close to all of these points:

Then look at the price shown by the line, where the square footage is 3000:

44 of 74

44

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Your home should sell for $260,000.

Plot your data, eyeball a line, and use the line to make predictions

45 of 74

45

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

46 of 74

46

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Nonlinear Regression

A visual representation of a non-linear regression model. The blue points represent the data, while the red curve is the fitted regression model, capturing the non-linear relationship between the independent and dependent variables.

47 of 74

47

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

 

48 of 74

48

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

 

49 of 74

49

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

 

Method

Regression Coefficient X on Y

Regression Coefficient Y on X

Using the correlation

coefficient (r) and

standard deviation (σ)

When deviations are taken from arithmetic mean

50 of 74

50

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Properties of Regression Coefficients

:

 

 

 

51 of 74

51

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Example: The following data gives the age and blood pressure (BP) of 10 sports persons.

(a) Find regression equation of Y on X and X on Y (Use the method of deviation from arithmetic mean)

(b)Find the correlation coefficient (r) using the regression coefficients.

(c) Estimate the blood pressure of a sports person whose age is 45.

Name :

A

B

C

D

E

F

G

H

I

J

42

36

55

58

35

65

60

50

48

51

98

93

110

85

105

108

82

102

118

99

52 of 74

52

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Name

XY

A

42

98

-8

-2

64

4

16

B

36

93

-14

-7

196

49

98

C

55

110

5

10

25

100

50

D

58

85

8

-15

64

225

-120

E

35

105

-15

5

225

25

-75

F

65

108

15

8

225

64

120

G

60

82

10

-18

100

324

-180

H

50

102

0

2

0

4

0

I

48

118

-2

18

4

324

-36

J

51

99

1

-1

1

1

-1

 

∑X=0

∑Y=0

∑Y2 = 1,120

∑XY = -128

53 of 74

53

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

 

54 of 74

54

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

 

 

 

55 of 74

55

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

 

Computation of coefficient of correlation using regression coefficient:

 

 

56 of 74

56

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

 

57 of 74

57

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Example :

Following data about the sales and advertisement expenditure of a firm is given below

Correlation coefficient is 0.9.

(i) Estimate the likely sale for a proposed advertisement expenditure of Rs.10.crores .

(ii) What should be the advertisement expenditure if the firm proposes a sales target of Rs.60 crores.

Sales in crores

Ad expenditure

Mean

40

6

Standard deviation

10

1.5

58 of 74

58

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

 

 

59 of 74

59

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

 

Mean

Standard deviation

Production of Wheat (kg. per unit area)

10

8

Rainfall (cm)

8

2

60 of 74

60

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Curve Fitting using method of least squares:

Curve fitting is a statistical technique used to create a curve that best represents a set of data points.

  • It involves adjusting a mathematical function to approximate the relationship between variables in a dataset. The goal is to find a function that minimizes the difference between the observed data points and the values predicted by the function.

  • Curve fitting is a powerful tool that enables the analysis and interpretation of data across various disciplines . This process is essential in various fields, including engineering, economics, biology, and machine learning, where understanding trends and making predictions are crucial.

Least Squares Method: This is the most common approach for curve fitting. It minimizes the sum of the squares of the residuals (the differences between observed and predicted values).

61 of 74

61

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Methods of Fitting a Straight Line:

Least Square Method:

      • The least square method of fitting a straight line is the one that statistically minimizes the sum of squares of the deviations
      • It is by virtue of this property that the resultant straight line is indeed the line of best fit

62 of 74

62

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Following are the two methods to form the two regression equations that is, equation for:

    • Y on X
    • X on Y

The above equations can be done by 2 methods:

Regression equations through normal equations

63 of 74

63

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

For Y on X, the equation is, For X on Y, the equation is

Y= a + b X X= a + b Y

For determining the values of ‘ a’ and ‘b’ we determine two normal equations which can be solved simultaneously

Two normal equations:

X on Y

Y on X

ƸX = Na + b (ƸY)

ƸXY= a (ƸY) + b (ƸY2)

ƸY= na + b (ƸX)

ƸXY= a (ƸX) + b (ƸX2)

Regression Equations Through Normal Equations

64 of 74

64

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

The equation for a straight line is

Where

    • Y is the dependent variable
    • X is the independent variable
    • a is the Y-intercept, which is the point at which the regression line crosses the Y-axis (the vertical axis) and
    • b is the slope of the regression line.

It should be noted that the values of both a and b will remain constant for any given straight line.

Y= a + bX

65 of 74

65

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

The two normal equations are:

ƸY = na + b ƸX

ƸXY = a ƸX + b ƸX2

  where:

ƸY = the total of Y series

n = number of observations

ƸX = the total of X series

ƸX2 = the total of squares of individual items in X series

ƸXY = the sum of XY column

a and b are the Y-intercept and the slope of the regression line, respectively.

66 of 74

66

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

Example :

Given the bivariate data:

    • Fit a straight line on Y on X and hence predict Y , if X=20
    • Fit the regression line on X on Y and hence predict X, if Y= 5

Solution:

In this case both the regression lines are needed. It is necessary to find ‘a’ and ‘b’. Given X, Y can be estimated and vice versa

X:

2

6

4

3

2

2

8

4

Y:

7

2

1

1

2

3

2

6

67 of 74

67

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

X

Y

X2

Y2

XY

2

7

4

49

14

6

2

36

4

12

4

1

16

1

4

3

1

9

1

3

2

2

4

4

4

2

3

4

9

6

8

2

64

4

16

4

6

16

36

24

ƸX = 31

ƸY = 24

ƸX2 = 153

ƸY2 = 108

ƸXY= 83

68 of 74

68

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

 

 

 

Y= -1.8375

69 of 74

69

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

FITTING OF PARABOLA

 

70 of 74

70

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

T

Overtime hours(x)

1

1

2

2

3

3

4

5

6

7

Additional units

2

7

7

10

8

12

10

14

11

14

 

71 of 74

71

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

x

y

xy

1

2

1

1

1

2

2

1

7

1

1

1

7

7

2

7

4

8

16

14

28

2

10

4

8

16

20

40

3

8

9

27

81

24

72

3

12

9

27

81

36

108

4

10

16

64

256

40

160

5

14

25

125

625

70

350

6

11

36

216

1296

66

396

7

14

49

343

2401

98

686

34

95

154

820

4774

377

1849

72 of 74

72

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

 

 

73 of 74

73

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

 

1.0

1.5

2.0

2.5

3.0

3.5

4.0

y

1.1

1.3

1.6

2.0

2.7

3.4

4.1

 

x

0

1

2

3

4

y

1

1.8

3.3

4.5

6.3

Ans: [ a= 1.04; b= -0.198 ; c =0.244 ]

Ans: [ a= 1.33 ; b= 0.72

74 of 74

74

MANGALORE INSTITUTE OF TECHNOLOGY & ENGINEERING

 

-3

-2

-1

0

1

2

3

y

4.63

2.11

0.67

0-09

0.63

2.15

4.58

2. Using the method of least squares, fit a linear relation of the form P = a+ bW to the given data and estimate P when W is150

W:

50

70

100

120

P:

12

15

21

25

Ans: [ a= 0.1329 ; b= -0.00393 ; c 0.4975]

Ans: [ a= 2.2785 ; b= 0.1879 and P= 30.4635 when w=150]