1 of 66

UNIT-III

CLASSIFICATION

2 of 66

CLASSIFICATION: οΏ½

Meaning and objectives of classification – types of classification – formation of a Discrete and continuous Frequency Distribution – Histogram – Frequency Polygon – Limitations of Diagrams and Graphs.

3 of 66

Meaning of Classification: οΏ½

  • It is the process of arranging data into homogeneous (similar) groups or classes according to resemblances and similarities.
  • Raw data cannot be easily understood and it is not fit for further analysis and interpretation.
  • This arrangement of data helps users in comparison and analysis. For example, the Population of town can be grouped according to Gender, age, marital status etc.

4 of 66

Definition: οΏ½

  • The process of grouping a large number of individual facts or observation on the basis of similarity among the items is called Classification.

-Stockton and Clark

5 of 66

  • Classification is the process of arranging data into sequences and groups according to their common characteristics or separating them into different but related parts.

-Secrist

6 of 66

Chief characteristics of classification

  • All the facts are classified into homogenous groups by the process of classification
  • The basis of classification is unity in diversity.
  • Classification may be according to either similarities or dissimilarities.
  • Classification may be either real or imaginary.
  • It should be flexible to accommodate adjustments.

7 of 66

Objectives of Classification: οΏ½

  • To condense the mass of data
  • To present the facts in a simple form
  • To bring out clearly the points of similarity and dissimilarity
  • To facilitate comparison
  • To bring out the relation among the data
  • To prepare data for tabulation
  • To facilitate statistical treatment of the data.
  • To facilitate easy interpretation
  • To eliminate unnecessary details.

8 of 66

οΏ½Requirements of a good classification / Rules of classification: οΏ½

The classification of data is a pre-requisite for the statistical analysis. Thus, a good classification should fulfil the following requirements:

  • Exactness
  • Mutually exclusive
  • Stability
  • Flexibility
  • Suitability
  • Homogeneity
  • Mathematical Accuracy

9 of 66

Frequency Distribution

  • A frequency distribution or frequency table is a table which the data are grouped into classes and number of cases in each class are recorded.
  • The number in each class is referred to as β€œfrequency”.
  • There are individual observations, discrete frequency distribution and continuous frequency distribution.

10 of 66

Individual Observation

  • Individual series refers to that series in which items are listed in single, i.e. each item is given a separate value of the measurement such as height, weight etc., are also called simple frequency distribution.
  • Example:

The following are the marks obtained in Statistics:

40, 50, 55, 78, 58, 60, 73, 35, 43, 48, 31, 77, 55, 45, 37.

11 of 66

Discrete frequency distribution

  • In this distribution, we have to count the number of times a particular value is repeated. This is called the frequency of that class.
  • For example: the following are the marks in statistics secured by 15 students in a class.

MARKS

12

25

35

45

49

NO.OF STUDENTS

3

5

2

2

1

12 of 66

Steps in the construction of discrete frequency distribution

  • Prepare three columns – one for the variable, another for bars and third for the frequency corresponding to the variables.
  • Arrange the given values from the lowest to the highest in the first column.
  • In the second column, put a bar (vertical line) opposite to the particular value that it is repeated.
  • Count the bars and place it opposite to the value of the variable in the third column.

13 of 66

For Example

  • Following are the marks obtained by 25 students (out of 10). Make a discrete frequency distribution.

1

2

4

3

4

2

5

3

2

2

4

1

2

3

5

1

3

5

1

3

3

1

3

1

1

14 of 66

Solution

MARKS

TALLY BARS

NO OF STUDENTS (FREQUENCY)

1

1111 I I

7

2

I I I I

5

3

I I I I I I

7

4

I I I

3

5

I I I

3

TOTAL

25

15 of 66

The following are the discrete frequency distribution of 25 students :

MARKS

1

2

3

4

5

NO OF STUDENTS

7

5

7

3

3

16 of 66

Class work Problem:

  • Following are the marks obtained by 30 students. Make a discrete frequency distribution:

9

7

5

3

4

8

6

0

6

5

9

1

7

2

3

8

6

8

7

4

9

4

5

10

6

5

9

6

9

5

17 of 66

18 of 66

19 of 66

The following are the discrete frequency distribution of 30 students:

MARKS

0

1

2

3

4

5

6

7

8

9

10

NO OF STUDENTS

1

1

1

2

3

5

5

3

3

5

1

20 of 66

Home Work

  • the following are the marks obtained by 40 students. Make a discrete frequency distribution with tally bars.

19

12

11

14

15

17

16

20

13

18

18

15

18

20

13

12

18

15

14

11

20

15

18

12

11

16

13

19

14

17

17

19

15

17

19

11

12

13

16

14

21 of 66

Continuous frequency distribution

  • When items are arranged in groups or classes, because they are not exactly measurable, they form continuous series.
  • A collection of items, which cannot be exactly measured, but placed within certain limits, is called continuous series.
  • Class – Limits: are the lowest and the highest values that can be included in the class.
  • For example: take the class 30-40. the lowest limit is 30 and the highest limit is 40.

22 of 66

  • Β 

23 of 66

Methods of Class-Interval

  • Exclusive method
  • Inclusive method

24 of 66

Exclusive method

  • Under this method, the upper limit of one class-interval is the lower limit of the next class.
  • For Example :

MARKS

NO OF STUDENTS

10-20

15

20-30

20

30-40

10

Total

45

25 of 66

Inclusive Method (Non-overlapping)

  • In this method the upper limit of one class is included in that class itself, for example:

MARKS

NO OF STUDENTS

10-19

17

20-29

15

30-39

12

40-49

10

TOTAL

54

26 of 66

  • Marks scored by 30 students are given below:

  • Arrange in Ascending order
  • Arrange in descending order
  • Convert the marks into a continuous series of a class- interval of 10.

41

55

48

47

53

48

33

32

42

55

44

38

60

65

71

80

41

53

47

48

55

20

31

34

42

51

35

30

26

25

PROBLEM:

27 of 66

Solution

  • (A) Marks arranged in Ascending order:

  • (B) Marks arranged in Descending order:

20

25

26

30

31

32

33

34

35

38

41

41

42

42

44

47

47

48

48

48

51

53

53

55

55

55

60

65

71

80

80

71

65

60

55

55

55

53

53

51

48

48

48

47

47

44

42

42

41

41

38

35

34

33

32

31

30

26

25

20

28 of 66

  • (C) Formation of continuous series:

Marks

Tally Marks

Frequency

20-30

III

3

30-40

I I I I I I

7

40-50

I I I I I I I I

10

50-60

IIII I

6

60-70

II

2

70-80

II

2

TOTAL

30

29 of 66

  • The following are the Continuous class frequency of 30 students:

MARKS

20-30

30-40

40-50

50-60

60-70

70-80

NO.OF STUDENTS

3

7

10

6

2

2

30 of 66

Home work

  • In a survey of 30 workers in a factory, the amount of wage earned per worker was recorded and the following data were obtained:

80

100

100

87

137

80

94

150

75

146

87

125

106

131

112

112

119

125

106

100

94

69

75

131

125

106

112

119

137

144

31 of 66

Cumulative Frequency Distribution

  • The cumulative frequencies are derived by the cumulation of the frequencies of successive values. Cumulative frequency of a given variable or class represents the total frequency of all previous variables including the variable or the class.

32 of 66

There are two methods of expressing the cumulative frequencies. They are:

    • More than cumulative frequency distribution
    • Less than cumulative frequency distribution

33 of 66

Less than Cumulative Frequency Distribution

  • For any value of the variable or class is obtained by adding successively the frequencies of all the previous variables including the variable or lass against which it is written.
  • The cumulation is started from the lowest size to the highest size.

34 of 66

1. Calculate less than cf for the following data:

MARKS

FREQUENCY

20-30

5

30-40

18

40-50

20

50-60

15

60-70

18

70-80

40

80-90

68

90-100

16

TOTAL

200

35 of 66

Solution:

MARKS

FREQUENCY

CUMULATIVE FRQUENCY

β€˜LESS THAN’

20-30

5

-

5

30-40

18

(5+18)

23

40-50

20

(23+20)

43

50-60

15

(43+15)

58

60-70

18

(58+18)

76

70-80

40

(76+40)

116

80-90

68

(116+68)

184

90-100

16

(184+16)

200

TOTAL

200

36 of 66

Classwork problem

  • Find less than cumulative frequency distribution:

CLASS-INTERVAL

FREQUENCY

Less than 20

7

Less than 30

12

Less than40

15

Less than 50

20

Less than60

30

TOTAL

84

37 of 66

Solution:

  • Less than C.F.D

CLASS-INTERVAL

LESS THAN C.F

10-20

-

7

20-30

(7+12)

19

30-40

(19+15)

34

40-50

(34+20)

54

50-60

(54+30)

84

38 of 66

Homework problem

40

36

43

57

81

90

92

74

66

85

41

57

34

63

84

93

71

55

56

63

39

44

59

43

90

82

88

72

73

45

53

64

79

85

95

68

65

69

83

80

  1. Make a frequency distribution with class intervals of 10.
  2. Also prepare less than cumulative frequency distribution.
  3. Also prepare more than cumulative frequency distribution.

39 of 66

40 of 66

41 of 66

More than Cumulative Frequency Distribution

  • It is obtained by finding the cumulation total of frequencies starting from the highest to the lowest variable or class.

42 of 66

1. Calculate More than cf for the following data:

43 of 66

Solution

MARKS

FREQUENCY

CUMULATIVE FRQUENCY

β€˜MORE THAN’

20-30

5

-

200

30-40

18

(200-5)

195

40-50

20

(195-18)

177

50-60

15

(177-20)

157

60-70

18

(157-15)

142

70-80

40

(142-18)

124

80-90

68

(124-40)

84

90-100

16

(84-68)

16

TOTAL

200

44 of 66

4. Find more than cumulative frequency distribution:

CLASS-INTERVAL

FREQUENCY

More than 10

7

More than 20

12

More than 30

15

More than 40

20

More than 50

30

TOTAL

84

45 of 66

Solution

CLASS-INTERVAL

MORE THAN C.F

10 -20

-

84

20-30

(84-7)

77

30-40

(77-12)

65

40-50

(65-15)

50

50-60

(50-20)

30

46 of 66

DIAGRAMMATIC PRESENTATION

  • A diagram is a visual form for presentation of statistical data.
  • Diagram refers to the various types of devices such as bars, circles, maps, pictorials, cartograms, etc. these devices can take many attractive forms.
  • An ordinary man can understand pictures more easily than figures.

47 of 66

ADVANTAGES OF DIAGRAM

  • They are attractive and impressive
  • It helps to save time and labour.
  • They have universal applicability.
  • They make data simple.
  • It helps in comparison.
  • They provide more information.

48 of 66

LIMITATIONS OF A DIAGRAM

  • Diagrams cannot be analysed further
  • It shows only approximate values.
  • It exposes only limited facts. All details cannot be presented diagrammatically.
  • It is a supplement to the tabular presentation but not an alternative to it.
  • Minute readings cannot be made.
  • Diagrams are drawn when comparison needed, otherwise, they are of little in use.

49 of 66

RULES FOR MAKING A DIAGRAM

  • HEADING - Every diagram must have suitable title. The title, in bold letters, conveys the main facts depicted by the diagram.
  • SIZE – It should be neither be too big nor too small. It must watch with the size of the paper. It should be in the middle of the paper.
  • LENGTH AND BREADTH – An appropriate proportion should be maintained between length and breadth.

50 of 66

  • SELECTION OF APPROPRIATE DIAGRAM – All types of diagrams are not suitable for all types of data. A wrong selection may distort the true characteristics of the phenomenon to be presented and might lead to very wrong and misleading interpretation.
  • CLEANLINESS - Diagrams must be as simple as possible. Further they must be quite neat and clean. They should also be descent to look at.
  • INDEX - Every diagram or graph must be accompanied by an index. This illustrates different types of lines, shades or colors used in the diagram.

51 of 66

TYPES OF DIAGRAM

  • One dimensional diagram ( Line and Bar )
  • Two-dimensional diagram (rectangle, square etc )
  • Three-dimensional diagram ( cube, sphere etc )
  • Pictogram
  • Cartogram

52 of 66

ONE-DIMENSIONAL DIAGRAM

  • LINE DIAGRAM –PROBLEM
  • The following data show the number of child birth to 125 females in a hospital in one year . Draw a suitable diagram.

No. of Children

1

2

3

4

5

6

7

No. of females

4

20

16

13

22

30

25

53 of 66

54 of 66

Home work Problem

  • The following data show the number of accidents sustained by 100 drivers of a company in a particular year. Draw a suitable diagram.

No. of accidents

1

2

3

4

5

6

7

8

No. of drivers

2

18

15

10

13

22

9

11

55 of 66

BAR DIAGRAM

  • The following data show the seasonal fluctuations in production of wheat during 2010. Draw (A) Horizontal Bar diagram and (B) Vertical Bar diagram.

Month

Sept.

Oct.

Nov.

Dec

Production (in tonnes)

150

300

500

400

56 of 66

(A) Horizontal Bar Diagram

57 of 66

58 of 66

  • The number of students in a university is given below. Represent the data by a suitable diagram.

YEAR

ARTS

COMMERCE

SCIENCE

TOTAL

2015

10000

5000

2500

17500

2016

13000

4500

3500

21000

2017

15500

4750

3750

24000

59 of 66

60 of 66

61 of 66

ANGULAR OR PIE DIAGRAM

  • The following table shows the area in millions of square kilometres of the oceans of the world:

  • Draw a pie diagram to represent the data.

Ocean

Area (million sq.km)

Pacific

70.8

Atlantic

41.2

Indian

28.5

Antarctic

7.6

Arctic

4.8

62 of 66

Solution

Ocean

Area

Degree

Pacific

70.8

70.8/152.9 x 360 = 167

Atlantic

41.2

41.2/152.9 x 360 = 97

Indian

28.5

28.5/152.9 x 360 = 67

Antarctic

7.6

7.6/152.9 x 360 = 18

Arctic

4.8

4.8/ 152.9 x 360 = 11

152.9

360

63 of 66

64 of 66

Classwork problem

  • Represent the following data by pie diagram.

Items

Expenditure (in Rs)

Food

87

Clothing

24

Recreartion

11

Education

13

Rent

25

Miscellaneous

20

65 of 66

Solution

Items

Expenditure (in Rs)

Angle of the circle

(Degree)

Food

87

87/180 x 360 = 174

Clothing

24

24/180 x 360 = 48

Recreartion

11

11/180 x 360 = 22

Education

13

13 /180 x 360 = 26

Rent

25

25 /180 x 360 = 50

Misc.

20

20 /180 x 360 = 40

180

360

66 of 66