INTRODUCTION TO STATISTICS
DEFINITION OF STATISTICS
MAIN COMPONENTS INVOLVED IN STATISTICS
PROCEDURES FOR DATA COLLECTION
ADVANTAGES AND DISADVANTAGES OF DATA COLLECTION METHODS
Cont’n
Cont’n
Advantages
Disadvantages
Cont’n
Data Collection Issues
Population and Samples�
Sampling Techniques
PROBABILITY OR STATISTICAL SAMPLING TECHNIQUE
N0N-PROBABILITY OR NON-STATISTICAL SAMPLING TECHNIQUE
VARIABLES
Cont’n���
EXPERIMENTS
LEVELS OF MEASUREMENT
TYPES OF DATA
OTHER TYPES OF DATA
DESCRIPTIVE STATISTICS
Example 1: construct a frequency distribution table from the raw data below as marks for a quiz taken by level 300 students in UCC.
12, 13, 17, 12,12,12,12,13,14,14,18,15,17,15,15,10, 11,11.
FREQUENCY DISTRIBUTION: GROUPED DATA
number of classes
FREQUENCY DISTRIBUTION: GROUPED DATA
Construct a frequency distribution table for the group data
10,8,12,3,4,15,14,15,24,26,29,28,27,21,22,22,23,34,43,48,46,49,56,57,62,67,1,11,53.
FREQUENCY DISTRIBUTION
A histogram is a display is statistical information that uses rectangles to show the frequency of data in items in successive numerical intervals of equal size.
CONSTRUCTING FREQUENCY HISTOGRAMS
Construct a histogram from example 1 and 2.
RELATIVE FREQUENCY
PRESENTING DATA BY CHARTS
Is a graphical representation of a category data set in which a rectangle is drawn over each class.
Example: drugs sold at ATM pharmacy in January 2012
ATM PHARMACY | JANUARY SALES |
Paracetamol | 2,000 |
Vitamin c | 3,600 |
Cold relief | 4,500 |
Metrolex | 7,000 |
Amocyclin | 8,600 |
Constructing Bar Chart
Pie Chart
Disease | Number recorded |
Malaria | 5,000 |
Typhoid | 1,000 |
Fever | 2,000 |
Respiratory infection | 3,500 |
Anemia | 4,200 |
Constructing a Pie Chart
Measures of Central Tendency
The three central tendency are
Comparing Mode, Median and Mean�
The mode is appropriate to use when
Do not use when;
Comparing Mode, Median and Mean
The median is appropriate when;
Do not use when;
The distribution of the data is symmetrical because the mean is preferred.
Comparing Mode, Median and Mean
the data is symmetrical or at least not really skewed.
Do not use when;
Computing the Central Tendencies and Location
Computing the Central Tendencies and Location
Is the center value that divides a data array into two halves.
Computing the Median
Example: find the median for the following data set 11,12,19,9,4,24,4,23,16,5,22,4,14,11,12
Computing the Central Tendencies and Location
Is the number that is repeated more often than any other.
Example: given the data set 5, 5.5,4.9,4.85,5.25,5.05,4.9, find the mode.
Try:
Find the mean, median and mode for the following list of values: 13,18,13,14,13,16,14,21,13.