Lecture 27
The Normal Distribution
DATA 8
Fall 2019
Weekly Goals
Announcements
Review: SD and Standard Units
How Far from the Average?
5 4 3 2 1
Standard Units
(Demo)
Review: Chebyshev’s Inequality
Chebyshev’s Bounds
Range | Proportion |
average ± 2 SDs* | at least 1 - 1/4 (75%) |
average ± 3 SDs* | at least 1 - 1/9 (88.888…%) |
average ± 4 SDs* | at least 1 - 1/16 (93.75%) |
average ± 5 SDs* | at least 1 - 1/25 (96%) |
No matter what the distribution looks like
(Demo)
* Not including the endpoints
Review: The Normal Distribution
Bell Curve
How Big are Most of the Values?
No matter what the shape of the distribution,
the bulk of the data are in the range “average ± a few SDs”
If a histogram is bell-shaped, then
“average ± 3 SDs”
Bounds and Normal Approximations
A “Central” Area
Central Limit Theorem
Sample Averages
Central Limit Theorem
If the sample is
Then, regardless of the distribution of the population,
the probability distribution of the sample sum
(or the sample average) is roughly normal
(Demo)
Distribution of the �Sample Average
Why is There a Distribution?
Distribution of the Sample Average
(Demo)
Specifying the Distribution
Suppose the random sample is large.
Center of the Distribution
The Population Average
The distribution of the sample average is roughly a bell curve centered at the population average.
Variability of the Sample Average
Why Is This Important?
(Demo)
Discussion Question
The gold histogram shows the distribution of __________ values, each of which is _________________________.
The Two Histograms
(Demo)
Variability of the Sample Average
(Demo)
Discussion Question
A city has 500,000 households. The annual incomes of these households have an average of $65,000 and an SD of $45,000. The distribution of the incomes [pick one and explain]: