Lecture 29
Designing Experiments
DATA 8
Summer 2017
Slides created by John DeNero (denero@berkeley.edu), Ani Adhikari (adhikari@berkeley.edu), Sam Lau (samlau95@berkeley.edu)
Announcements
Discussion Questions
Discussion Question 1
Population: Incomes with mean $10k & SD $20k
Sample: 100 chosen uniformly at random with replacement
Question: What's the chance that the sample average is� above $14k?
Discussion Question 2
Population: A perfect bell shape. Mean 10; SD 20
Sample: 100 chosen uniformly at random with replacement
Question: What's the chance that all are below 50?
Discussion Question 3
You want to estimate the height of the tallest person on campus. You sample 100 people at random and compute a 99.9999% confidence interval using the bootstrap. Its upper bound is 6'4".
A 6'5" person walks by! What might have gone wrong?
Discussion Question 4
You want to estimate the average compensation for SF public workers.
How many people should you sample at random in order to get a 95% confidence interval with a width of $10000 or less?
(Demo)
Choosing a Sample Size
Width of 95% Confidence Interval
Problems:
Attendance
Discussion Question 5
You want to estimate what percent of voters will vote for Candidate A in an upcoming election.
How many opinions should you sample at random in order to get a 95% confidence interval with a width of 3% or less?
(Demo)
Width of 95% Confidence Interval
= 4 x (SD of 0-1 population)/√(sample size)
Control the Width
(Demo)
Bound the 0-1 Population SD