Lecture 5: Base Rates and Reference Classes
Jacob Steinhardt
Stat 165, Spring 2024
Announcements
Warm-up Question
“What is the probability that Joe � Biden is President of the United � States on Nov. 1st, 2024?”
�[Forecast made as of Jan. 2022]
What information would you look at to answer this question?
Information I used
�These are all examples of using base rates.
Decomposing the Problem
Just like with Fermi estimates, can improve accuracy �by decomposing the problem.
Reasons Biden wouldn’t complete his term:
Death by Natural Causes
Death by Assassination
Impeachment / Resignation / Coup
Adding it all up
Other Examples
Brainstorming Exercise
Pick one of these and list 2-3 base rates you would look up.�
Then come up with one additional forecasting question you are personally interested in, and list 2-3 base rates you could use for that.
Dangers of Base Rates
The reference class they used: �“Candidates that are good with the media and give them something to write about but, let’s be real, could never be president”
Base Rates for Events that Have Never Happened
Here are some examples where a “naive” base rate would give zero:
Laplace’s Rule of Thumb
If an event has has n opportunities to occur but has never happened, we assign probability 1/(n+2) to it happening the next time.
Question. What are other places where this is reasonable? Any places where it’s obviously unreasonable?
Revisiting our examples
For each of the following examples, decide on a reference class and determine the value of n. Is the 1/(n+2) estimate reasonable, or would you give a different number?
Laplace’s Rule with a Prior
Brainstorming Exercise
For the George Washington example, suppose we wanted to set p using a base rate (instead of just guessing 25%). What reference class(es) could we use?