1 of 23

Introduction to Likelihood

Adapted from slides by:

Cari van Schalkwyk, PhD

South African Centre for Epidemiological Modelling and Analysis

Stellenbosch University, Stellenbosch, South Africa

MMED 2024

Delivered by:

Bobby Reiner, PhD

University of Washington

000

2 of 23

Goals

  • Define probability distribution function
  • Define Likelihood
  • Define MLE
  • Discuss some uses

2

00

3 of 23

Public Health, Epidemiology, and Models (Day 1)

Simple Models (Day 1)

Foundations of Dynamic Modeling (Day 1)

(Hidden) Assumptions of Simple ODE’s (Day 2)

Breaking Assumptions!

Consequences of Heterogeneity (Day 6)

Introduction stochastic simulation models (Day 3)

Heterogeneity tutorial (Day 6)

Introduction to Infectious Disease Data (Day 1)

Thinking about Data (Day 2)

Data management and cleaning (Day 9)

Creating a Model World (Day 4)

Study design and analysis in epidemiology (Day 3)

Introduction to Statistical Philosophy (Day 4)

Variability, Sampling Distributions, & Simulation (Day 10)

HIV in Harare tutorial (Day 3)

Integration!

Introduction to Likelihood (Day 4)

Fitting Dynamic Models I – III (Day 5, 8, & 9)

Modeling for Policy (Day 11)

Model Assessment (Day 10)

MCMC Lab (Day 9)

MLE Fitting SIR model to prevalence data (Day 5)

Likelihood Lab (Day 4)

4 of 23

Binomial Distribution

4

Number of Heads

Probability

0

6.25%

1

25%

2

37.5%

3

25.5%

4

6.25%

 

00

5 of 23

Binomial Distribution

5

If we tossed our coin 4 times and observed 3 heads, would we think our coin is fair?

If someone told us the coin was unfair, what would our best guess be for the altered chance of observing a head?

If we tossed our coin 100 times and observed 75 heads, would we think our coin is fair?

What probability of heads would best explain the observed 75 heads out of 100?

 

 

00

6 of 23

pdf versus likelihood

6

Probability Distribution Function (pdf)

Defines the probability of observing a range of outcomes assuming a specific ‘truth’ about the distribution of outcomes

Likelihood function

Defines the probability of observing a specific outcome for a range of possible ‘truths’ about the distribution of outcomes

 

 

 

 

00

7 of 23

pdf versus likelihood

7

Instead of coin flips, suppose we thought that seroprevalence of SARS-CoV-2 antibodies in a population was 50%.

Just as with our coin example, we could sample 4 individuals from the population.

If we found 3 of these 4 individuals had antibodies associated with a past SARS-CoV-2 infection (i.e., seroprevalence was 75%), is it plausible that the true value of seroprevalence in the population is 50%?

00

8 of 23

pdf versus likelihood

8

 

 

 

 

4.6875%

42.1875%

0%

25%

 

0.75

0.421875

Maximum Likelihood Estimate (mle)

00

9 of 23

pdf versus likelihood

9

Now imagine we sampled 100 individuals and found 75 of them had antibodies associated with a past SARS-CoV-2 infection (i.e., again, seroprevalence is 75%).

It seems unlikely that the true seroprevalence is 50%. What values are plausible?

00

10 of 23

pdf versus likelihood

10

 

 

 

 

 

 

 

0.75

 

Much less than 0.421875

mle

 

 

00

11 of 23

Likelihood versus log-likelihood

11

 

 

Instead of trying to maximize this

it is frequently easier to maximize this

00

12 of 23

Deriving the maximum likelihood estimator

12

 

 

 

 

 

 

 

 

00

13 of 23

Building a confidence interval

13

 

 

00

14 of 23

Building a confidence interval

14

 

00

15 of 23

Building a confidence interval

15

 

00

16 of 23

Building a confidence interval

16

 

00

17 of 23

Building a confidence interval

17

 

65.95%

82.78%

00

18 of 23

Building a confidence interval

18

 

Recall our original “survey” of 4 individuals.

We can use the same approach to calculate a 95% CI for seroprevalence.

00

19 of 23

Building a confidence interval

19

 

27.76%

98.37%

00

20 of 23

Likehood of all the data (TOMORROW!)

20

 

00

21 of 23

Frequentist paradigm

21

  • What we just did

- MLE

  • Classical tests/regression analyses

- Likelihood ratio rest, AIC

- R – lm(); glm() or Stata – regress; logit

Bayesian paradigm

  • Prior distributions for parameters
  • Use the likelihood function to update
  • Use posterior distributions

 

 

00

22 of 23

Summary

22

  • Likelihood is conditional on data, looking at probabilities from different PDFs, with varying parameters

  • Useful for comparing hypotheses to each other

  • Useful for fitting dynamic models

00

23 of 23

000

000

000

000

This presentation is made available through a Creative Commons Attribution-Noncommercial license. Details of the license and permitted uses are available at� http://creativecommons.org/licenses/by/3.0/

Attribution:

Clinic on Meaningful Modeling of Epidemiological Data

Source URL:

For further information please contact figshare@ici3d.org.

© 2014-2023 International Clinics on Infectious Disease Dynamics and Data