1 of 80

Epidemiologic measures: Incidence & prevalence

Victor J. Schoenbach, PhD home page

Department of Epidemiology�Gillings School of Global Public Health�University of North Carolina at Chapel Hill

www.unc.edu/epid600/

1/25/2011

Incidence and prevalence

1

Principles of Epidemiology for Public Health (EPID600)�

2 of 80

Famous last words

5/20/2002

Incidence and prevalence

2

Quotations that demonstrate the value of humility about predicting the future�(authenticity not established)

Courtesy of Suzanne Cloutier, 11/17/1998

3 of 80

Louis Pasteur's theory of germs is ridiculous fiction.”

- Pierre Pachet, Professor of Physiology at Toulouse, 1872

5/20/2002

Incidence and prevalence

3

FAMOUS LAST WORDS: quotations that demonstrate the value of humility in predicting the future

4 of 80

“This `telephone´ has too many shortcomings to be seriously considered as a means of communication. The device is inherently of no value to us.”

- Western Union internal memo, 1876

(Source: 2000 National Ernst & Young Entrepreneur of the Year Awards special insert in USA Today, 2/11/2000, p9B)

5/20/2002

Incidence and prevalence

4

FAMOUS LAST WORDS: quotations that demonstrate the value of humility in predicting the future

5 of 80

“Everything that can be invented has been invented.”

- Charles H. Duell, Commissioner, US Patent Office,1899

5/20/2002

Incidence and prevalence

5

FAMOUS LAST WORDS: quotations that demonstrate the value of humility in predicting the future

6 of 80

“The wireless music box has no imaginable commercial value. Who would pay for a message sent to nobody in particular?”

- David Sarnoff's associates in response to his urgings for investment in radio in the 1920s.

9/11/2005

Incidence and prevalence

6

FAMOUS LAST WORDS: quotations that demonstrate the value of humility in predicting the future

7 of 80

“My thesis in this lecture is that macroeconomics . . . has succeeded: Its central problem of depression-prevention has been solved, for all practical purposes, and has in fact been solved for many decades.”

- Robert E. Lucas, Jr., American Economics Association Presidential Address, January 10, 2003�http://home.uchicago.edu/~sogrodow/homepage/paddress03.pdf

1/25/2011

Incidence and prevalence

7

FAMOUS LAST WORDS: quotations that demonstrate the value of humility in predicting the future

8 of 80

1/25/2011

Incidence and prevalence

8

The population perspective requires measuring disease in populations

  • Science is built on classification and measurement.
  • Reality is infinitely detailed, infinitely complex.
  • Classification and measurement seek to capture the essential attributes.

9 of 80

1/25/2011

Incidence and prevalence

9

Deriving meaning from stimuli

Vase or faces?

Which line is longer?

10 of 80

1/25/2011

Incidence and prevalence

10

Measurement “captures” the phenomenon

Classification and measurement are based on:

    • Objective of the classification
    • Conceptual model (understanding of the phenomenon)
    • Availability of data (technology)

11 of 80

5/20/2002

Incidence and prevalence

11

O

O

O

O

O

O

O

















An example population (N=200)

12 of 80

1/25/2011

Incidence and prevalence

12

















O

How can we quantify disease in populations?

13 of 80

1/25/2011

Incidence and prevalence

13

















O

O

How can we quantify disease in populations?

14 of 80

5/20/2002

Incidence and prevalence

14

















O

O

O

How can we quantify disease in populations?

15 of 80

5/20/2002

Incidence and prevalence

15

















O

O

O

O

O

How can we quantify disease in populations?

16 of 80

5/20/2002

Incidence and prevalence

16

















O

O

O

O

O

O

How can we quantify disease in populations?

17 of 80

1/25/2011

Incidence and prevalence

17

















O

O

O

O

O

O

How can we quantify the frequency?

18 of 80

1/25/2011

Incidence and prevalence

18

















O

O

O

O

O

O

Rate of occurrence of new cases�per unit time (e.g., 1 per month)

19 of 80

5/20/2002

Incidence and prevalence

19

















O

1 new case in month 1

20 of 80

5/20/2002

Incidence and prevalence

20

















O

O

1 new case in month 2

21 of 80

5/20/2002

Incidence and prevalence

21

















O

O

O

1 new case in month 3, for a total of 3 cases

22 of 80

5/20/2002

Incidence and prevalence

22

















O

O

O

O

O

2 new cases in month 4

23 of 80

5/20/2002

Incidence and prevalence

23

















O

O

O

O

O

O

1 new case in month 5 (total=6)

24 of 80

5/25/2011

Incidence and prevalence

24

















O

O

O

O

O

O

O

1 case in month 6

25 of 80

5/20/2002

Incidence and prevalence

25

















O

O

O

O

O

O

O

O

1 new case in month 7

26 of 80

5/20/2002

Incidence and prevalence

26

















O

O

O

O

O

O

O

O

O

O

2 new cases in month 8

27 of 80

5/20/2002

Incidence and prevalence

27

















O

O

O

O

O

O

O

O

O

O

O

O

2 cases in month 9

28 of 80

1/9/2007

Incidence and prevalence

28

















O

O

O

O

O

O

O

O

O

O

O

O

Rate of occurrence of new cases during 9 months: 1 case/month to 2 cases/month

29 of 80

1/9/2007

Incidence and prevalence

29

Number of cases depends on length of interval

Divide by length of time interval, so can compare across intervals

Number of new cases�Rate of new cases = –––––––––––––––––� Time interval

= 12 cases / 9 months = 1.33 cases / month

30 of 80

1/25/2011

Incidence and prevalence

30

Number of cases depends on population size

So, divide by population and time:

Number of new cases�Incidence rate = ––––––––––––––––––

Population-time

31 of 80

1/25/2011

Incidence and prevalence

31

How to estimate population-time?

Population at risk: the people eligible to become a case and to be counted as one.

In this example that population declines as each case occurs.

So estimate population-time as . . .

32 of 80

1/25/2011

Incidence and prevalence

32

Population-time =

Method 1: Add up the time that each person is at risk

Method 2: Add up the population at risk during each time segment

Method 3: Multiply the average size of the population at risk by the length of the time interval

33 of 80

1/9/2007

Incidence and prevalence

33

Estimating population-time - method 2

Total population-time over 9 months =

200 + 199 + 198 + 197 + 195 + 194 + 193 + 192 + 190

= 1,758 person-months

= 146.5 person-years

However, cases are not at risk for a full month.

34 of 80

1/9/2007

Incidence and prevalence

34

Estimating population-time - method 2 - better

Total population-time over 9 months =

199.5 + 198.5 + 197.5 + 196 + 194.5 + 193.5 + 192.5 + 191 + 189

= 1,752 person-months

= 146 person-years

assuming that cases develop, on average, in the middle of the month

35 of 80

1/9/2007

Incidence and prevalence

35

Estimating population-time - method 3

Average size of the population at risk during the 9 months = 195.3 (1,758 / 9) or approximately: (200 + 188) /2 = 194

Population-time = 195.3 x 9 months or (approximately) 194 x 9 months

= 1,746 person-months

= 145.5 person-years

36 of 80

1/9/2007

Incidence and prevalence

36

Equivalent to - method 3

Take initial size of population at risk and reduce it for time the people were not at risk due to acquiring the disease:

200 - 12/2 = 194 (approximately)

Population-time = 194 x 9 months

= 1,746 person-months

= 145.5 person-years

37 of 80

1/25/2011

Incidence and prevalence

37

Incidence rate (“incidence density”)

Number of new cases� –––––––––––––––––––––––––––––––�Avg population at risk × Time interval

Number of new cases� = –––––––––––––––––––– � Population-time

38 of 80

1/25/2011

Incidence and prevalence

38

O

O

O

O

O

O

O

















What proportion of the population at risk are affected after 5 months?

39 of 80

1/30/2004

Incidence and prevalence

39

















O

What proportion of the population is affected after 1 month? (1/200)

40 of 80

5/20/2002

Incidence and prevalence

40

















O

O

What proportion of the population is affected after 2 months? (2/200)

41 of 80

5/20/2002

Incidence and prevalence

41

















O

O

O

What proportion of the population is affected after 3 months? (3/200)

42 of 80

5/20/2002

Incidence and prevalence

42

















O

O

O

O

O

What proportion of the population is affected after 4 months? (5/200)

43 of 80

1/9/2007

Incidence and prevalence

43

















O

O

O

O

O

O

6 / 200 = 0.03 = 3% = 30 / 1,000 in 5 months

44 of 80

1/25/2011

Incidence and prevalence

44

Incidence proportion (“cumulative incidence”)

Number of new cases�5-month CI = –––––––––––––––––––� Population at risk

Incidence proportion estimates risk.

45 of 80

1/25/2011

Incidence and prevalence

45

Incidence rate versus incidence proportion

  • Incidence rate measures how rapidly cases are occurring.
  • Incidence proportion is cumulative.
  • When care only about the “bottom line” (i.e., what has happened by the end of given period): incidence proportion (CI).

46 of 80

1/25/2011

Incidence and prevalence

46

Incidence rate versus incidence proportion

  • If risk period is long (e.g., cancer), we usually observe only a portion.
  • To compare results from studies with different length of follow-up, use incidence rate (IR)
  • If risk period is short, we usually observe all of it and can use incidence proportion.

47 of 80

Incidence rate versus incidence proportion�(rare disease, IR = 0.005 / month)

1/25/2011

Incidence and prevalence

47

(see spreadsheet at epidemiolog.net/studymat/)

48 of 80

Incidence rate versus incidence proportion� (common disease, IR = 0.1 / month)

2/7/2012

Incidence and prevalence

48

49 of 80

5/20/2002

Incidence and prevalence

49

Case fatality rate

“Case fatality rate” (but it’s really a proportion)

= proportion of cases who die� (in a specified time interval)

  • Like a “cumulative incidence of death” in cases�[ “incidence rate of death” in cases = “termination rate” = 1/(average survival time)]

50 of 80

1/30/2007

Incidence and prevalence

50

Mortality rate

Number of deaths�Mortality rate = ––––––––––––––––––––––––––––� Population at risk × Time interval

Number of deaths�Annual mortality rate = ––––––––––––––––––––––� Mid-year population (x 1 yr)

51 of 80

6/6/2002

Incidence and prevalence

51

Mortality rate (more notes)

Number of deaths�Mortality rate = ––––––––––––––––––––––––––––� Population at risk × Time interval

Number of deaths�Annual mortality rate = ––––––––––––––––––� Mid-year population

52 of 80

5/20/2002

Incidence and prevalence

52

Mortality rates versus incidence rates

  • Mortality data are more generally available
  • Fatality reflects many factors, so mortality rates may not be a good surrogate of incidence rates
  • Death certificate cause of death not always accurate or useful

53 of 80

1/9/2007

Incidence and prevalence

53

Prevalence – another important proportion

Number of existing (and new) cases�Prevalence = –––––––––––––––––––––––––––––––� Population at risk

54 of 80

5/20/2002

Incidence and prevalence

54

















O

O

O

O

O

O

O

1 new case, 1 death

55 of 80

5/20/2002

Incidence and prevalence

55

















O

O

O

O

O

O

O

O

1 new case, 1 new death

56 of 80

5/20/2002

Incidence and prevalence

56

















O

O

O

O

O

O

O

O

O

O

2 new cases, no deaths

57 of 80

5/20/2002

Incidence and prevalence

57

















O

O

O

O

O

O

O

O

O

O

O

O

2 new cases, 1 new death

58 of 80

5/20/2002

Incidence and prevalence

58

















O

O

O

O

O

O

O

O

O

O

O

O

What is the prevalence? (9 / 197)

59 of 80

5/20/2002

Incidence and prevalence

59

Fine points . . .

  • Who is “at risk”?
    • Endometrial cancer? Prostate cancer? Breast cancer?
    • Only women who have not had a hysterectomy?

“Could” develop the condition + “would” be counted.

60 of 80

5/20/2002

Incidence and prevalence

60

More fine points

  • Age?
  • Immunity?
  • Genetically susceptible?

61 of 80

5/20/2002

Incidence and prevalence

61

More fine points . . .

  • How do we measure time?
    • Are 10 people followed for 10 years the same as 100 people followed for 1 year?
    • Aging of the cohort? Secular changes?

62 of 80

9/22/2005, 9/8/2008

Incidence and prevalence

62

Fine points . . .

  • Importance of stating units and scaling unless they are clear from the context
    • e.g., 120 per 100,000 person-years = 10 per 100,000 person-months
    • Hazards from lack of clarity

63 of 80

“You can never, never take anything for granted.”

Noel Hinners, vice president for flight systems at Lockheed Martin Astronautics in Denver, concerning the loss of the Martian Climate Orbiter due to the Lockheed Martin spacecraft team’s having reported measurements in English units whiles the orbiter’s navigation team at the Jet Propulsion Laboratory (JPL) in Pasadena, California assumed the measurements were in metric units.

1/30/2004

Incidence and prevalence

63

64 of 80

5/20/2002

Incidence and prevalence

64

Relation of incidence and prevalence

  • Prevalence depends on incidence
  • Higher incidence leads to higher prevalence if duration of cases does not change.
  • Limitation of the bathtub analogy – flow rate needs to be expressed relative to the size of the source
  • Introducing a new analogy . . .

65 of 80

9/23/2002

Incidence and prevalence

65

66 of 80

5/20/2002

Incidence and prevalence

66

Population at risk

Existing cases

Deaths, cures, etc.

67 of 80

1/25/2011

Incidence and prevalence

67

Incidence, prevalence, duration of hospitalization

Remote community of 101,000 people

One hospital, patient census = 1,000

Steady state

500 admissions per week

Prevalence = 1,000/101,000 = 9.9/1,000

IR = 500/100,000 = 5/1,000/week

Duration Prevalence / IR = 2 weeks

68 of 80

1/25/2011

Incidence and prevalence

68

Relation of incidence and prevalence

Under somewhat special conditions,� Prevalence odds = incidence × duration� Prevalence incidence × duration

(see spreadsheet at www.epidemiolog.net/studymat/)

69 of 80

5/20/2002

Incidence and prevalence

69

Standardization

  • When objective is comparability, need to adjust for different distributions of other determinants
  • Strategy:
    • Analyze within each subgroup (stratum)
    • Take a weighted average across strata
    • Use same weights for all populations

(See the Evolving Text on www.epidemiolog.net)

70 of 80

8/17/2009

Incidence and prevalence

70

Familiar example of weighted averages

  • Liters of petrol per kilometer - differs for Interstate (0.050 LpK) and non-Interstate (0.100 LpK) driving.
  • To compare different cars, can:
    • Compare them for each type of driving separately (stratified analysis)
    • Average for each car, using one set of weights (e.g., 80% Interstate, 20% non-Interstate)
  • E.g. = 0.80 x 0.050 LpK + 0.20 x 0.100 LpK = 0.060 LpK

71 of 80

8/17/2009

Incidence and prevalence

71

Comparing a Suburu and a Mazda

Juan drives a Suburu 800 km on Interstate highways and 200 km on other roads. His car uses 0.050 LpK on Interstates and 0.100 LpK on other roads, for a total of 60 liters of petrol, an average of 0.060 LpK (60 L / 1000 km). His overall LpK can be expressed as a weighted average:

(800/1000) x 0.050 LpK + (200/1000) x 0.100 LpK

= 0.80 x 0.050 LpK + 0.20 x 0.100 LpK = 0.060 LpK

72 of 80

8/17/2009

Incidence and prevalence

72

Comparing a Suburu and a Mazda

Shizu drives her Mazda on a different route, with only 200 km on Interstate and 800 km on other roads. She uses 0.045 lpk on Interstate highways and 0.080 LpK on non-Interstate. She uses a total of 73 liters, or 0.073 LpK. Her overall LpK can be expressed as a weighted average:

(200/1,000) x 0.045 LpK + (800/1,000) x 0.080 LpK

= 0.20 x 0.045 LpK + 0.80 x 0.080 LpK =0.073 LpK

73 of 80

8/17/2009

Incidence and prevalence

73

How can we compare their fuel efficiency?

74 of 80

8/17/2009

Incidence and prevalence

74

Total fuel efficiency is not comparable because weights are different

75 of 80

8/17/2009

Incidence and prevalence

75

By adopting a “standard” set of weights we can compare fairly

76 of 80

8/17/2009

Incidence and prevalence

76

Comparing a Suburu and a Mazda

  • Juan’s Suburu:

= 0.60 x 0.050 LpK + 0.40 x 0.100 LpK =0.070 LpK

  • Shizu’s Mazda:

= 0.60 x 0.045 LpK + 0.40 x 0.080 LpK =0.059 LpK

The choice of weights may often affect the results of the comparison.

77 of 80

“I'm just glad it'll be Clark Gable who's falling on his face and not Gary Cooper.”

- Gary Cooper on his decision not to take the leading role in “Gone With The Wind”

5/20/2002

Incidence and prevalence

77

FAMOUS LAST WORDS: quotations that demonstrate the value of humility in predicting the future

78 of 80

“A cookie store is a bad idea. Besides, the market research reports say America likes crispy cookies, not soft and chewy cookies like you make.”

- Response to Debbi Fields' idea of starting Mrs. Fields' Cookies.

5/20/2002

Incidence and prevalence

78

FAMOUS LAST WORDS: quotations that demonstrate the value of humility in predicting the future

79 of 80

“Computers in the future may weigh no more than 1.5 tons.”

- Popular Mechanics, forecasting the relentless march of science, 1949

5/20/2002

Incidence and prevalence

79

FAMOUS LAST WORDS: quotations that demonstrate the value of humility in predicting the future

80 of 80

“I think there is a world market for maybe five computers.”

-Thomas Watson, chairman of IBM, 1943

5/20/2002

Incidence and prevalence

80

FAMOUS LAST WORDS: quotations that demonstrate the value of humility in predicting the future