Study Design and Analysis in Epidemiology: ��Where does modelling fit?
MMED 2025
Carl
Earlier contributors: Cari van Schalkwyk & Steve Bellan
Slide Set Citation: DOI:10.6084/m9.figshare.5038784
000
Goals
2
Defining Epidemiology
“The study of the distribution and determinants of health-related states and events in populations, and the application of this study to control health problems.”
John M Last �Dictionary of Epidemiology
3
4
Risk Factor: A characteristic that is correlated with a measure of disease.�
Varieties of Infectious Disease Epidemiology
Varieties of Infectious Disease Epidemiology
5
How does mathematical modeling fit?
6
�
Importance of knowledge breadth
Goals
7
Measures of Disease
8
Measures of Covariates (risk factors)
9
Measures of Effect
Example
How could you measure whether circumcision influences the risk of HIV infection?�
10
Measures of Effect
11
Goals
12
Epidemiological Studies
13
Observational
Experimental
Most studies are both
Ecological Studies
14
a.k.a
Correlational studies
15
Bousquet et al (2021) Cabbage and fermented vegetables: From death rate heterogeneity in countries to candidates for mitigation strategies of severe COVID-19. Allergy DOI: 10.1111/all.14549
Example
How could you measure whether circumcision influences the risk of HIV infection?�
16
Ecological study:�HIV prevalence (UNAIDS) vs�circumcision prevalence (DHS)
17
Williams et al. The Potential Impact of Male Circumcision on HIV in Sub-Saharan Africa. Plos Medicine 2006
Circumcision prevalence
HIV prevalence
Cross-Sectional Studies
18
a.k.a
Surveys
Prevalence studies
Relative Risk: Prevalence Ratio
19
| Disease | No Disease | Total (Margins) |
Exposed | a | b | a+b |
Not exposed | c | d | c+d |
Total (Margins) | a+c | b+d | a+b+c+d |
Prevalence Ratio (PR):�prevalence in exposed population divided by prevalence in unexposed population.
PR < 1 exposure correlates with reduced risk of disease
PR > 1 exposure correlates with increased risk of disease
Contingency table or 2x2 table:
Example
How could you measure whether circumcision influences the risk of HIV infection?�
20
Cross-sectional study:�HIV prevalence (DHS) vs�circumcision prevalence (DHS)�����SA 2016 DHS:
21
Njeuhmeli et al. Voluntary Medical Male Circumcision: Modeling the Impact and Cost of Expanding Male Circumcision for HIV Prevention in Eastern and Southern Africa. Plos Medicine 2011
| HIV+ | HIV- | Total |
Circumcised | 143 | 1121 | 1264 |
Not Circumcised | 172 | 760 | 932 |
Total | 315 | 1881 | 2196 |
Case-Control Studies
22
Case Control Studies: Odds Ratios
23
| Disease | No Disease | Total (Margins) |
Exposed | a | b | a+b |
Not exposed | c | d | c+d |
Total (Margins) | a+c | b+d | a+b+c+d |
Odds ratio is the ratio of odds in the diseased population divided by the odds in the �non-diseased population.
OR < 1 means exposure correlates with reduced risk of disease
OR > 1 means exposure correlates with increased risk of disease
Controls: Number chosen by researcher.
Example
How could you measure whether circumcision influences the risk of HIV infection?�
24
Case-control study:�
25
| HIV+ | HIV- | Total |
Circumcised | 15 | 19 | 34 |
Not Circumcised | 475 | 220 | 695 |
Total | 490 | 239 | 729 |
| HIV+ | HIV- | Total |
Circumcised | 48 | 121 | 169 |
Not Circumcised | 101 | 272 | 373 |
Total | 149 | 393 | 542 |
Cohort Studies
26
a.k.a
Prospective studies
Longitudinal studies
Follow-up studies
27
Relative Risk: Cumulative Incidence Ratio
28
| Disease | No Disease | Total (Margins) |
Exposed | a | b | a+b |
Not exposed | c | d | c+d |
Total (Margins) | a+c | b+d | a+b+c+d |
Cumulative Incidence Ratio (CIR):�cumulative incidence in exposed population divided by cumulative incidence in unexposed population.
CIR < 1 exposure correlates with reduced risk of disease
CIR > 1 exposure correlates with increased risk of disease
Cohort Data and Person-Time
X marks occurrence of disease X
O Marks death
Cumulative Incidence
Total number of person-time
Immunity
Time joining the study
Death
5 / 12 = 0.42
Cohort Data and Person-Time
X marks occurrence of disease X
O Marks death
Incidence rate
Number of disease occurrence
= 5
Total number of person-time (X confers immunity)
= 26 years
Incidence rate=5/26=0.19 per person year
Relative Risk: Incidence Rate Ratios
31
| Disease | No Disease | Total (Margins) |
Exposed | a | - | PYe |
Not exposed | c | - | PY0 |
Total (Margins) | a+c | - | PYe + PY0 |
Incidence Rate Ratio is the ratio of the incidence rate of the exposed population to that of the unexposed population.
IRR < 1 means exposure correlates with reduced risk of disease
IRR > 1 means exposure correlates with increased risk of disease
Example
How could you measure whether circumcision influences the risk of HIV infection?�
32
Cohort study:�
33
Cameron et al. Female to male transmission of human immunodeficiency virus type 1: risk factors for seroconversion in men. The Lancet 1989
| HIV+ | HIV- | Total Pwks |
Circumcised | 6 | - | 2966 |
Not circumcised | 18 | - | 1016 |
Total (Margins) | 24 | - | 3982 |
Randomised Controlled Trials
34
Example
How could you measure whether circumcision influences the risk of HIV infection?�
35
Randomised Controlled Trial:�
36
Auvert et al. Randomized, Controlled Intervention Trial
of Male Circumcision for Reduction of HIV
Infection Risk: The ANRS 1265 Trial. Plos Medicine 2005
| HIV+ | HIV- | Total PYs |
Circumcised | 20 | - | 2,354 |
Not circumcised | 49 | - | 2,339 |
Total (Margins) | 69 | - | 4693 |
Goals
37
Random Error
38
Estimate =
Bias
Difference between observed value and true value due to all causes other than random error.���
Bias does not go away with greater sample size!
Bias must be dealt with during study design!�
39
Selection Bias
Error due to systematic differences between those who take part in the study and those who do not.
John Last, Dictionary of Epidemiology�
40
Information Bias
A flaw in measuring exposure or outcome data that results in different quality (accuracy) of information between comparison groups.
John Last, Dictionary of Epidemiology�
�
Survivor Bias
41
42
Non-response Bias
Confounding
43
Literacy
HIV Status
| HIV+ | HIV- |
Literate | 660 | 340 |
Illiterate | 180 | 820 |
What if some of the study population were much younger than others?
Confounding
44
Pooled | HIV+ | HIV- |
Literate | 660 | 340 |
Illiterate | 180 | 820 |
| | |
6-15 years old | HIV+ | HIV- |
Literate | 30 | 270 |
Illiterate | 90 | 810 |
| | |
16-24 years old | HIV+ | HIV- |
Literate | 630 | 70 |
Illiterate | 90 | 10 |
6-15 year olds: Literacy = 300/1200 = 25%�
16-24 year olds: Literacy = 700/800 = 87.5%
Confounding
45
Literacy
HIV Status
| HIV+ | HIV- |
Literate | 660 | 340 |
Illiterate | 180 | 820 |
Age
CONFOUNDING
How do you deal with confounding?
46
Goals
47
48
By developing dynamic models in a probabilistic framework, we can account for dependence, random error, and bias while linking patterns at multiple scales.
Statistical Models
Dynamical Models
Questions in Epidemiology
49
Statistical Models
Dynamic Models
Summary
50
Title: Study Design and Analysis in Epidemiology: Where does modelling fit?
https://figshare.com/collections/International_Clinics_on_Infectious_Disease_Dynamics_and_Data/3788224
000
000
000
000
This presentation is made available through a Creative Commons Attribution-Noncommercial license. Details of the license and permitted uses are available at� http://creativecommons.org/licenses/by/3.0/
Attribution:
Clinic on the Meaningful Modeling of Epidemiological Data
Source URL:
For further information please contact figshare@ici3d.org.
© 2014-2021 International Clinics on Infectious Disease Dynamics and Data