1 of 33

Bayesian inference for a principal stratum estimand on recurrent events truncated by death

Tianmeng Lyu, Björn Bornkamp, Guenther Mueller-Velten and Heinz Schmidli

March 31, 2023

Statistical Methodology

2 of 33

Agenda

  • Introduction
    • The estimands framework
    • Principal stratum estimand
  • Recurrent event project
    • Motivating example
    • Proposed analysis method
    • Real data application: PARAGON-HF
  • Concluding remarks

2

3 of 33

Introduction

3

4 of 33

Motivating example

  • The following example is a simplified version of the example in the ICH E9(R1) training materials available on the ICH website
  • Imagine that there is a clinical trial to compare treatment vs. control in patients with diabetes and the endpoint of interest is change from baseline in HbA1c at week 24
  • A certain proportion of patients in both arms took rescue medication before week 24

4

In my analysis, I excluded the HbA1c values collected after rescue medication

In my analysis, I included HbA1c values regardless of rescue medication

  • They got different results
  • Which one will you choose?

5 of 33

Motivating example

  • The difference between their analysis approaches may seem to be a pure statistical question, however, they are actually answering different questions!

5

  • For the approach where data collected after use of rescue are excluded (and some imputed values are used instead), this is trying to answer the question “what would be the treatment effect if rescue medications were not taken/not allowed
  • For the approach where data collected after use of rescue are included, this is trying to answer the question “what is the treatment effect regardless of rescue medication”, i.e. it is comparing “treatment with rescue (taken as required) vs. control with rescue (taken as required)
  • The estimands framework helps to clarify such ambiguity

6 of 33

The estimands framework

  • It was introduced in ICH E9 (R1) “Addendum on estimands and sensitivity analysis in clinical trials”, in 2019
  • Estimand is defined as “a precise description of the treatment effect reflecting the clinical question posed by the trial objective. It summarises at a population-level what the outcomes would be in the same patients under different treatment conditions being compared

6

Trial objective

Estimand

Estimator

  • Trial planning:

7 of 33

The estimands framework

  • The description of an estimand involves specification of five attributes:
    • Treatment
    • Population
    • Variable
    • Intercurrent events
    • Population-level summary

7

  • Intercurrent event: events occurring after treatment initiation that affect either the interpretation or the existence of the measurements associated with the clinical question of interest
    • See next slide for examples

8 of 33

Intercurrent events

8

Patient 2

Patient 1

Patient 3

Patient 4

Death

Patient 5

TIMELINE

Randomisation

Data collection for the variable

Change in background therapy

Treatment switch

Treatment discontinuation due to lack of efficacy

Use of rescue medication

Treatment discontinuation due to adverse events

9 of 33

Strategies to handle intercurrent events

  • ICH E9 (R1) introduced five strategies to handle intercurrent events:

9

    • Treatment policy: the occurrence of the intercurrent event is considered irrelevant in defining the treatment effect of interest
    • Hypothetical: a scenario is envisaged in which the intercurrent event would not occur
    • Composite: an intercurrent event is considered in itself to be informative about the patient’s outcome and is therefore incorporated into the definition of the variable
    • While on treatment: response to treatment prior to the occurrence of the intercurrent event is of interest
    • Principal stratum: the target population might be taken to be the “principal stratum” in which an intercurrent event would (or would not) occur. The clinical question of interest relates to the treatment effect only within the principal stratum

10 of 33

Potential outcomes framework

  1.  

10

Patient ID

Y(0)

Y(1)

1

2.5

3.5

2

1.8

2.3

3

3.0

4.2

*Note: the numbers in the table above are just some random fake numbers for illustration purpose only

11 of 33

Principal stratum – motivation

  • Motivation: interested in the treatment effect in a subpopulation defined by a post-treatment variable (S)
    • For example, “S” could be intake of rescue medication, death, etc.

11

 

 

12 of 33

Principal stratum – introduction�Frangakis & Rubin (2002)

  1.  

12

0 (treatment non-tolerance)

1 (tolerance)

0 (treatment non-tolerance)

Tolerate neither

Tolerate test treatment only

1 (tolerance)

Tolerate control only

Tolerate both

Principal strata in terms of treatment tolerance

13 of 33

Principal stratum – introduction

  1.  

13

0 (treatment non-tolerance)

1 (tolerance)

0 (treatment non-tolerance)

Tolerate neither

Tolerate test treatment only

1 (tolerance)

Tolerate control only

Tolerate both

14 of 33

Recurrent event project

14

15 of 33

Recurrent events with death as an intercurrent event

  • Recurrent events: An event that can be observed repeatedly for a single subject
  • Death complicates defining the estimand for recurrent events
  • Examples of patient journey:

15

  • Different follow up times
  • Data after death doesn’t exist
  • E.g. patient 1 lives longer than patient 3, but also has more recurrent events

Start of study

End of study

Recurrent event

Death

Drop out

1

2

3

4

16 of 33

Introduction to PARAGON-HF study

  • A multicenter, randomized, double-blind, parallel group, active-controlled phase III study completed in 2019
    • Population: heart failure patients (NYHA Class II-IV) with preserved ejection fraction
    • Treatment: sacubitril/valsartan vs. valsartan
    • Primary endpoint: composite endpoint of CV death and total heart failure hospitalizations
  • Results show that there is no obvious difference between the two arms on death
  • Treatment effect on recurrent hospitalizations is clinically relevant
    • Analysis of total hospitalizations in CSR (joint frailty model): RR=0.85, 95% CI: 0.72 - 1.00 

16

17 of 33

Principal stratum strategy

  1.  

17

18 of 33

Proposed analysis method – full Bayesian approach

  1.  

18

19 of 33

Proposed analysis method – full Bayesian approach

  1.  

19

20 of 33

Models

  1.  

20

21 of 33

Likelihood

  1.  

21

22 of 33

Estimation of the causal estimand

22

 

For each bth posterior sample:

...

...

 

 

 

...

...

...

...

Summarize across the B posterior samples: point estimate (median) and 95% credible interval

23 of 33

Revisit PARAGON-HF example

  • Dataset overview:
    • 4796 patients included
    • Baseline covariates were determined based on data availability and analyses from previous heart failure studies (Simpson et al., 2020*), e.g. gender, NYHA III/IV, NT-proBNP, etc.

23

*Simpson, J., Jhund, P.S., Lund, L.H., Padmanabhan, S., Claggett, B.L., Shen, L., Petrie, M.C., Abraham, W.T., Desai, A.S., Dickstein, K., Køber, L., Packer, M., Rouleau J.L., Mueller-Velten G., Solomon, S.D., Swedberg, K., Zile, M.R., and McMurray, J.J.V., 2020. Prognostic models derived in PARADIGM-HF and validated in ATMOSPHERE and the Swedish Heart Failure Registry to predict mortality and morbidity in chronic heart failure. JAMA Cardiology5(4), pp.432-441.

 

 

 

 

24 of 33

Estimation results

  • Estimated causal effect:
    • Rate ratio among patients who would be alive under both arms by the end of t year(s):

24

𝒕=𝟏

𝒕=𝟐

𝒕=𝟑

P(principal stratum)

93.3%

85.6% 

78.5% 

Rate ratio 

(95% CI)

0.773 

(0.639, 0.933)

0.858 

(0.725, 1.009)

0.891 

(0.753, 1.044)

25 of 33

Model checking

  • Assess the model fit by posterior predictive checks
  • Main idea: generate replicate datasets from the fitted models with random frailty and compare non-parametric measures based on the replicate data vs. observed data
  • We check the model fit with the following:
    • Kaplan-Meier estimator for time to all-cause death (KM)
    • Ghosh-Lin estimator for the mean number of recurrent events (HF hospitalizations) (GL)
  • Next we show how to do the checking with KM as an example

25

26 of 33

Model checking – detailed steps

  1.  

26

 

 

Observed data

 

For each bth posterior sample of parameters

Compare based on posterior predictive p-value

27 of 33

Model checking – p value

  1.  

27

28 of 33

Model checking – visual comparisons

  • KM plot for observed data (sold line) vs. KM plots for 100 replicate data (dashed lines)

28

29 of 33

29

  • Mean number of recurrences plot for observed data (sold line) vs. mean number of recurrences plots for 100 replicate data (dashed lines)

30 of 33

Concluding remarks

30

31 of 33

Concluding remarks

  • The proposed estimator for principal stratum estimand:
    • Straightforward and easy interpretation by joint modeling of survival and recurrent event processes
    • Flexible by including covariates and allowing time dependent intensity/hazard functions (Weibull)
    • Computationally efficient by working with the marginalized likelihood
  • The proposed model fits the PARAGON-HF data reasonably well
  • The discussed estimand is also called survivor average causal effects (SACE)
  • Limitations:
    • Relies on the assumption that all processes share the same frailty term – unverifiable
      • Hopefully can be mitigated by allowing different parameter values in recurrent event/survival process and also in treatment/control group
    • Parametric models – model checking is recommended

31

32 of 33

References

  • ICH (2019) ICH E9 (R1): addendum on estimands and sensitivity analysis in clinical trials to the guideline on statistical principles for clinical trials.
  • Lyu, T., Bornkamp, B., Mueller‐Velten, G. and Schmidli, H., Bayesian inference for a principal stratum estimand on recurrent events truncated by death. Biometrics.
  • Frangakis, C.E. & Rubin, D.B. (2002) Principal stratification in causal inference. Biometrics58(1), 21–29.
  • Comment, L., Mealli, F., Haneuse, S. & Zigler, C. (2019) Survivor average causal effects for continuous time: a principal stratification approach to causal inference with semicompeting risks. arXiv:1902.09304.
  • Simpson, J., Jhund, P.S., Lund, L.H., Padmanabhan, S., Claggett, B.L., Shen, L., et al. (2020) Prognostic models derived in PARADIGM-HF and validated in ATMOSPHERE and the Swedish Heart Failure Registry to predict mortality and morbidity in chronic heart failure. JAMA Cardiology5(4), 432–441.
  • Solomon, S.D., McMurray, J.J.V., Anand, I.S., Ge, J., Lam, C.S.P., Maggioni, A.P., et al. & PARAGON-HF Investigators and Committees (2019) Angiotensin–neprilysin inhibition in heart failure with preserved ejection fraction. New England Journal of Medicine381(17), 1609–1620.

32

33 of 33

Thank you