1 of 28

Causal Design Patterns

Emily Riederer

@emilyriederer

February 15, 2021

Based on emily.rbind.io/post/causal-design-patterns/

2 of 28

A

B

A

B

A

B

A

B

A

B

A

B

A

B

A

B

A

B

A

B

A

B

A

B

A

B

A

B

A

B

A

B

A

B

A

B

A

B

A

B

A

3 of 28

Can’t test	Ethics Reputational risk Logistics Made a mistake
Expensive to test	Direct costs Implementation cost Opportunity cost
Why wait?	Long term endpoints More historical variants

4 of 28

Why observational causal inference?

Can’t test	Ethics Reputational risk Logistics Made a mistake
Expensive to test	Direct costs Implementation cost Opportunity cost
Why wait?	Long term endpoints More historical variants

5 of 28

Unifying themes

Compare potential outcomes of the observed versus the counterfactual �
Create a counterfactual by exploiting any semi-random variation �
Exploit variation in distribution, in assignment, and across time

6 of 28

Four strategies for causal inference

7 of 28

When we have imbalance...

When you have:

- “similar” treated and untreated individuals

- different distributions

- on few relevant dimensions

Tries to:

Rebalance to make groups more comparable

8 of 28

Stratification overview

Assumption:

All common causes of treatment and outcome are accounted for
All observations have positive probability of treatment
Few variables require adjustment

Recipe:

Bin population by subgroups
Calculate average by group
Weight average across groups

9 of 28

Stratification application

Scenario:

Attempt to A/B test “one-click instant checkout” on Black Friday
Due to a glitch, Chrome users see the button 50% of the time but Mozilla users only 30%
Mozilla users spend less on average

Saw button

Didn’t see

Chrome

Mozilla

10 of 28

When we have imbalance along many dimensions...

When you have:

- “similar” treated and untreated individuals

- different distributions

- on many dimensions

Tries to:

Rebalance to make groups more comparable

11 of 28

Propensity Score Weighting overview

Assumption:

All common causes of treatment and outcome are accounted for
All observations have positive probability of treatment

Recipe:

Model probability of receiving treatment based on traits
Derive weights from predicted probabilities
Apply weights when calculating average outcome by group

12 of 28

Propensity Score Weighting application

Scenario:

We sent a text message to all customers with a valid number and want to measure the effect on likelihood of purchase
Only customers for whom we lack a phone number are untreated
Number-less customers are also less active on average

Have phone

No phone

P(Phone|X)

No phone - Reweighted

13 of 28