Accelerating Recursive Partition-Based Causal Structure Learning
1Department of Computer Science and Engineering, University of Dhaka, Bangladesh
2School of Electrical and Computer Engineering, Purdue University, USA
3Department of Computer Science and Engineering, University of South Carolina, USA
Md Musfiqur Rahman1
Research Assistant
musfiq14shohan@gmail.com
@musfiq_shohan
Ayman Rasheed1
Research Assistant
aymanrasheed7
@gmail.com
Md. Mosaddek Khan1
Assistant Professor
mosaddek@du.ac.bd
Mohammad Ali Javidian2
Postdoctoral Scholar
mjavidia@purdue.edu
@ali_javidian
Pooyan Jamshidi3
Assistant Professor
pjamshid@cse.sc.edu
@PooyanJamshidi
Md. Mamun-Or-Rashid1
Professor
mamun@cse.du.ac.bd
Correlation does not imply causation!
2
Observational Data
Existing machine learning approaches
Correlation does not imply causation!
3
Observational Data
Existing machine learning approaches
Causal Inference
Correlation does not imply causation!
4
Observational Data
Existing machine learning approaches
Causal Inference
Incorrect decision-making!
Correlation does not imply causation!
5
Observational Data
Existing machine learning approaches
Causal Inference
Incorrect decision-making!
Causal structures/graphs
Causal Graphs and Causal Discovery
6
Causal Graphs and Causal Discovery
7
How we infer causal relations.
8
How we infer causal relations.
9
How we infer causal relations.
10
How we infer causal relations.
11
Recursive Causal Discovery is appropriate for large problems.
12
Recursive Causal Discovery is appropriate for large problems.
13
Recursive Causal Discovery is appropriate for large problems.
14
Recursive Causal Discovery is appropriate for large problems.
15
Recursive Causal Discovery is appropriate for large problems.
16
Recursive Causal Discovery is appropriate for large problems.
17
Benchmark algorithms are not perfect. Therefore, they execute refinement steps to remove undesired/redundant edges. But these refinement procedures are very expensive.
18
Existing approaches experience expensive refinement.
SADA
Costly partitioning with violation. Removes the less significant edges.
19
Existing approaches experience expensive refinement.
SADA
Costly partitioning with violation. Removes the less significant edges.
CAPA
Severe execution time overhead due to the third partition.
20
Existing approaches experience expensive refinement.
SADA
Costly partitioning with violation. Removes the less significant edges.
CAPA
Severe execution time overhead due to the third partition.
CP
D-separation violation. CI-tests for every pair of variables.
21
The Dsep-CP algorithm
22
In terms of contributions, Dsep-CP:
The Dsep-CP algorithm
23
In terms of contributions, Dsep-CP:
The Dsep-CP algorithm
24
In terms of contributions, Dsep-CP:
Preliminaries for Dsep-CP algorithm
25
Preliminaries for Dsep-CP algorithm
26
1⟂5 | {4}
The Dsep-CP Algorithm: The improved recursive learning
27
The Dsep-CP Algorithm: The improved recursive learning
28
The Dsep-CP Algorithm: The improved recursive learning
29
The Dsep-CP starts with a variable set and observational data
30
Finds efficient causal partitions to split the variable set.
31
Recursively solves the sub-problems with PC algorithm
32
PC algorithm is not always efficient.
Merges the discovered sub-graphs (May include false edges).
33
Detects the false edges with Y-structure.
34
Expensive!
Detects the false edges with Y-structure.
35
Detects the false edges with Y-structure.
36
Detects the false edges with Y-structure.
37
Detects the false edges with Y-structure.
38
Detects the false edges with Y-structure.
39
Detects the false edges with Y-structure.
40
Detects the false edges with Y-structure.
41
Detects the false edges with Y-structure.
42
The final discovered graph
Y-structures can detect false edges- Theoretically proved
43
Experimental Evaluation
44
Dsep-CP produces solutions of similar or better quality.
Experiments on Simulated Structures
45
Dsep-CP produces solutions of similar or better quality.
Experiments on Simulated Structures
46
Dsep-CP produces solutions of similar or better quality.
Experiments on Simulated Structures
47
Experiments on Simulated Structures
48
For 200-400 nodes, Dsep-CP runs 83-92% faster than SADA and 13-14% faster than CP.
Experiments on Simulated Structures
49
Experiments on Simulated Structures
50
Experiments on Simulated Structures
51
Reduces refining time and refining CI-tests by 98-99% compared to SADA and by 73-75% compared to CP.
Experiments on Simulated Structures
52
Experiments on Real-World Structures (Sample size = 500)
53
Experiments on Real-World Structures (Sample size = 500)
54
Experiments on Real-World Structures (Sample size = 500)
55
Experiments on Real-World Structures (Sample size = 500)
56
Produces solutions of similar or better quality for all eight real-world networks.
Experiments on Real-World Structures (Sample size = 500)
57
Experiments on Real-World Structures (Network: Mildew)
58
Experiments on Real-World Structures (Network: Mildew)
Also, for Large real-world networks (200+ nodes), 37- 58% faster than SADA and around 11-13% faster than CP.
59
Experiments on Real-World Structures (Network: Mildew)
60
Experiments on Real-World Structures (Network: Mildew)
61
Experiments on Real-World Structures (Network: Mildew)
Reducing refining time and CI-tests by 51-81% compared to SADA and by 81-86% compared to CP.
62
63
64
65
66