Hierarchical Core Decomposition in Parallel:�From Construction to Subgraph Search
1
Core Decomposition
2
Limitations of Core Decomposition
3
2.
1. Hierarchical Core Decomposition (HCD)
4
Applications of HCD
5
2. Subgraph Search on HCD
Work-Efficient�Parallel Algorithms
6
Score=0.8
Score=0.6
Work-Efficient: #steps matches the best serial time complexity
Score=0.5
Score=0.4
Applications of Subgraph Search on HCD
7
[1] D. Chu, F. Zhang, X. Lin, W. Zhang, Y. Zhang, Y. Xia, and C. Zhang, “Finding the best k in core decomposition: A time and space optimal solution,” in ICDE. IEEE, 2020, pp. 685–696.
Roadmap
8
K-Core and Core Decomposition
9
⬤ Nodes with coreness 3
⬤ Nodes with coreness 2
⬤ Nodes with coreness 1
1. Hierarchical Core Decomposition
10
Definition: Community Scoring Metric
Primary Value (for a subgraph 𝑺)
Most metrics are based on Primary Values
Community Scoring Metric (for a subgraph 𝑺)
To measure community quality
11
Type-A
Type-B
Higher-Order
2. Subgraph Search on HCD
12
Roadmap
13
Existing Works of HCD Construction
14
[1] D. W. Matula and L. L. Beck, “Smallest-last ordering and clustering and graph coloring algorithms,” J. ACM, vol. 30, no. 3, pp. 417–427, 1983.
Intuition of Our Solution
15
Vertex Rank and Pivot
16
pivot
⬤ Nodes with coreness 3
⬤ Nodes with coreness 2
⬤ Nodes with coreness 1
Vertex Rank and Pivot
1. Group Vertices
2. Find Parent
17
pivot
old pivot
pivot
Our Solution
18
Our Solution
19
pivot
old pivot
add ● to the graph
Our Solution
20
pivot
old pivot
Our Solution
21
2. Union ● with neighbors in union-find and pivot is changed
pivot
old pivot
Our Solution
22
3. Group ● by pivot into tree node
pivot
old pivot
Our Solution
23
4. In the same CC, the tree node containing ● and ● have parent-child relation
pivot
old pivot
Union-find with Pivot
24
Parallel Vertex Rank Computation
25
2. Group vertices by coreness
1. Distribute vertices
is parallel
……
……
……
……
3. Place back into array, obtain vertex rank
PHCD Analysis
26
Roadmap
27
Existing Works of Subgraph Search
Limitations of BKS [1] (time- and space-optimal serial solution)
28
[1] D. Chu, F. Zhang, X. Lin, W. Zhang, Y. Zhang, Y. Xia, and C. Zhang, “Finding the best k in core decomposition: A time and space optimal solution,” in ICDE. IEEE, 2020, pp. 685–696.
Our Solution
29
Our Solution
30
3. Score Computation on HCD (Intuition)
3. Type-A Score Computation on HCD
32
5v, 8e
4v, 6e
9v, 14e
9v, 9e
5v, 8e
4v, 6e
18v, 28e
27v, 37e
i. Vertex-centric
contribution
……
……
ii. Tree node contribution
Aggregate
iii. Primary value of k-cores
Bottom-Up
Summation
Compute
is parallel
3.2
3
3.11
2.74
iv. Score of k-cores
Score
3. Type-B Score Computation on HCD
33
Our Solution
34
The highest score
PBKS Time Analysis
35
Roadmap
36
Dataset Statistics & Experimental Setting
37
Parallel HCD Construction Time
Significantly faster than LCPS
38
Increasing network size
Parallel HCD Construction Time
Our PHCD is close to lower bound
39
Parallel HCD Construction Time
Our PHCD is close to lower bound
D&C paradigm is inefficient in building HCD
40
Parallel Subgraph Search Time
Significantly faster than BKS
41
Increasing network size
Application: Densest Subgraph
42
[1] Fang, Yixiang, et al. Efficient algorithms for densest subgraph discovery. PVLDB 2019.
[2] D. Chu, F. Zhang, X. Lin, W. Zhang, Y. Zhang, Y. Xia, and C. Zhang, “Finding the best k in core decomposition: A time and space optimal solution,” in ICDE. IEEE, 2020, pp. 685–696.
Better is bold
Application: Maximum Clique
43
Roadmap
44
Conclusion
45
Q & A
46