MTC: Multiresolution Tensor Completion from Partial and Coarse Observations
KDD 2021
Chaoqi Yang1, Navjot Singh1, Cao Xiao2, Cheng Qian3, Edgar Solomonik1, Jimeng Sun1
1UIUC, 2Amplitude, 3IQVIA
1
Outline
2
Partial and Coarse Data
3
Features | Coarse granular | Fine granular |
Mode 1: Location | state code (a) | Zip code (A) |
Mode 2: Disease | CCS code (b) | ICD-10 code (B) |
Mode 3: Time | Week (c) | Date (C) |
A
B
C
A
b
C
a
B
C
?
?
?
?
?
?
?
survey data at specific locations for severe diseases at important dates
different agencies have different data accessibility
from local government
hospital
Fine-granular Tensor Completion
4
Features | Coarse granular | Fine granular |
Mode 1: Location | state code (a) | Zip code (A) |
Mode 2: Disease | CCS code (b) | ICD-10 code (B) |
Mode 3: Time | Week (c) | Date (C) |
A
B
C
A
b
C
a
B
C
?
?
?
?
?
?
?
known
Partial observation
Coarse observations
A
B
C
unknown
Low-rank
recover
Outline
5
Background
6
Solution Sketch
7
Assume the tensor admits a low-rank CP structure
The tensor is related to the observations by operations on factors
Design objective on the observations
Optimize the objective function and obtain the factors
Our multiresolution approach
Recover
Relations to known observations
8
?
?
?
?
?
?
?
Partial observation
Coarse observations
A
B
C
A
B
C
A
b
C
a
B
C
B
b
A
a
binary mask
Objective Function
9
Parameters:
Optimize the Loss
10
Fewer iterations are needed in high resolution!
Reformulate Objective
11
Transform the loss function
New loss function (for k+1 iteration)
Original loss
Build Solver
12
New objective
Overall Framework
13
(1)
(2)
(3)
(4)
(5)
(6)
(7)
Subsampling and Interpolation
14
Continuous mode
Categorical mode
Continuous mode (e.g., date)
15
Intuition: smoothness allows subsampling
Categorical mode (e.g., ICD-10)
16
Intuition: dense part provides better estimation
Outline
17
Datasets
18
Baselines and Metric
19
Metric:
where
Main Comparison Results
20
Compared to Reference Models
21
Comparison on Optimization Landscape
22
Comparison on Initialization
23
Comparison on Future Tensor Prediction
24
…
…
Gaussian process prediction
Predicted tensor
Procedures:
Results:
Outline
25
Conclusion
26
References
[1] Lars Karlsson, Daniel Kressner, and André Uschmajew. 2016. Parallel Algorithms for Tensor Completion in the CP Format. Parallel Comput.57, C (2016).
[2] Yangyang Xu and Wotao Yin. 2013. A block coordinate descent method for regularized multi convex optimization with applications to nonnegative tensor factorization and completion. SIAM Journal on imaging sciences6, 3 (2013).
[3] Faisal M Almutairi, Charilaos I Kanatsoulis, and Nicholas D Sidiropoulos. 2021. PREMA: principled tensor data recovery from multiple aggregated views. IEEE Journal of Selected Topics in Signal Processing(2021).
[4] Evrim Acar, Tamara G Kolda, and Daniel M Dunlavy. 2011. All-at-once optimization for coupled matrix and tensor factorizations. arXiv:1105.3422(2011).
[5] J. Park, K. Carr, S. Zheng, Y. Yue, and R. Yu. 2020. Multiresolution Tensor Learning for Efficient and Interpretable Spatial Analysis. In ICML. PMLR, 7499–7509
[6] Faisal M Almutairi, Charilaos I Kanatsoulis, and Nicholas D Sidiropoulos. 2020. Tendi: Tensor disaggregation from multiple coarse views. In PAKDD. Springer.
27
MTC: Multiresolution Tensor Completion from Partial and Coarse Observations
Thanks for Listening!
Chaoqi Yang1, Navjot Singh1, Cao Xiao2, Cheng Qian3, Edgar Solomonik1, Jimeng Sun1
1UIUC, 2Amplitude, 3IQVIA
28