Conditional Chow-Liu Tree Structures for Modeling Discrete-Valued Vector Time Series
Sergey Kirshner, UC Irvine
Padhraic Smyth, UC Irvine
Andrew Robertson, IRI
July 10, 2004
UAI-2004 © Sergey Kirshner, UC Irvine
Overview
2
UAI-2004 © Sergey Kirshner, UC Irvine
Snapshot of the Data
3
1
2
3
4
5
1
2
3
4
5
6
7
8
…
…
T
N
UAI-2004 © Sergey Kirshner, UC Irvine
Data Aspects
4
UAI-2004 © Sergey Kirshner, UC Irvine
Modeling Precipitation Occurrence
5
Southwestern Australia, 1978-92
Western US, 1952-90
UAI-2004 © Sergey Kirshner, UC Irvine
A Bit of Notation
6
A1
B1
Z1
C1
R1
A2
B2
Z2
C2
R2
AT
BT
ZT
CT
RT
UAI-2004 © Sergey Kirshner, UC Irvine
Weather Generator
7
R1
R2
RT
A1
B1
Z1
C1
A2
B2
Z2
C2
AT
BT
ZT
CT
UAI-2004 © Sergey Kirshner, UC Irvine
Hidden Markov Model
8
R1
R2
Rt
RT-1
RT
S1
S2
St
ST-1
ST
UAI-2004 © Sergey Kirshner, UC Irvine
HMM-Conditional-Independence
9
Rt
St
St
At
Ct
Zt
Bt
=
R1
R2
Rt
RT-1
RT
S1
S2
St
ST-1
ST
UAI-2004 © Sergey Kirshner, UC Irvine
HMM-CI: Is It Sufficient?
10
UAI-2004 © Sergey Kirshner, UC Irvine
Chow-Liu Trees
11
UAI-2004 © Sergey Kirshner, UC Irvine
Illustration of CL-Tree Learning
12
0.3126
0.0229
0.0172
0.0230
0.0183
0.2603
AB
AC
AD
BC
BD
CD
(0.56, 0.11, 0.02, 0.31)
(0.51, 0.17, 0.17, 0.15)
(0.53, 0.15, 0.19, 0.13)
(0.44, 0.14, 0.23, 0.19)
(0.46, 0.12, 0.26, 0.16)
(0.64, 0.04, 0.08, 0.24)
A
C
B
D
A
C
B
D
AB
AC
AD
BC
BD
CD
0.3126
0.0229
0.0172
0.0230
0.0183
0.2603
(0.56, 0.11, 0.02, 0.31)
(0.51, 0.17, 0.17, 0.15)
(0.53, 0.15, 0.19, 0.13)
(0.44, 0.14, 0.23, 0.19)
(0.46, 0.12, 0.26, 0.16)
(0.64, 0.04, 0.08, 0.24)
A
C
B
D
UAI-2004 © Sergey Kirshner, UC Irvine
Chow-Liu Trees
13
UAI-2004 © Sergey Kirshner, UC Irvine
HMM-Chow-Liu
14
R1
R2
Rt
RT-1
RT
S1
S2
St
ST-1
ST
Rt
St
Bt
Dt
Ct
Bt
Dt
Ct
Bt
Dt
Ct
St
St=1
St=2
St=3
=
T1(Rt)
T2(Rt)
T3(Rt)
At
At
At
UAI-2004 © Sergey Kirshner, UC Irvine
Improving on Chow-Liu Trees
15
UAI-2004 © Sergey Kirshner, UC Irvine
Conditional Chow-Liu Forests
16
UAI-2004 © Sergey Kirshner, UC Irvine
Example of CCL-Forest Learning
17
B’
A’
C’
B
A
C
0.3126
0.0229
0.0230
0.1207
0.1253
0.0623
0.1392
0.1700
0.0559
0.0033
0.0030
0.0625
AB
AC
BC
A’A
A’B
A’C
B’A
B’B
B’C
C’A
C’B
C’C
(0.56, 0.11, 0.02, 0.31)
(0.51, 0.17, 0.17, 0.15)
(0.44, 0.14, 0.23, 0.19)
(0.57, 0.11, 0.11, 0.21)
(0.51, 0.17, 0.07, 0.25)
(0.54, 0.14, 0.14, 0.18)
(0.52, 0.07, 0.16, 0.25)
(0.48, 0.10, 0.11, 0.31)
(0.47, 0.11, 0.21, 0.21)
(0.48, 0.20, 0.20, 0.12)
(0.41, 0.26, 0.17, 0.16)
(0.53, 0.14, 0.14, 0.19)
AB
AC
BC
A’A
A’B
A’C
B’A
B’B
B’C
C’A
C’B
C’C
0.3126
0.0229
0.0230
0.1207
0.1253
0.0623
0.1392
0.1700
0.0559
0.0033
0.0030
0.0625
B’
A’
C’
B
A
C
B’
A’
C’
B
A
C
B’
A’
C’
B
A
C
UAI-2004 © Sergey Kirshner, UC Irvine
AR-HMM
18
R1
Rt
RT
S1
St
ST
Rt-1
St-1
R2
S2
UAI-2004 © Sergey Kirshner, UC Irvine
HMM-Conditional-Chow-Liu
19
St
Rt-1
Rt
R1
Rt
RT
S1
St
ST
Rt-1
St-1
R2
S2
At-1
Bt-1
Ct-1
Dt-1
At
Bt
Ct
Dt
Dt-1
Ct-1
Bt-1
At-1
Ct
Dt
At
Bt
Dt-1
Ct-1
Bt-1
At-1
Dt
Ct
At
Bt
St
St=1
St=2
St=3
=
UAI-2004 © Sergey Kirshner, UC Irvine
Inference and Learning for HMM-CL and HMM-CCL
20
UAI-2004 © Sergey Kirshner, UC Irvine
Chain Chow-Liu Forest (CCLF)
21
R1
Rt
RT
Rt-1
R2
Rt
Rt-1
Bt
Ct
Dt
At
At
Bt
Ct
Dt
=
UAI-2004 © Sergey Kirshner, UC Irvine
Complexity Analysis
22
Model Criterion | HMM-CI | HMM-CL | HMM-CCL |
# params | K2+MK(V-1) | K2+K(M-1)(V2-1) | K2+KM(V2-1) |
Time (per iteration) | O(NTK(K+M)) | O(NTK(K+M2V2)) | O(NTK(K+ +M2V2)) |
Space | O(NTK(K+M)) | O(NTK(K+M)+KM2V2) | O(NTK(K+M)+ +KM2V2) |
N – number of sequences
T – length of each sequence
K – number of hidden states
M – dimensionality of each vector
V – number of possible values for each vector component
UAI-2004 © Sergey Kirshner, UC Irvine
Experimental Setup
23
UAI-2004 © Sergey Kirshner, UC Irvine
Australia (log-likelihood)
24
UAI-2004 © Sergey Kirshner, UC Irvine
Australia (predictive error)
25
UAI-2004 © Sergey Kirshner, UC Irvine
Deeper Look at Weather States
26
UAI-2004 © Sergey Kirshner, UC Irvine
Western U.S. (log-likelihood)
27
UAI-2004 © Sergey Kirshner, UC Irvine
Western U.S. (predictive error)
28
UAI-2004 © Sergey Kirshner, UC Irvine
Summary
29
UAI-2004 © Sergey Kirshner, UC Irvine
Future Work
30
UAI-2004 © Sergey Kirshner, UC Irvine
Acknowledgements
31
UAI-2004 © Sergey Kirshner, UC Irvine
A Bit of Notation
32
A1
B1
Z1
C1
R1
A2
B2
Z2
C2
R2
At
Bt
Zt
Ct
Rt
AT
BT
ZT
CT
Rt
UAI-2004 © Sergey Kirshner, UC Irvine
Weather Generator
33
A1
B1
Z1
C1
R1
A2
B2
Z2
C2
R2
AT
BT
ZT
CT
RT
UAI-2004 © Sergey Kirshner, UC Irvine
Illustration of CL-Tree Learning
34
A
C
B
D
AB
AC
AD
BC
BD
CD
(0.56, 0.11, 0.02, 0.31)
(0.51, 0.17, 0.17, 0.15)
(0.53, 0.15, 0.19, 0.13)
(0.44, 0.14, 0.23, 0.19)
(0.46, 0.12, 0.26, 0.16)
(0.64, 0.04, 0.08, 0.24)
0.3126
0.0229
0.0172
0.0230
0.0183
0.2603
A
C
B
D
UAI-2004 © Sergey Kirshner, UC Irvine
Mixture of Chow-Liu Trees
35
Rt
St
Bt
Dt
Ct
Bt
Dt
Ct
Bt
Dt
Ct
St
St=1
St=2
St=3
=
T1(Rt)
T2(Rt)
T3(Rt)
At
At
At
UAI-2004 © Sergey Kirshner, UC Irvine
HMM-Chow-Liu
36
UAI-2004 © Sergey Kirshner, UC Irvine
HMM-Conditional-Chow-Liu
37
UAI-2004 © Sergey Kirshner, UC Irvine
Conditional Chow-Liu Forests
38
UAI-2004 © Sergey Kirshner, UC Irvine
Chain Chow-Liu Forest (CCLF)
39
UAI-2004 © Sergey Kirshner, UC Irvine
Weather Generator
40
R11
R12
R14
R13
R1
R21
R22
R24
R23
R2
RT1
RT2
RT4
RT3
RT
UAI-2004 © Sergey Kirshner, UC Irvine
Complexity Analysis
41
| HMM-CI | HMM-CL | HMM-CCL | CCLF |
# params | K2+MK(V-1) | K2+K(M-1)* (V2-1) | K2+KM(V2-1) | M(V2-1) |
Time | O(TK2M) | O(TK2M) | O(TK2M) | O(TM) |
Space | O( | | | O(TV+) |
UAI-2004 © Sergey Kirshner, UC Irvine