1 of 16

GreenEyes: An Air Quality Evaluating Model based on WaveNet

Kan Huang, the Hong Kong University of Science and Technology

Kai Zhang, Lehigh University

Ming Liu, the Hong Kong University of Science and Technology

2 of 16

Outlines

  • Motivation
  • Contributions
  • Datasets
  • Methodology
  • Experiments
  • Conclusion

3 of 16

Motivation

  • WaveNet’s residual layers can be used as feature extractors;
  • Attention and LSTM are good output endpoints for time series fitting;
  • Piece-wise linear function is a simple but useful representation of the trends of the target time series.

4 of 16

Contributions

  • We use WaveNet’s residual layers as a feature extractor block;
  • Stack several WaveNet blocks to build the model’s main body;
  • Put Attention and LSTM at the endpoint as output layers, and make ablation study.

5 of 16

Datasets

  • Four sensors of the same model;
  • At the same place;
  • The data are slightly different. Can they be augmented to make better fitting?

6 of 16

Methodology

Figure 4: Overview of the residual block and the entire architecture.

Oord, Aaron van den, et al. "Wavenet: A generative model for raw audio." arXiv preprint arXiv:1609.03499 (2016).

7 of 16

WaveNet Layer

 

tanh

1 Layer

 

 

Dilated

Conv

 

Residual

Input

8 of 16

WaveNet Block

Causal

Conv

 

tanh

k Layers

 

 

Dilated

Conv

 

Residual

Input

9 of 16

WaveNet Model

  • WaveNet Layer
  • WaveNet Block

 

tanh

k Layers

 

 

Dilated

Conv

 

Residual

Input

 

tanh

 

 

Dilated

Conv

 

Residual

Left layers…

dilation rate: 1

dilation rate: 2

dilation rate: 4, 8, 16…

10 of 16

WaveNet_LSTM Model

WaveNet_LSTM Model: Sequential/Cascading of WaveNet Block

Causal Conv

 

tanh

 

 

 

Dilated Conv

 

Residual

 

tanh

 

 

Dilated Conv

 

Residual

 

 

tanh

 

 

Dilated Conv

 

Residual

 

Input sequence

Target y

Bidirectional

LSTM

Attention

11 of 16

Data Processing

PM2.5 IAQI vs Time

Concentration thresholds of IAQI w.r.t. pollutant

categories, USA

12 of 16

Data Processing

Figure: Sensor 0’s 𝑃𝑀2.5 origin and labeled IAQI level.

Figure: Sensor 0’s labeled 𝑃𝑀2.5 IAQI level (solid line) and its polygonal line (dash line).

13 of 16

Experiments

Best metrics during training when applying Temporal

Attention.

  • The less the stride, the better the results.
  • Above conclusion is suitable for every channel’s data

14 of 16

Experiments - Ablation Study

Table: Test MSE and MAE for model with and w/o Attention. The model w/o Attention can perform better or is equivalent to the model applied with the Attention layer.

Figure: Validation MSE curves when applying official Attention

(𝑠𝑡𝑟𝑖𝑑𝑒 = 10).

Figure: Validation MSE curves when applying Temporal Attention (𝑠𝑡𝑟𝑖𝑑𝑒 = 10).

The model with the Temporal Attention layer can obtain

smaller loss during training

15 of 16

Experiments -

Figure: Window size 7200 vs 3600.

16 of 16

Thank you!

Q & A