Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
Wang et al. (ICML 2018)
Agenda
Motivation
Related work / SOTA
Model Architecture - I
Model Architecture - II
Model Architecture - III
Model Architecture - IV
Model Architecture - IV
Inference
Interpretation of GST
How are GSTs capturing prosody?
Experiments - Style Control and Transfer
Experiments - Style Control and Transfer
Experiments - Unlabelled Noisy Found Data
Experiments - Unlabelled Noisy Found Data
Experiments - Unlabelled Noisy Found Data
Doubts / Things to discuss
Relation to our project