Quality Estimation for Machine Translation
Fabio Kepler
Unbabel AI
August 2019
BUILDING UNIVERSAL UNDERSTANDING
Outline
Why Quality Estimation?
Is Machine Translation Solved?
“The AI Research team wobbles!”
We still need humans in the loop
MT Quality
What could we do if we knew the quality of a translation?
Example: Unbabel’s Pipeline
Definition
MT Quality Estimation (QE)
Datasets
MT: I really like Machine Translation
PE: I love Machine Translation !
Edits: delete, replace, insert
Word-level tags: BAD BAD BAD OK OK OK
Sentence-level
$\mathrm{HTER} = \frac{\text{edit distance}}{\text{PE words}} = \frac{3}{5} = 0.6$
BAD
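The MT/PE example above can be reproduced with a minimal sketch like the one below. It assumes a plain word-level Levenshtein alignment (no block shifts, unlike the full TER algorithm normally used to compute HTER), and the function names `edit_distance_and_tags` and `hter` are hypothetical, chosen only for this illustration. MT words that are deleted or replaced are tagged BAD; insertions have no MT word to tag.

```python
def edit_distance_and_tags(mt, pe):
    """Word-level Levenshtein distance from mt to pe, plus OK/BAD tags for mt words."""
    n, m = len(mt), len(pe)
    # dist[i][j] = number of edits needed to turn mt[:i] into pe[:j]
    dist = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(n + 1):
        dist[i][0] = i
    for j in range(m + 1):
        dist[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            substitute = dist[i - 1][j - 1] + (mt[i - 1] != pe[j - 1])
            delete = dist[i - 1][j] + 1
            insert = dist[i][j - 1] + 1
            dist[i][j] = min(substitute, delete, insert)

    # Backtrace one optimal alignment to decide which MT words were edited.
    tags = ["OK"] * n
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and dist[i][j] == dist[i - 1][j - 1] + (mt[i - 1] != pe[j - 1]):
            if mt[i - 1] != pe[j - 1]:
                tags[i - 1] = "BAD"  # word was replaced
            i, j = i - 1, j - 1
        elif i > 0 and dist[i][j] == dist[i - 1][j] + 1:
            tags[i - 1] = "BAD"  # word was deleted
            i -= 1
        else:
            j -= 1  # word inserted in the post-edit; no MT word to tag
    return dist[n][m], tags


def hter(mt, pe):
    """Edit distance divided by the number of post-edited (PE) words."""
    if not pe:
        raise ValueError("post-edit must contain at least one word")
    distance, _ = edit_distance_and_tags(mt, pe)
    return distance / len(pe)


mt = "I really like Machine Translation".split()
pe = "I love Machine Translation !".split()
distance, tags = edit_distance_and_tags(mt, pe)
print(distance, list(zip(mt, tags)), hter(mt, pe))
```

On this sentence pair the sketch finds 3 edits over 5 PE words, matching the HTER of 0.6 above, and tags "really" and "like" as BAD.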
WMT QE Shared Task
Models
QUETCH: QUality Estimation from ScraTCH
Kreutzer, J., Schamoni, S., & Riezler, S. (2015). “QUality Estimation from ScraTCH (QUETCH): Deep Learning for Word-level Translation Quality Estimation.” WMT@EMNLP.
NuQE: Neural Quality Estimation
[Figure: NuQE architecture. Inputs: embeddings of source word (3 x 64), source POS (3 x 50), target word (3 x 64), and target POS (3 x 50). Layers shown: 2 x FF (2 x 400), BiGRU (200), 2 x FF (2 x 200), BiGRU (100), 2 x FF (100 + 50), softmax over OK/BAD.]
Martins, A.F., Astudillo, R.F., Hokamp, C., & Kepler, F. (2016). “Unbabel's Participation in the WMT16 Word-Level Translation Quality Estimation Shared Task.” WMT16.
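As a small worked example of the embedding sizes listed for NuQE, the sketch below computes the size of the input vector for one target position. It assumes that "3 x 64" and "3 x 50" mean a 3-token context window with 64-dimensional word embeddings and 50-dimensional POS embeddings, and that the four embedding groups are concatenated before the first feed-forward layer; neither assumption is stated on the slide. The helper `concatenated_input_size` is hypothetical.

```python
# Assumed reading of the figure: a 3-token window per input,
# 64 dimensions per word embedding and 50 per POS embedding.
WINDOW = 3
EMBEDDING_DIMS = {
    "source word": 64,
    "source POS": 50,
    "target word": 64,
    "target POS": 50,
}


def concatenated_input_size(window, dims):
    """Size of the vector obtained by concatenating every windowed embedding."""
    return sum(window * dim for dim in dims.values())


input_size = concatenated_input_size(WINDOW, EMBEDDING_DIMS)
print(input_size)  # 3*64 + 3*50 + 3*64 + 3*50
```

Under these assumptions each position is represented by a 684-dimensional vector before the first feed-forward layer.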
Linear Model
APE for QE
SOTA 2016-2017
Predictor-Estimator
Kim, H., Lee, J.-H., & Na, S.-H. (2017). “Predictor-Estimator using Multilevel Task Learning with Stack Propagation for Neural Quality Estimation.” WMT17@EMNLP.
Predictor
ELMo
Source: The Illustrated BERT, ELMo, and co. (How NLP Cracked Transfer Learning), Jay Alammar, 2019.
BERT
Source: The Illustrated BERT, ELMo, and co. (How NLP Cracked Transfer Learning), Jay Alammar, 2019.
Estimator
Unfortunately, no reference implementation for any of these models
Implemented Models
OpenKiwi scoreboard
OpenKiwi toolkit
Goals
In a Nutshell
OpenKiwi production-easy
Or train and predict in one go
Simple example
Source
This is a simple sentence .
MT
C’ est une phrase simple .
BAD Example
Source
This is a simple sentence .
MT
C' est une phrase simple qui ajoute beaucoup de mots inutiles .
Demonstration
(not publicly available, yet)
Example: Unbabel’s Pipeline
WMT19 QE Shared Task
Surfing the wave
Predictor Flavors
Validation Results English-German
Validation Results English-Russian
Back to APE for QE
Ensemble methods
Word level
Sentence level
Dev set results English-{German,Russian}
Official Results Word-Level
Official Results Sentence-Level
History
Key Takeaways
Reference
Kepler, F., Trénous, J., Treviso, M., Vera, M., Góis, A., Farajian, M.A., Lopes, A.V., & Martins, A.F. (2019). “Unbabel’s Participation in the WMT19 Translation Quality Estimation Shared Task.” WMT19@ACL.
Finally
Future Directions
Thanks!
kepler@unbabel.com