Revisiting and Improving Scoring Fusion for Spoofing-aware Speaker Verification Using Compositional Data Analysis
Xin Wang , Tomi Kinnunen, Kong Aik Lee, Paul-Gauthier Noe, Junichi Yamagishi
NII, JST PRESTO, UEF, PolyU, Inria
Interspeech 2024
A4-O2.3 #442
wangxin@nii.ac.jp
1
Summary in one slide
Bayesian decision theory
2
Background: spoofing CM
protect human listeners
protect ASV
Spoofing CM
bona fide
spoofed
bona fide
3
Background: spoofing CM protecting ASV
ASV
Spoofing CM
enroll
bona fide
matched
bona fide
not matched
4
Background: spoofing-robust ASV (SASV)
A single deep neural network (DNN)
ASV
Spoofing CM
enroll
SASV
5
Background: spoofing-robust ASV (SASV)
A single deep neural network (DNN)
ASV
Spoofing CM
enroll
A single deep neural network (DNN)
6
Background: spoofing-robust ASV (SASV)
enroll
Fusion
ASV
Spoofing CM
7
Question: how to properly fuse ASV and CM
enroll
ASV
Spoofing CM
Fusion
8
Question: how to properly fuse ASV and CM
Jee-weon Jung, Hemlata Tak, Hye-jin Shim, Hee-Soo Heo, Bong-Jin Lee, Soo-Whan Chung, Ha-Jin Yu, Nicholas Evans, and Tomi Kinnunen. 2022. SASV 2022: The first spoofing-aware speaker verification challenge. In Proc. Interspeech, 2022. 2893–2897.
ASV
Spoofing CM
+
9
Question: how to properly fuse ASV and CM
Jee-weon Jung, Hemlata Tak, Hye-jin Shim, Hee-Soo Heo, Bong-Jin Lee, Soo-Whan Chung, Ha-Jin Yu, Nicholas Evans, and Tomi Kinnunen. 2022. SASV 2022: The first spoofing-aware speaker verification challenge. In Proc. Interspeech, 2022. 2893–2897.
ASV
Spoofing CM
+
10
Question: how to properly fuse ASV and CM
Jee-weon Jung, Hemlata Tak, Hye-jin Shim, Hee-Soo Heo, Bong-Jin Lee, Soo-Whan Chung, Ha-Jin Yu, Nicholas Evans, and Tomi Kinnunen. 2022. SASV 2022: The first spoofing-aware speaker verification challenge. In Proc. Interspeech, 2022. 2893–2897.
ASV
Spoofing CM
+
tanh
Any thoery to support the good pratice?
11
Answers by this work
ASV subsystem
ASV subsystem
+
Logistic regression
ASV
Spoofing CM
+
tanh
12
Answers by this work
We explain the practice in this talk
13
Method 1: linear fusion in good practice
ASV
Spoofing CM
+
Calibration
Calibration
14
Method 1: linear fusion in good practice
ASV
Spoofing CM
+
Calibration
Calibration
15
Method 1: linear fusion in good practice
ASV
Spoofing CM
+
Calibration
Calibration
Decisions in compositional data analysis
Three data classes but binary decisions!
(sec 2.2 and appendix)
16
Method 1: linear fusion in good practice
Geoffrey Stewart Morrison. 2013. Tutorial on logistic-regression calibration and fusion: converting a score to a likelihood ratio. Australian Journal of Forensic Sciences 45, 2 (2013), 173–197.
Scikit-learn: https://scikit-learn.org/stable/modules/calibration.html
ASV
+
calibration
Calibration
Spoofing CM
estimate {a,b} on using hold-out data
Logistic regression (Morrison 2013)
17
Method 1: linear fusion in good practice
Niko Brummer, Albert Swart, and David Van Leeuwen. 2014. A comparison of linear and non-linear calibrations for speaker recognition. In Proc. Odyssey, 2014. 14–18.
ASV
+
calibration
Calibration
Spoofing CM
Logistic regression
Generative calibration (Brummer 2014)
18
Method 1: linear fusion in good practice
Luciana Ferrer, "Analysis and Comparison of Classification Metrics", arXiv:2209.05355, https://github.com/luferrer/CalibrationTutorial
David A. van Leeuwen and Niko Brümmer. 2013. The distribution of calibrated likelihood-ratios in speaker recognition. In Proc. Interspeech, 2013. 1619–1623.
ASV
+
calibration
Calibration
Spoofing CM
Logistic regression
Generative calibration
Many other methods exist (Ferrer 2022, Leeuwen 2013)
19
Method 1: linear fusion in good practice
+
ASV
calibration
calibration
Spoofing CM
See more in Sec2.5 & Appendix
Cost | | |
Bona fide matched | 0 | Cmiss |
Bona fide unmatched | Cfa | 0 |
Spoofed | Cfa | 0 |
20
Method 2: non-linear fusion is better
ASV
calibration
calibration
Spoofing CM
Cost | | |
Bona fide matched | 0 | Cmiss |
Bona fide unmatched | Cfa | 0 |
Spoofed | Cfa | 0 |
fuse
for Cfa=Cmiss
See more in Sec2.5 & Appendix
21
Method 2: non-linear fusion is better
Tomi H. Kinnunen, Kong Aik Lee, Hemlata Tak, Nicholas Evans, and Andreas Nautsch. 2023. t-EER: Parameter-Free Tandem Evaluation of Countermeasures and Biometric Comparators. IEEE Trans. Pattern Anal. Mach. Intell. (2023), 1–16. https://doi.org/10.1109/TPAMI.2023.3313648
ASV
Calibration
Calibration
Spoofing CM
fuse
for Cfa=Cmiss
Asserted spoofing prior (Kinnuen 2023)
22
Method 2: non-linear fusion is better
Tomi H. Kinnunen, Kong Aik Lee, Hemlata Tak, Nicholas Evans, and Andreas Nautsch. 2023. t-EER: Parameter-Free Tandem Evaluation of Countermeasures and Biometric Comparators. IEEE Trans. Pattern Anal. Mach. Intell. (2023), 1–16. https://doi.org/10.1109/TPAMI.2023.3313648
ASV
Calibration
Calibration
Spoofing CM
fuse
for Cfa=Cmiss
Asserted spoofing prior (Kinnuen 2023)
23
Method 2: non-linear fusion is better
Massimiliano Todisco, Héctor Delgado, Kong Aik Lee, Md Sahidullah, Nicholas Evans, Tomi Kinnunen, and Junichi Yamagishi. 2018. Integrated presentation attack detection and automatic speaker verification: Common features and gaussian back-end fusion. In Proc. Interspeech, 2018. 77–81.
ASV
Calibration
Calibration
Spoofing CM
fuse
for Cfa=Cmiss
Asserted spoofing prior (Kinnuen 2023)
A general form of Gaussian fusion (Todisco 2018)
24
Demo on toy data set
25
Demo on toy data set
26
Demo on toy data set
27
Recap the practices
ASV
Calibration
Calibration
Spoofing CM
fuse
Linear fusion
Non-linear fusion
All are supported by decision theory
28
Experiments
Jee-weon Jung, Hemlata Tak, Hye-jin Shim, Hee-Soo Heo, Bong-Jin Lee, Soo-Whan Chung, Ha-Jin Yu, Nicholas Evans, and Tomi Kinnunen. 2022. SASV 2022: The first spoofing-aware speaker verification challenge. In Proc. Interspeech, 2022. 2893–2897.
29
Experiments
better
worse
Systems with different fusion & calibration methods
SASV-EER
(Jung2022)
From other papers
other metrics
linear
linear
non-linear
30
Experiments
ASV
CM
+
ASV
CM
+
logistic reg. calibration
logistic reg. calibration
ASV
CM
+
Gaussian +
logistic reg.
Gaussian +
logistic reg.
log.reg. calibration
no
calibration
log.reg. + Gaussian calibration
linear
linear
baseline
good linear fusion
good linear fusion
31
Experiments
ASV
CM
+
ASV
CM
+
logistic reg. calibration
logistic reg. calibration
ASV
CM
+
Gaussian +
logistic reg.
Gaussian +
logistic reg.
bona fide matched
bona fide unmatched
spoofed
baseline
good linear fusion
good linear fusion
32
Experiments
ASV
CM
+
ASV
CM
+
logistic reg. calibration
logistic reg. calibration
ASV
CM
+
Gaussian +
logistic reg.
Gaussian +
logistic reg.
bona fide matched
bona fide unmatched
spoofed
33
Experiments
Jee-weon Jung, Hemlata Tak, Hye-jin Shim, Hee-Soo Heo, Bong-Jin Lee, Soo-Whan Chung, Ha-Jin Yu, Nicholas Evans, and Tomi Kinnunen. 2022. SASV 2022: The first spoofing-aware speaker verification challenge. In Proc. Interspeech, 2022. 2893–2897.
You Zhang, Ge Zhu, and Zhiyao Duan. 2022. A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification. In Proc. Odyssey, June 28, 2022. ISCA, 77–84.
linear
non-liear
(Jung 2022)
(Zhang 2022)
The difference is small on this database
good linear fusion
good non-linear fusion
34
Main messages
ASV
Calibration
Calibration
Spoofing CM
fuse
35
Pointers
Hye-jin Shim, Jee-weon Jung, Tomi Kinnunen, Nicholas Evans, Jean-Francois Bonastre, and Itshak Lapidot. 2024. a-DCF: an architecture agnostic metric with application to spoofing-robust speaker verification. In Proc. Odyssey, 2024. 158–164. https://doi.org/10.21437/odyssey.2024-23
Jee-weon Jung, Xin Wang, Nicholas Evans, Shinji Watanabe, Hye-jin Shim, Hemlata Tak, Sidhhant Arora, Junichi Yamagishi, and Joon Son Chung. 2024. To what extent can ASV systems naturally defend against spoofing attacks? In Proc. Interspeech, 2024. .
A4-05.5
36
Thank you
Appendix
theory in details
Code & Jupyter notebook
step-by-step explanation
ASVspoof
37
Fusing CM & ASV is special
Assuming a 1-0 decision cost
ASV
match
not match
decision
scoring
38
Fusing CM & ASV is special
ASV
Face recognition
decision
scoring
+
39
Fusing CM & ASV is special
ASV
CM
decision
scoring
+
40
Fusing CM & ASV is special
FAKE
Bayes’ rule
&
Isometric-log-ratio
1
Simplex
Optimal way using ternary hypothesis testing
What we need
FAKE
FAKE
41
Fusing CM & ASV is special
FAKE
Bayes’ rule
&
Isometric-log-ratio
1
Simplex
FAKE
vs
log likelihood ratio
vs
log likelihood ratio
42