Decision Theory in Practice: Validation, Accuracy, and Trade-offs
By Vera Wilde, Ph.D.
About me
Academic:
Civil society:
The Validation Problem
“There’s a major difference between asking people about something that
you can verify and asking
them about something that
you can’t…”
- Stephen Fienberg, 2009
interview
Image: The American Academy of Social and Political Science,
Binary Screening Test Results
Case Study: Chat Control
Scaled up from Fienberg et al’s 2003 NAS polygraph report, Table S-1: https://nap.nationalacademies.org/read/10420/chapter/2#5.
Case Study: Chat Control
Irresolvable Tension
Why the Trade-Off?
detection realities.
mathematical laws
What’s the Problem?
Image: Sherkiya Wedgeworth, CC Attribution-NonComm. 4.0 Int’l Lic.
Case Study: UTIs in Primary Care
Table 1 Summary statistics for laboratory tests and initial antibiotic prescribing | |||||||
| All tested | Positive test | Negative test | ||||
| N | Bacterial rate | Prescrib. rate | N | Prescrib. rate | N | Prescrib. rate |
2010 | 17,513 | 0.37 | 0.39 | 6,411 | 0.60 | 11,102 | 0.27 |
2011 | 21,237 | 0.39 | 0.39 | 8,305 | 0.60 | 12,932 | 0.25 |
2012 | 27,169 | 0.39 | 0.39 | 10,510 | 0.61 | 16,659 | 0.25 |
Total | 65,919 | 0.38 | 0.39 | 25,226 | 0.61 | 40,693 | 0.26 |
Bias-variance trade-off
Image: The American Academy of Social and Political Science,
Persistent Uncertainties
Image: Cristian Faezi & Omar Vidal.
Application | Target | Bycatch |
Trawling | Tuna | Dolphin |
Polygraph | Spies, Terrorists | Non-spies, non-terrorists |
iBorderCtrl | Bad crossings | Innocent crossings |
ChatControl | CSAM | Innocent coms |
Asymptomatic cancer screenings | Deaths | Healthy people |
Lifestyle diseases | Big problems | Mild cases |
Advanced medical imaging | See problems | Harmless anomalies |
Educational ethics | Plagiarism and AI use in writing | Innocent students |
Misinformation | Provably wrong | Ambiguity, dissent |
Disinformation | Hostile propaganda | Counterpoint |
Application | Target | Bycatch |
Trawling | Tuna | Dolphin |
Polygraph | Spies, Terrorists | Non-spies, non-terrorists |
iBorderCtrl | Bad crossings | Innocent crossings |
ChatControl | CSAM | Innocent coms |
Asymptomatic cancer screenings | Deaths | Healthy people |
Lifestyle diseases | Big problems | Mild cases |
Advanced medical imaging | See problems | Harmless anomalies |
Educational ethics | Plagiarism and AI use in writing | Innocent students |
Misinformation | Provably wrong | Ambiguity, dissent |
Disinformation | Hostile propaganda | Counterpoint |
Application | Target | Bycatch |
Trawling | Tuna | Dolphin |
Polygraph | Spies, Terrorists | Non-spies, non-terrorists |
iBorderCtrl | Bad crossings | Innocent crossings |
ChatControl | CSAM | Innocent coms |
Asymptomatic cancer screenings | Deaths | Healthy people |
Lifestyle diseases | Big problems | Mild cases |
Advanced medical imaging | See problems | Harmless anomalies |
Educational ethics | Plagiarism and AI use in writing | Innocent students |
Misinformation | Provably wrong | Ambiguity, dissent |
Disinformation | Hostile propaganda | Counterpoint |
Application | Target | Bycatch |
Trawling | Tuna | Dolphin |
Polygraph | Spies, Terrorists | Non-spies, non-terrorists |
iBorderCtrl | Bad crossings | Innocent crossings |
ChatControl | CSAM | Innocent coms |
Asymptomatic cancer screenings | Deaths | Healthy people |
Lifestyle diseases | Big problems | Mild cases |
Advanced medical imaging | See problems | Harmless anomalies |
Educational ethics | Plagiarism and AI use in writing | Innocent students |
Misinformation | Provably wrong | Ambiguity, dissent |
Disinformation | Hostile propaganda | Counterpoint |
Application | Target | Bycatch |
Trawling | Tuna | Dolphin |
Polygraph | Spies, Terrorists | Non-spies, non-terrorists |
iBorderCtrl | Bad crossings | Innocent crossings |
ChatControl | CSAM | Innocent coms |
Asymptomatic cancer screenings | Deaths | Healthy people |
Lifestyle diseases | Big problems | Mild cases |
Advanced medical imaging | See problems | Harmless anomalies |
Educational ethics | Plagiarism and AI use in writing | Innocent students |
Misinformation | Provably wrong | Ambiguity, dissent |
Disinformation | Hostile propaganda | Ambiguity, dissent |
A Dangerous Structure
Image: Russell Lee, 1942, public domain. Coolidge, Pinal County, Arizona. Casa Grande Farms, FSA (Farm Security Administration) project. Pigs at a feed trough.
Validation problem spectrum
Chat Control
(unsolved)
UTIs in Danish primary care
Validation problem spectrum
Chat Control
(unsolved)
UTIs in Danish primary care
Policy problems:
References
Signal detection theory and psychophysics, by Green & Swets. John Wiley, 1966.
“How to Improve Bayesian Reasoning Without Instruction: Frequency Formats,”
Gerd Gigerenzer & Ulrich Hoffrage, Psychological Review, 102(4), October
1995.
The Polygraph and Lie Detection, Stephen Fienberg et al, National Academies
Press, 2002.
“The Need for Cognitive Science in Methodology,” Sander Greenland, American
Journal of Epidemiology, Vol. 186, No. 6, 15 September 2017, p. 639–645.
References
Bayesian Inference in Statistical Analysis, by Box & Tiao, especially the aphorism
about point estimates (relevant, e.g., to quoted accuracy rates of screening
tests): “To the idea that people like to have a single number we answer that
usually they shouldn’t get it,” p. 310.
Statistical Rethinking, by Richard McElreath, especially the vampire example in
Chapter 3, “Sampling the Imaginary.”
Inevitable Illusions: How mistakes of reason rule our minds, by Massimo
Piattelli-Palmarini, especially Chapters 6, “The Fallacy of Near Certainty”
(Bayes’ rule is required to reason about screening tests, and native intuitions
tend to be poor) and 7, “The Seven Deadly Sins” (e.g., overconfidence increases
more than prediction accuracy for experts, anchoring works, and untrained
statistical intuitions tend to be wrong).
References
Michael A. Ribers & Hannes Ullrich, “Complementarities between algorithmic and
human decision-making: The case of antibiotic prescribing,” Quantitative
Marketing and Economics, 2024.
Harding Center for Risk Literacy, “Early detection of breast cancer by
mammography screening,” Fact Box,
https://www.hardingcenter.de/en/transfer-and-impact/fact-boxes/early-detect
ion-of-cancer/early-detection-of-breast-cancer-by-mammography-screening.
“Risk stratification in breast screening workshop,” Andrew Anderson, Cristina
Visintin, Antonis Antoniou, Nora Pashayan, Fiona J. Gilbert, Allan Hackshaw,
Rikesh Bhatt, Harry Hill, Stuart Wright, Katherine Payne, Gabriel Rogers, Bethany Shinkins, Sian Taylor-Phillips & Rosalind Given-Wilson, BMC Proceedings, Vol. 18, No. 22, 2024.
References
Overdiagnosed: Making People Sick in The Pursuit of Health, H.
Gilbert Welch, Lisa M. Schwartz, and Steven Woloshin (MDs;
Beacon Press, 2011).
“Quantifying Biases in Causal Models: Classical Confounding vs
Collider-Stratification Bias,” Sander Greenland, Epidemiology
14(3):p 300-306, May 2003.
“Causal Diagrams,” by M. Maria Glymour and Sander Greenland,
Chapter 12.