Fighting Bias with Bias
Challenges and Opportunities for Artificial Intelligence in Healthcare
Keith Harrigian
Johns Hopkins University
About Me
In the realm of healthcare, artificial intelligence serves as a powerful antidote to bias, paving the way for a future where every individual receives unbiased and equal treatment.
– ChatGPT
Transformative AI is Here: Now What?
Rapid Progress of AI
Proceed With Caution
“Can LLMs like GPT-4 outperform traditional AI tools in dementia diagnosis? Maybe, but not today.” Wang et al. arXiv. 2023.
Agenda
Review
AI, Bias, and
Healthcare
Case Study
Characterizing Stigmatizing Language in Medical Records
Open Dialogue
Bringing AI to the
Alzheimer’s Association
Review
AI, Bias, and Healthcare
Terminology
Systematic error in the outcome of a study due to dataset curation or modeling decisions
Examples
Human-leveled prejudices and predispositions regarding groups, attributes, or circumstances
Examples
Statistical Bias
Social Bias
Sources of Bias
“Biases in AI Systems.” Srinivasan and Chander. Communications of the ACM. 2021.
Distribution Shift
What Happens
Possible Solutions
Example: Language models trained on out-of-distribution data require adaptation to their target distribution.
“An Eye on Clinical BERT: Investigating Language Model Generalization for Diabetic Eye Disease Phenotyping.” Harrigian et al. Under Review. 2023.
Distribution Shift
Example: Words which were frequently used by individuals with depression started to be used by the general population after the beginning of COVID-19 to reflect pandemic-specific phenomena.
Term | 2019 Embedding Neighborhood | 2020 Embedding Neighborhood |
Panic | Emotion (i.e., Fear) rage, meltdown, anxiety, anger, barrage, migraine, phobia, outrage, manic, rush, asthma | Panic Buying, Misinformation hysteria, chaos, fear, misinformation, confusion, frenzy, paranoia, mayhem, insanity, fearmongering |
Cuts | Physical cut, jumps, runs, cutting, pulls, moves, bounces, falls, turns, burns, drags, dips, breaks, bursts, rips, goes, bumps | Economic cut, cutting, subsidies, budgets, deductions, revenues, checks, payments, breaks, deals, figures, loans, deposits, gains |
Isolated | Feeling Detached unpleasant, unstable, detached, unsafe, populated, invasive, unknown, confined, endangered, absent, vulnerable | Quarantine quarantined, isolating, separated, enclosed, insulated, infectious, confined, active, populated, autonomous, vulnerable, detached |
Strain | Discomfort/Pressure inflammation, deficiency, dose, stress, pressure, calcium, medication, concentration, tissue, nausea, receptors, doses | Virus disease, illness, infections, symptom, mutation, virus, outbreak, pneumonia, infection, strains, influenza, epidemic |
Vulnerable | Emotion susceptible, dangerous, prone, unstable, aggressive, hostile, disruptive, detrimental, receptive, fragile, damaging | At-risk Populations susceptible, dangerous, immunocompromised, infectious, isolating, elderly, disadvantaged, contagious, tolerant, likely, isolated |
“The Problem of Semantic Shift in Longitudinal Monitoring of Social Media.” Harrigian et al. WebSci. 2022.
Group Imbalance
What Happens
Possible Solutions
Example: A Logistic Regression classifier trained using ERM compromises minority group performance in favor of increasing majority group performance
Spurious Correlations
What Happens
Possible Solutions
Hospital Bed
Vitals
Mortality
Example: Models will learn non-causal relationships between spurious (unstable) attributes and outcomes.
A
X
Y
The State of AI Bias Research
Defensive Tactics
Offensive Tactics
Improved Health Equity
Measure, identify, and protect against social and statistical bias in algorithmic healthcare tools
Measure, identify, and address instances of social bias in our healthcare system
Case Study
Characterizing Stigmatizing Language in Medical Records
Collaborators
Aya Zirikly
Brant Chee
Yahan Li
Mark Dredze
Anne R. Links
Alya Ahamad
Somnath Saha
Mary Catherine Beach
Problem Context
Black patients are significantly more likely than white patients to experience discrimination in the healthcare system (12.3% vs. 2.3%)
Patients who experience discrimination have:
Healthcare providers who read notes containing stigmatizing language are more likely to formulate a less aggressive treatment plan
21st Century Cures Act mandates EHRs are readily available to all patients
Stigmatizing Language
Stigmatizing language assigns negative labels, stereotypes, and judgment to certain groups of people.
Often recognized in discussion regarding mental health and addiction
More generally, stigmatizing language reflects an implicit bias
Stigmatizing Language Taxonomy
Class | Definition | Examples |
Disbelief | Insinuates doubt about a patient’s stated testimony. | adamant he doesn’t smoke; claims to see a therapist |
Difficult | Describes patient perspective as inflexible/difficult/entrenched, typically with respect to their intentions. | insists on being admitted; adamantly opposed to limiting fruit intake |
Exclude | Word/phrase is not used to characterize the patient or describe the patient’s behavior; may refer to medical condition or treatment or to another person or context. | patient’s friend insisted she go to the hospital; test claims submitted to insurance |
Task: Credibility and Obstinacy
Stigmatizing Language Taxonomy
Class | Definition | Examples |
Negative | Patient not, unlikely to, or questionably following medical advice | adherence to therapeutic medication is unclear; mother declines vaccines; struggles with medication and follow-up compliance |
Neutral | Not used to describe whether the patient is not following medical advice or rejecting treatment; often used to describe generically some future plan involving a hypothetical. | discussed medication compliance; school refuses to provide adequate accommodations; feels that her parents’ health has declined |
Positive | Patient following medical advice. | continues to be compliant with aspirin regimen; reports excellent adherence |
Task: Compliance
Stigmatizing Language Taxonomy
Class | Definition | Examples |
Negative | Patient’s demeanor cast in a negative light; insinuates the patients is not being forthright | concern for secondary gain; unwilling to meet with case manager |
Neutral | Negation of negative descriptors; insinuates the patient was expected to have a negative demeanor. | not combative or belligerent; dad seems angry with patient at times |
Positive | Patient’s demeanor or behavior is described in a positive light; patient is easy to interact with. | lovely 80 year old woman; well-groomed and holds good eye contact |
Exclude | Patient self-description or description of another individual. | does not want providers to think she’s malingering; reports feeling angry |
Task: Descriptors
Overview of System Structure
| | |
| | |
| | |
| | |
| | |
Stigma
Labels
Machine Learning Classifier
Anchor
Extraction
Clinical
Notes
“Despite my best advice, the patient remains adamant about leaving the hospital today. Social services is aware of the situation.”
adamant
Disbelief |
Exclude |
Difficult |
my best advice the patient remains adamant about leaving the hospital today social
Data
Johns Hopkins University (Private)
English-language progress notes
5 clinical specialties are represented – internal medicine, emergency medicine, pediatrics, OB-GYN, and general surgery (Baltimore, MD)
5,201 labeled instances
MIMIC-IV (Public)
De-identified, English discharge notes
Patients admitted to emergency department or an intensive care unit at Beth Israel Deaconess Medical Center (Boston, MA)
5,043 labeled instances
Model Performance and Keyword Grounding Limitation
Figure 2: Projection of embeddings for a subset of keywords. Labels cluster globally, but keywords cluster locally.
Figure 1: Model accuracy in the Credibility task. BERT models maximize performance at cost of interpretability.
Exclude
Negative
Neutral
Domain Transfer Performance
What happened?
Figure 3: Macro F1 Score when training and testing on different distributions. There exists a consistent loss in performance when transferring between datasets.
Recap of Biases
System Design
Sample Selection
The Opportunities Ahead
“… there is a suspicion that the patient is not adhering to their medication regimen consistently.”
“Characterization of Stigmatizing Language in Medical Records” Harrigian et al. ACL. 2023.
Open Dialogue
Bringing AI to the
Alzheimer’s Association
Areas of Discussion
Thank you
Email: kharrigian@jhu.edu
Learn More: kharrigian.github.io