BDA 2021
1
Neural Language Modeling of Unstructured Clinical Notes for Automated Patient Phenotyping
Akshara Prabhakar*, Shidharth Srinivasan*, Sowmya Kamath
Healthcare Analytics & Language Engineering (HALE) Lab, Department of Information Technology,
National Institute of Technology Karnataka, Surathkal
o r
Presented by
Shidharth S
56th Annual Conference on Information Sciences and Systems
(CISS 2022)
Organized by
Princeton University
In Online mode
March 9-11, 2022
Outline
2
Introduction
3
Introduction
4
Problem Statement
Neural Language Modeling of Unstructured Clinical Notes for Automated Patient Phenotyping
Contributions
5
Previous Works
6
Work | Data source | Approach |
Gehrmann et al. | Discharge summary notes | Convolution neural network to identify patient phenotypes |
ws-CNN, Yang et al. | Discharge summary notes | CNN with three different filter sizes with a combination of word and sentence level embeddings |
ClinicalBERT, Huang et al. | All notes from MIMIC III | Pre-trained BERT on clinical notes and fine-tune the network for predicting hospital readmissions at various time points. |
ClinicalBERT based fmean, Mulyar et al. | N2C2 2008 | Divide the entire clinical document into chunks and use various approaches to combine important information from them using the sequence of CLS tokens, with the best performance obtained when a mean is taken |
7
Modules
Encoder Module
Cross & Self Attention Module
8
Class Imbalance
class weighta = Pb/(Pb + Pa)
class weightb = Pa/(Pb + Pa)
9
Experiments and Results
10
Experiments and Results
11
Most existing works utilize only a subset of the patients from the MIMIC-III dataset termed ``frequent flyers" >= 3 ICU visits in a year, as introduced by Gehrmann et al. It contains the discharge summary of 1,610 patients. We refer to this subset as D_sub.
Experiments and Results
12
Conclusion
13
Future Work
14
References
15
Thank you
Questions?
16