D13
Hongyun Wu
Pin Jui Chen
Sherry Chi
Shuhei Fujimoto
PLN
City of Mesa
911 Process Improvement
Background/Process
Business Problem
Client’s Goal
The client’s goal is to enable the 911 dispatcher to identify more behavioral patients and dispatch the behavioral unit more frequently so the behavioral unit can assess and send these patients quickly to behavioral facilities.
ASU’s Objectives
Datasets
Resource Description | Location | Access | Need |
ePCR (Details of 911 calls, patient information, treatment etc.) CSV, approx. 1000 MB 10 tables in total (derived from database) | Google Drive
| No access limits (Client provided downloaded data) | All the time |
Preprocessing
EDA
EDA
We compiled a list of indicators and contra-indicators that will later on help us build our predictive models. “+” means that an existence of this characteristics indicates the patient might be behavioral while “-” means the existence indicates non-behavioral.
Medical history | Impression | Medication |
Depression (+) Anxiety (+) Bipolar (+) Schizophrenia (+) Suicide attempts (+) Hypertension (-) Asthma (-) Copd (-) Seizure (-) Stroke (-) | Behavioral/Psychological disorder (+) Suicide ideation (+) Any sort of injury (-)* Any sort of pain (-)* Difficulty breathing (respiratory distress, acute) (-) *Any impression value that has "injury" or "pain" in it was grouped in these two contraindications. | Trazodone (+) Seroquel (+) Gabapentin (+) Lisinopril (-) Metoprolol (-) Atorvastatin (-) Amlodipine (-) Metformin (-) Albuterol (-) |
Data Visualization
This bar chart represents the counts of each medical history among known behavioral patients.
* duplications due to joining different tables together
Data Visualization
Most patients don’t have any reported medications, but we included the next three medications (which are mostly antidepressant)
Data Visualization
The three most common impression of behavioral patients.
We ended up only including the top 2: Behavioral/psychiatric disorder and suicidal ideation.
Data Visualization
The most common inputs of primary complaint as well as secondary complaint for non-behavioral patients correspond with the contra-indicators that we indentified in the previous list
Model Performance
After EDA, we cleaned the data by removing records that have null values (might indicates no patients found, hence empty records for medical history, impression, and such) and created new binary columns for all the indicators/contraindications in the list.
We included all the identified indicators/contraindications as our predictors for the model, and duplicated the number of known behavioral records to be as many as non-behavioral records. We tried Logistic Regression and Random Forest, and we achieved the best result with Logistic Regression with a recall score of 0.78.
Next Steps
Thank you!
Questions?