Ontology highlight
ABSTRACT: Objective
To develop machine learning models for classifying the severity of opioid overdose events from clinical data.Materials and methods
Opioid overdoses were identified by diagnoses codes from the Marshfield Clinic population and assigned a severity score via chart review to form a gold standard set of labels. Three primary feature sets were constructed from disparate data sources surrounding each event and used to train machine learning models for phenotyping.Results
Random forest and penalized logistic regression models gave the best performance with cross-validated mean areas under the ROC curves (AUCs) for all severity classes of 0.893 and 0.882 respectively. Features derived from a common data model outperformed features collected from disparate data sources for the same cohort of patients (AUCs 0.893 versus 0.837, p value = 0.002). The addition of features extracted from free text to machine learning models also increased AUCs from 0.827 to 0.893 (p value < 0.0001). Key word features extracted using natural language processing (NLP) such as 'Narcan' and 'Endotracheal Tube' are important for classifying overdose event severity.Conclusion
Random forest models using features derived from a common data model and free text can be effective for classifying opioid overdose events.
SUBMITTER: Badger J
PROVIDER: S-EPMC6622451 | biostudies-literature | 2019 Jun
REPOSITORIES: biostudies-literature
Badger Jonathan J LaRose Eric E Mayer John J Bashiri Fereshteh F Page David D Peissig Peggy P
Journal of biomedical informatics 20190425
<h4>Objective</h4>To develop machine learning models for classifying the severity of opioid overdose events from clinical data.<h4>Materials and methods</h4>Opioid overdoses were identified by diagnoses codes from the Marshfield Clinic population and assigned a severity score via chart review to form a gold standard set of labels. Three primary feature sets were constructed from disparate data sources surrounding each event and used to train machine learning models for phenotyping.<h4>Results</h ...[more]