Unknown

Dataset Information

0

Development and Validation of an Algorithm to Identify Nonalcoholic Fatty Liver Disease in the Electronic Medical Record.


ABSTRACT: Nonalcoholic fatty liver disease (NAFLD) is the most common cause of chronic liver disease worldwide. Risk factors for NAFLD disease progression and liver-related outcomes remain incompletely understood due to the lack of computational identification methods. The present study sought to design a classification algorithm for NAFLD within the electronic medical record (EMR) for the development of large-scale longitudinal cohorts.We implemented feature selection using logistic regression with adaptive LASSO. A training set of 620 patients was randomly selected from the Research Patient Data Registry at Partners Healthcare. To assess a true diagnosis for NAFLD we performed chart reviews and considered either a documentation of a biopsy or a clinical diagnosis of NAFLD. We included in our model variables laboratory measurements, diagnosis codes, and concepts extracted from medical notes. Variables with P < 0.05 were included in the multivariable analysis.The NAFLD classification algorithm included number of natural language mentions of NAFLD in the EMR, lifetime number of ICD-9 codes for NAFLD, and triglyceride level. This classification algorithm was superior to an algorithm using ICD-9 data alone with AUC of 0.85 versus 0.75 (P < 0.0001) and leads to the creation of a new independent cohort of 8458 individuals with a high probability for NAFLD.The NAFLD classification algorithm is superior to ICD-9 billing data alone. This approach is simple to develop, deploy, and can be applied across different institutions to create EMR-based cohorts of individuals with NAFLD.

SUBMITTER: Corey KE 

PROVIDER: S-EPMC4761309 | biostudies-literature | 2016 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Development and Validation of an Algorithm to Identify Nonalcoholic Fatty Liver Disease in the Electronic Medical Record.

Corey Kathleen E KE   Kartoun Uri U   Zheng Hui H   Shaw Stanley Y SY  

Digestive diseases and sciences 20151104 3


<h4>Background and aims</h4>Nonalcoholic fatty liver disease (NAFLD) is the most common cause of chronic liver disease worldwide. Risk factors for NAFLD disease progression and liver-related outcomes remain incompletely understood due to the lack of computational identification methods. The present study sought to design a classification algorithm for NAFLD within the electronic medical record (EMR) for the development of large-scale longitudinal cohorts.<h4>Methods</h4>We implemented feature se  ...[more]

Similar Datasets

| S-EPMC7593991 | biostudies-literature
| S-EPMC9145474 | biostudies-literature
| S-EPMC8487619 | biostudies-literature
| S-EPMC6692049 | biostudies-literature
| S-EPMC9685512 | biostudies-literature
| S-EPMC6699164 | biostudies-literature
| S-EPMC10500930 | biostudies-literature
| S-EPMC4147111 | biostudies-literature
| S-EPMC4966969 | biostudies-literature
| S-EPMC7735897 | biostudies-literature