
Dataset Information


Predicting outcomes of chronic kidney disease from EMR data based on Random Forest Regression.


ABSTRACT: Chronic kidney disease (CKD) is prevalent across the world, and kidney function is well characterized by the estimated glomerular filtration rate (eGFR). The progression of kidney disease can be predicted if future eGFR can be accurately estimated using predictive analytics. In this study, we developed and validated a prediction model of eGFR using data extracted from a regional health system. The dataset includes demographic, clinical, and laboratory information from primary care clinics. The model was built using Random Forest regression and evaluated using goodness-of-fit statistics and discrimination metrics. After data preprocessing, the patient cohort for model development and validation contained 61,740 patients. The final model included eGFR, age, gender, body mass index (BMI), obesity, hypertension, and diabetes, and achieved a mean coefficient of determination of 0.95. The estimated eGFRs were used to classify patients into CKD stages with high macro-averaged and micro-averaged metrics. In conclusion, a model using real-world electronic medical records (EMR) data can accurately predict future kidney function and provide clinical decision support.
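The evaluation steps named in the abstract (coefficient of determination, then staging with macro- and micro-averaged metrics) can be sketched as follows. This is a minimal illustration, not the authors' code: the stage cut-offs are the commonly used five-stage eGFR thresholds and are an assumption here, since the record does not state which staging scheme was applied; the function names and the sample values are hypothetical, and precision stands in for whichever classification metrics the study reported.

```python
from collections import Counter

# Assumed cut-offs in mL/min/1.73 m^2 (common five-stage convention);
# the record does not say which staging scheme the study used.
STAGE_LOWER_BOUNDS = [(90.0, 1), (60.0, 2), (30.0, 3), (15.0, 4)]


def ckd_stage(egfr):
    """Map an eGFR value to a CKD stage (1-5) under the assumed cut-offs."""
    for lower, stage in STAGE_LOWER_BOUNDS:
        if egfr >= lower:
            return stage
    return 5


def r_squared(observed, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot


def precision_scores(true_stages, pred_stages):
    """Return (macro, micro) precision over stages seen in either list."""
    tp, fp = Counter(), Counter()
    for t, p in zip(true_stages, pred_stages):
        if t == p:
            tp[p] += 1
        else:
            fp[p] += 1
    labels = sorted(set(true_stages) | set(pred_stages))
    # Macro: unweighted mean of per-stage precision (0.0 if never predicted).
    per_class = [
        tp[c] / (tp[c] + fp[c]) if (tp[c] + fp[c]) else 0.0 for c in labels
    ]
    macro = sum(per_class) / len(per_class)
    # Micro: pool all predictions before dividing.
    micro = sum(tp.values()) / (sum(tp.values()) + sum(fp.values()))
    return macro, micro


# Made-up eGFR values for illustration only; not data from the study.
observed = [95.0, 72.0, 58.0, 41.0, 22.0, 10.0]
predicted = [92.0, 75.0, 61.0, 40.0, 25.0, 12.0]

r2 = r_squared(observed, predicted)
true_stages = [ckd_stage(v) for v in observed]
pred_stages = [ckd_stage(v) for v in predicted]
macro_p, micro_p = precision_scores(true_stages, pred_stages)
```

Macro-averaging weights every stage equally, so errors in rare late stages count as much as errors in common early stages; micro-averaging pools all patients and is dominated by the most frequent stages. Reporting both, as the abstract does, guards against a model that looks accurate only because most patients fall in one stage.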

SUBMITTER: Zhao J 

PROVIDER: S-EPMC6435377 | biostudies-literature | 2019 Apr

REPOSITORIES: biostudies-literature


Publications

Predicting outcomes of chronic kidney disease from EMR data based on Random Forest Regression.

Zhao J, Gu S, McDermaid A

Mathematical Biosciences, 2019-02-12



Similar Datasets

| S-EPMC3163175 | biostudies-literature
| S-EPMC5852600 | biostudies-literature
| S-EPMC8373264 | biostudies-literature
| S-EPMC5099106 | biostudies-literature
| S-EPMC7099795 | biostudies-literature
| S-EPMC5431489 | biostudies-literature
| S-EPMC5537639 | biostudies-literature
| S-EPMC6889672 | biostudies-literature
| S-EPMC8427975 | biostudies-literature
| S-EPMC4004351 | biostudies-literature