Dataset Information

Machine Learning Techniques for Prediction of Early Childhood Obesity.

ABSTRACT:

Objectives

This paper aims to predict childhood obesity after age two, using only data collected prior to the second birthday by a clinical decision support system called CHICA.

Methods

Analyses of six machine learning methods (RandomTree, RandomForest, J48, ID3, Naïve Bayes, and Bayes) trained on CHICA data show that an accurate, sensitive model can be created.
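For readers unfamiliar with ID3, one of the six methods listed above, the following is a minimal sketch of its core step: choosing the attribute with the highest information gain. It assumes categorical attributes and a binary label. The attribute names (`attr_a`, `attr_b`), the label key, and the toy records are hypothetical and are not taken from the CHICA data or from the study's implementation.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    counts = Counter(labels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def information_gain(rows, attribute, label_key):
    """Entropy reduction obtained by splitting rows on one attribute."""
    base = entropy([row[label_key] for row in rows])
    # Group the labels by the value each row takes for the attribute.
    groups = {}
    for row in rows:
        groups.setdefault(row[attribute], []).append(row[label_key])
    # Weighted average entropy of the groups after the split.
    remainder = sum(len(g) / len(rows) * entropy(g) for g in groups.values())
    return base - remainder


def best_attribute(rows, attributes, label_key):
    """ID3 picks the attribute with the largest information gain."""
    return max(attributes, key=lambda a: information_gain(rows, a, label_key))


# Hypothetical toy records, for illustration only.
TOY_ROWS = [
    {"attr_a": "yes", "attr_b": "x", "label": "obese"},
    {"attr_a": "yes", "attr_b": "y", "label": "obese"},
    {"attr_a": "no", "attr_b": "x", "label": "not_obese"},
    {"attr_a": "no", "attr_b": "y", "label": "not_obese"},
]

if __name__ == "__main__":
    print(best_attribute(TOY_ROWS, ["attr_a", "attr_b"], "label"))
```

In this toy set `attr_a` separates the two classes perfectly while `attr_b` carries no information, so ID3 would split on `attr_a` first. A full ID3 tree repeats this choice recursively on each subset.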

Results

Of the methods analyzed, the ID3 model trained on the CHICA dataset showed the best overall performance, with an accuracy of 85% and a sensitivity of 89%. Additionally, the ID3 model had a positive predictive value of 84% and a negative predictive value of 88%. The structure of the tree also gives insight into the strongest predictors of future obesity in children. Many of the strongest predictors seen in the ID3 modeling of the CHICA dataset have been independently validated in the literature as correlated with obesity, thereby supporting the validity of the model.
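The four metrics reported above follow from the counts of a binary confusion matrix. The sketch below shows the standard definitions. The function name and the example counts are hypothetical and chosen only for easy arithmetic; they are not the study's confusion matrix.

```python
def classification_metrics(tp, fp, tn, fn):
    """Return accuracy, sensitivity, PPV, and NPV from confusion-matrix counts.

    tp: obese children predicted obese (true positives)
    fp: non-obese children predicted obese (false positives)
    tn: non-obese children predicted non-obese (true negatives)
    fn: obese children predicted non-obese (false negatives)
    """
    def ratio(numerator, denominator):
        if denominator == 0:
            raise ValueError("metric undefined: denominator is zero")
        return numerator / denominator

    return {
        "accuracy": ratio(tp + tn, tp + fp + tn + fn),
        "sensitivity": ratio(tp, tp + fn),  # share of actual positives found
        "ppv": ratio(tp, tp + fp),          # share of positive calls that are right
        "npv": ratio(tn, tn + fn),          # share of negative calls that are right
    }


if __name__ == "__main__":
    # Hypothetical counts, for illustration only.
    for name, value in classification_metrics(tp=30, fp=20, tn=40, fn=10).items():
        print(f"{name}: {value:.2f}")
```

Sensitivity and the predictive values answer different questions: sensitivity is conditioned on the true outcome, while PPV and NPV are conditioned on the prediction and therefore depend on how common obesity is in the evaluated population.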

Conclusions

This study demonstrated that data from a production clinical decision support system can be used to build an accurate machine learning model to predict obesity in children after age two.

SUBMITTER: Dugan TM

PROVIDER: S-EPMC4586339 | biostudies-literature | 2015

REPOSITORIES: biostudies-literature

Publications

Machine Learning Techniques for Prediction of Early Childhood Obesity.

Dugan TM, Mukhopadhyay S, Carroll A, Downs S

Applied Clinical Informatics, 2015-08-12, issue 3

PMID: 26448795

Similar Datasets

Project description:

Motivation

The sudden progression of patients with novel coronavirus disease 2019 (COVID-19) to critical illness is a matter of great concern. Early identification and effective triaging of patients at high risk of developing critical illness upon admission can aid in improving patient care, increasing the cure rate, and mitigating the burden on the medical care system. This study proposed and extended classical least absolute shrinkage and selection operator (LASSO) logistic regression to objectively identify clinical determinants and risk factors for the early identification of patients at high risk of progression to critical illness at the time of hospital admission.

Methods

In this retrospective multicenter study, data of 1,929 patients with COVID-19 were assessed. The association between laboratory characteristics measured at admission and critical illness was screened with logistic regression. LASSO logistic regression was utilized to construct predictive models for estimating the risk that a patient with COVID-19 will develop a critical illness.

Results

The development cohort consisted of 1,363 patients with COVID-19, of whom 133 (9.7%) developed critical illness. Univariate logistic regression analysis revealed that 28 variables were prognostic factors for critical illness in COVID-19 (p < 0.05). Elevated CK-MB, neutrophils, PCT, α-HBDH, D-dimer, LDH, glucose, PT, APTT, RDW (SD and CV), fibrinogen, and AST were predictors for the early identification of patients at high risk of progression to critical illness. Lymphopenia, a low rate of basophils, eosinophils, thrombopenia, red blood cell, hematocrit, hemoglobin concentration, blood platelet count, and decreased levels of K, Na, albumin, albumin to globulin ratio, and uric acid were clinical determinants associated with the development of critical illness at the time of hospital admission. The risk score accurately predicted critical illness in the development cohort [area under the curve (AUC) = 0.83, 95% CI: 0.78-0.86] and in the external validation cohort (n = 566, AUC = 0.84).

Conclusion

A risk prediction model based on laboratory findings of patients with COVID-19 was developed for the early identification of patients at high risk of progression to critical illness. This cohort study identified 28 indicators associated with critical illness in patients with COVID-19. The risk model might help treat critical illness as early as possible and allow for optimized use of medical resources.
