Dataset Information

Prediction model development of late-onset preeclampsia using machine learning-based methods.

ABSTRACT: Preeclampsia is one of the leading causes of maternal and fetal morbidity and mortality. Because effective preventive measures are lacking, its prediction is essential to its prompt management. This study aimed to develop machine learning models that predict late-onset preeclampsia from hospital electronic medical record data. The performance of the machine learning-based models was also compared with that of models built with conventional statistical methods. A total of 11,006 pregnant women who received antenatal care at Yonsei University Hospital were included. Maternal data recorded from the early second trimester to 34 weeks were retrieved from electronic medical records. The prediction outcome was the occurrence of late-onset preeclampsia after 34 weeks' gestation. Pattern recognition and cluster analysis were used to select the parameters included in the prediction models. Logistic regression, decision tree model, naïve Bayes classification, support vector machine, random forest algorithm, and stochastic gradient boosting method were used to construct the prediction models. The C-statistic was used to assess the performance of each model. The overall preeclampsia development rate was 4.7% (474 patients). Systolic blood pressure, serum blood urea nitrogen and creatinine levels, platelet counts, serum potassium level, white blood cell count, serum calcium level, and urinary protein were the most influential variables included in the prediction models. C-statistics for the decision tree model, naïve Bayes classification, support vector machine, random forest algorithm, stochastic gradient boosting method, and logistic regression models were 0.857, 0.776, 0.573, 0.894, 0.924, and 0.806, respectively. The stochastic gradient boosting model had the best prediction performance, with an accuracy of 0.973 and a false positive rate of 0.009.
The combined use of maternal factors and common antenatal laboratory data from the early second trimester through the early third trimester could effectively predict late-onset preeclampsia using machine learning algorithms. Future prospective studies are needed to verify the clinical applicability of these algorithms.
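The abstract reports three kinds of numbers for each model: a C-statistic, an accuracy, and a false positive rate. As a rough illustration of how such figures are typically computed, the sketch below uses only the Python standard library. The function names, the toy labels and scores, and the 0.5 decision threshold are assumptions made for this example; they are not taken from the study.

```python
from typing import Sequence, Tuple


def c_statistic(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Fraction of (positive, negative) pairs in which the positive case
    receives the higher score; ties count as half a concordant pair.
    This equals the area under the ROC curve."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative case")
    concordant = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                concordant += 1.0
            elif p == n:
                concordant += 0.5
    return concordant / (len(positives) * len(negatives))


def accuracy_and_fpr(
    labels: Sequence[int], scores: Sequence[float], threshold: float = 0.5
) -> Tuple[float, float]:
    """Accuracy and false positive rate after thresholding the scores.
    The threshold of 0.5 is an assumption for illustration only."""
    predicted = [1 if s >= threshold else 0 for s in scores]
    correct = sum(1 for y, p in zip(labels, predicted) if y == p)
    false_pos = sum(1 for y, p in zip(labels, predicted) if y == 0 and p == 1)
    true_neg = sum(1 for y, p in zip(labels, predicted) if y == 0 and p == 0)
    if false_pos + true_neg == 0:
        raise ValueError("need at least one negative case")
    return correct / len(labels), false_pos / (false_pos + true_neg)


# Hypothetical toy data: 1 = developed the outcome, 0 = did not.
toy_labels = [0, 0, 0, 0, 1, 1]
toy_scores = [0.1, 0.2, 0.6, 0.3, 0.7, 0.4]

toy_c = c_statistic(toy_labels, toy_scores)
toy_accuracy, toy_fpr = accuracy_and_fpr(toy_labels, toy_scores)
```

The C-statistic depends only on how the scores rank cases, so it does not change with the decision threshold, whereas accuracy and false positive rate do. With an outcome rate of 4.7%, a model that predicted no preeclampsia for every patient would already be about 95.3% accurate, which is one reason a rank-based measure is useful alongside accuracy.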

SUBMITTER: Jhee JH

PROVIDER: S-EPMC6707607 | biostudies-literature | 2019

REPOSITORIES: biostudies-literature


Publications

Prediction model development of late-onset preeclampsia using machine learning-based methods.

Jhee Jong Hyun, Lee SungHee, Park Yejin, Lee Sang Eun, Kim Young Ah, Kang Shin-Wook, Kwon Ja-Young, Park Jung Tak

PLoS One, 2019-08-23, issue 8


PMID: 31442238

Similar Datasets

Project description:

Background: Early onset preeclampsia (eoPE) is a hypertensive disorder of pregnancy with endothelial dysfunction manifested before 34 weeks where expectant management is usually attempted. However, the timing of hospitalization, corticosteroids, and delivery remain a challenge. We aim to develop a prediction model using machine-learning tools for the need for delivery within 7 days of diagnosis (model D) and the risk of developing hemolysis, elevated liver enzymes, and low platelets (HELLP) syndrome or abruptio placentae (model HA).

Materials and methods: A retrospective cohort of singleton pregnancies with eoPE and attempted expectant management between 2014 and 2020. A Mono-objective Genetic Algorithm based on supervised classification models was implemented to develop D and HA models. Maternal basal characteristics and data gathered during eoPE diagnosis: gestational age, blood pressure, platelets, creatinine, transaminases, angiogenesis biomarkers (soluble fms-like tyrosine kinase-1, placental growth factor), and ultrasound data were pooled for analysis. The most relevant variables were selected by bio-inspired algorithms. We developed basal models that solely included demographic characteristics of the patient (D1, HA1), and advanced models adding information available at diagnosis of eoPE (D2, HA2).

Results: We evaluated 215 eoPE cases and 47.9% required delivery within 7 days. The median time-to-delivery was 8 days. Basal models were better predicted by K-nearest-neighbor in D1, which had a diagnostic precision of 0.68 ± 0.09, with 63.6% sensitivity (Sn), 71.4% specificity (Sp), 70% positive predictive value (PPV), and 65.2% negative predictive value (NPV) using 13 variables and HA1 of 0.77 ± 0.09, 60.4% Sn, 80% Sp, 50% PPV, and 87.9% NPV. Models at diagnosis were better developed by support vector machine (SVM) using 18 variables, where D2's precision improved to 0.79 ± 0.05 with 77.3% Sn, 80.1% Sp, 81.5% PPV, and 76.2% NPV, and HA2 had a precision of 0.79 ± 0.08 with 66.7% Sn, 82.8% Sp, 51.6% PPV, and 90.3% NPV.

Conclusion: At the time of diagnosis of eoPE, SVM with evolutionary feature selection process provides good predictive information of the need for delivery within 7 days and development of HELLP/abruptio placentae, using maternal characteristics and markers that can be obtained routinely. This information could be of value when assessing hospitalization and timing of antenatal corticosteroid administration.
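The description above summarizes each model by sensitivity (Sn), specificity (Sp), positive predictive value (PPV), and negative predictive value (NPV). A minimal sketch of how these four quantities follow from confusion-matrix counts is given below. The function name and the example counts are hypothetical and are not drawn from the study.

```python
from typing import Dict


def diagnostic_metrics(tp: int, fn: int, fp: int, tn: int) -> Dict[str, float]:
    """Derive Sn, Sp, PPV, and NPV from confusion-matrix counts.

    tp/fn: cases with the outcome that were predicted positive/negative.
    fp/tn: cases without the outcome that were predicted positive/negative.
    """
    if min(tp + fn, tn + fp, tp + fp, tn + fn) == 0:
        raise ValueError("each metric needs a non-zero denominator")
    return {
        "sensitivity": tp / (tp + fn),  # share of true cases that were detected
        "specificity": tn / (tn + fp),  # share of non-cases correctly cleared
        "ppv": tp / (tp + fp),          # share of positive calls that were right
        "npv": tn / (tn + fn),          # share of negative calls that were right
    }


# Hypothetical counts chosen only to make the arithmetic easy to follow.
example = diagnostic_metrics(tp=30, fn=10, fp=5, tn=55)
```

Sensitivity and specificity describe the classifier itself, while PPV and NPV also depend on how common the outcome is in the evaluated cohort, so the latter two would not necessarily carry over to a population with a different event rate.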

Project description:

Objectives: Prediction of late-onset sepsis (onset beyond day 3 of life) in preterm infants, based on multiple patient monitoring signals 24 hours before onset.

Design: Continuous high-resolution electrocardiogram and respiration (chest impedance) data from the monitoring signals were extracted and used to create time-interval features representing heart rate variability, respiration, and body motion. For each infant with a blood culture-proven late-onset sepsis, a Cultures, Resuscitation, and Antibiotics Started Here moment was defined. The Cultures, Resuscitation, and Antibiotics Started Here moment served as an anchor point for the prediction analysis. In the group with controls (C), an "equivalent crash moment" was calculated as anchor point, based on comparable gestational and postnatal age. Three common machine learning approaches (logistic regressor, naive Bayes, and nearest mean classifier) were used to binary classify samples of late-onset sepsis from C. For training and evaluation of the three classifiers, a leave-k-subjects-out cross-validation was used.

Setting: Level III neonatal ICU.

Patients: The patient population consisted of 32 premature infants with sepsis and 32 age-matched control patients.

Interventions: No interventions were performed.

Measurements and main results: For the interval features representing heart rate variability, respiration, and body motion, differences between late-onset sepsis and C were visible up to 5 hours preceding the Cultures, Resuscitation, and Antibiotics Started Here moment. Using a combination of all features, classification of late-onset sepsis and C showed a mean accuracy of 0.79 ± 0.12 and mean precision rate of 0.82 ± 0.18 3 hours before the onset of sepsis.

Conclusions: Information from routine patient monitoring can be used to predict sepsis. Specifically, this study shows that a combination of electrocardiogram-based, respiration-based, and motion-based features enables the prediction of late-onset sepsis hours before the clinical crash moment.
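The design above mentions leave-k-subjects-out cross-validation. The sketch below shows one common way such a split can be generated, so that all samples from a given subject land on the same side of each fold. The description does not state the value of k or how subjects were grouped, so the function name, the non-overlapping grouping in first-seen order, and the example subject IDs are all assumptions for illustration.

```python
from typing import Iterator, List, Sequence, Tuple


def leave_k_subjects_out(
    subject_ids: Sequence[str], k: int
) -> Iterator[Tuple[List[int], List[int]]]:
    """Yield (train_indices, test_indices) pairs, holding out k subjects per fold.

    Every sample of a held-out subject goes to the test side, so no subject
    contributes to both training and evaluation within a fold.
    """
    if k < 1:
        raise ValueError("k must be at least 1")
    # Unique subjects in first-seen order (an assumed, deterministic grouping).
    unique_subjects = list(dict.fromkeys(subject_ids))
    for start in range(0, len(unique_subjects), k):
        held_out = set(unique_subjects[start:start + k])
        test_idx = [i for i, s in enumerate(subject_ids) if s in held_out]
        train_idx = [i for i, s in enumerate(subject_ids) if s not in held_out]
        yield train_idx, test_idx


# Hypothetical example: six samples recorded from four subjects.
sample_subjects = ["a", "a", "b", "c", "c", "d"]
folds = list(leave_k_subjects_out(sample_subjects, k=2))
```

Splitting by subject rather than by sample matters when one subject contributes several samples, because samples from the same subject tend to be correlated and would otherwise leak information from training into evaluation.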
