Dataset Information

Prediction of Preeclampsia and Intrauterine Growth Restriction: Development of Machine Learning Models on a Prospective Cohort.

ABSTRACT: BACKGROUND:Preeclampsia and intrauterine growth restriction are placental dysfunction-related disorders (PDDs) that require a referral decision be made within a certain time period. An appropriate prediction model should be developed for these diseases. However, previous models did not demonstrate robust performances and/or they were developed from datasets with highly imbalanced classes. OBJECTIVE:In this study, we developed a predictive model of PDDs by machine learning that uses features at 24-37 weeks' gestation, including maternal characteristics, uterine artery (UtA) Doppler measures, soluble fms-like tyrosine kinase receptor-1 (sFlt-1), and placental growth factor (PlGF). METHODS:A public dataset was taken from a prospective cohort study that included pregnant women with PDDs (66/95, 69%) and a control group (29/95, 31%). Preliminary selection of features was based on a statistical analysis using SAS 9.4 (SAS Institute). We used Weka (Waikato Environment for Knowledge Analysis) 3.8.3 (The University of Waikato, Hamilton, NZ) to automatically select the best model using its optimization algorithm. We also manually selected the best of 23 white-box models. Models, including those from recent studies, were also compared by interval estimation of evaluation metrics. We used the Matthew correlation coefficient (MCC) as the main metric. It is not overoptimistic to evaluate the performance of a prediction model developed from a dataset with a class imbalance. Repeated 10-fold cross-validation was applied. RESULTS:The classification via regression model was chosen as the best model. Our model had a robust MCC (.93, 95% CI .87-1.00, vs .64, 95% CI .57-.71) and specificity (100%, 95% CI 100-100, vs 90%, 95% CI 90-90) compared to each metric of the best models from recent studies. The sensitivity of this model was not inferior (95%, 95% CI 91-100, vs 100%, 95% CI 92-100). The area under the receiver operating characteristic curve was also competitive (0.970, 95% CI 0.966-0.974, vs 0.987, 95% CI 0.980-0.994). Features in the best model were maternal weight, BMI, pulsatility index of the UtA, sFlt-1, and PlGF. The most important feature was the sFlt-1/PlGF ratio. This model used an M5P algorithm consisting of a decision tree and four linear models with different thresholds. Our study was also better than the best ones among recent studies in terms of the class balance and the size of the case class (66/95, 69%, vs 27/239, 11.3%). CONCLUSIONS:Our model had a robust predictive performance. It was also developed to deal with the problem of a class imbalance. In the context of clinical management, this model may improve maternal mortality and neonatal morbidity and reduce health care costs.

SUBMITTER: Sufriyana H

PROVIDER: S-EPMC7265111 | biostudies-literature | 2020 May

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Prediction of Preeclampsia and Intrauterine Growth Restriction: Development of Machine Learning Models on a Prospective Cohort.

Sufriyana Herdiantri H Wu Yu-Wei YW Su Emily Chia-Yu EC

JMIR medical informatics 20200518 5

<h4>Background</h4>Preeclampsia and intrauterine growth restriction are placental dysfunction-related disorders (PDDs) that require a referral decision be made within a certain time period. An appropriate prediction model should be developed for these diseases. However, previous models did not demonstrate robust performances and/or they were developed from datasets with highly imbalanced classes.<h4>Objective</h4>In this study, we developed a predictive model of PDDs by machine learning that use ...[more]

PMID: 32348266

Similar Datasets

Project description:BackgroundIntraUterine Growth Restriction (IUGR) is a global public health concern and has major implications for neonatal health. The early diagnosis of this condition is crucial for obtaining positive outcomes for the newborn. In recent years Artificial intelligence (AI) and machine learning (ML) techniques are being used to identify risk factors and provide early prediction of IUGR. We performed a systematic review (SR) and meta-analysis (MA) aimed to evaluate the use and performance of AI/ML models in detecting fetuses at risk of IUGR.MethodsWe conducted a systematic review according to the PRISMA checklist. We searched for studies in all the principal medical databases (MEDLINE, EMBASE, CINAHL, Scopus, Web of Science, and Cochrane). To assess the quality of the studies we used the JBI and CASP tools. We performed a meta-analysis of the diagnostic test accuracy, along with the calculation of the pooled principal measures.ResultsWe included 20 studies reporting the use of AI/ML models for the prediction of IUGR. Out of these, 10 studies were used for the quantitative meta-analysis. The most common input variable to predict IUGR was the fetal heart rate variability (n = 8, 40%), followed by the biochemical or biological markers (n = 5, 25%), DNA profiling data (n = 2, 10%), Doppler indices (n = 3, 15%), MRI data (n = 1, 5%), and physiological, clinical, or socioeconomic data (n = 1, 5%). Overall, we found that AI/ML techniques could be effective in predicting and identifying fetuses at risk for IUGR during pregnancy with the following pooled overall diagnostic performance: sensitivity = 0.84 (95% CI 0.80-0.88), specificity = 0.87 (95% CI 0.83-0.90), positive predictive value = 0.78 (95% CI 0.68-0.86), negative predictive value = 0.91 (95% CI 0.86-0.94) and diagnostic odds ratio = 30.97 (95% CI 19.34-49.59). In detail, the RF-SVM (Random Forest-Support Vector Machine) model (with 97% accuracy) showed the best results in predicting IUGR from FHR parameters derived from CTG.Conclusionsour findings showed that AI/ML could be part of a more accurate and cost-effective screening method for IUGR and be of help in optimizing pregnancy outcomes. However, before the introduction into clinical daily practice, an appropriate algorithmic improvement and refinement is needed, and the importance of quality assessment and uniform diagnostic criteria should be further emphasized.

Project description:ObjectivesPreeclampsia is divided into early-onset (delivery before 34 weeks of gestation) and late-onset (delivery at or after 34 weeks) subtypes, which may rise from different etiopathogenic backgrounds. Early-onset disease is associated with placental dysfunction. Late-onset disease develops predominantly due to metabolic disturbances, obesity, diabetes, lipid dysfunction, and inflammation, which affect endothelial function. Our aim was to use cluster analysis to investigate clinical factors predicting the onset and severity of preeclampsia in a cohort of women with known clinical risk factors.MethodsWe recruited 903 pregnant women with risk factors for preeclampsia at gestational weeks 12+0-13+6. Each individual outcome diagnosis was independently verified from medical records. We applied a Bayesian clustering algorithm to classify the study participants to clusters based on their particular risk factor combination. For each cluster, we computed the risk ratio of each disease outcome, relative to the risk in the general population.ResultsThe risk of preeclampsia increased exponentially with respect to the number of risk factors. Our analysis revealed 25 number of clusters. Preeclampsia in a previous pregnancy (n = 138) increased the risk of preeclampsia 8.1 fold (95% confidence interval (CI) 5.7-11.2) compared to a general population of pregnant women. Having a small for gestational age infant (n = 57) in a previous pregnancy increased the risk of early-onset preeclampsia 17.5 fold (95%CI 2.1-60.5). Cluster of those two risk factors together (n = 21) increased the risk of severe preeclampsia to 23.8-fold (95%CI 5.1-60.6), intermediate onset (delivery between 34+0-36+6 weeks of gestation) to 25.1-fold (95%CI 3.1-79.9) and preterm preeclampsia (delivery before 37+0 weeks of gestation) to 16.4-fold (95%CI 2.0-52.4). Body mass index over 30 kg/m2 (n = 228) as a sole risk factor increased the risk of preeclampsia to 2.1-fold (95%CI 1.1-3.6). Together with preeclampsia in an earlier pregnancy the risk increased to 11.4 (95%CI 4.5-20.9). Chronic hypertension (n = 60) increased the risk of preeclampsia 5.3-fold (95%CI 2.4-9.8), of severe preeclampsia 22.2-fold (95%CI 9.9-41.0), and risk of early-onset preeclampsia 16.7-fold (95%CI 2.0-57.6). If a woman had chronic hypertension combined with obesity, gestational diabetes and earlier preeclampsia, the risk of term preeclampsia increased 4.8-fold (95%CI 0.1-21.7). Women with type 1 diabetes mellitus had a high risk of all subgroups of preeclampsia.ConclusionThe risk of preeclampsia increases exponentially with respect to the number of risk factors. Early-onset preeclampsia and severe preeclampsia have different risk profile from term preeclampsia.

Project description:BackgroundEarly onset preeclampsia (eoPE) is a hypertensive disorder of pregnancy with endothelial dysfunction manifested before 34 weeks where expectant management is usually attempted. However, the timing of hospitalization, corticosteroids, and delivery remain a challenge. We aim to develop a prediction model using machine-learning tools for the need for delivery within 7 days of diagnosis (model D) and the risk of developing hemolysis, elevated liver enzymes, and low platelets (HELLP) syndrome or abruptio placentae (model HA).Materials and methodsA retrospective cohort of singleton pregnancies with eoPE and attempted expectant management between 2014 and 2020. A Mono-objective Genetic Algorithm based on supervised classification models was implemented to develop D and HA models. Maternal basal characteristics and data gathered during eoPE diagnosis: gestational age, blood pressure, platelets, creatinine, transaminases, angiogenesis biomarkers (soluble fms-like tyrosine kinase-1, placental growth factor), and ultrasound data were pooled for analysis. The most relevant variables were selected by bio-inspired algorithms. We developed basal models that solely included demographic characteristics of the patient (D1, HA1), and advanced models adding information available at diagnosis of eoPE (D2, HA2).ResultsWe evaluated 215 eoPE cases and 47.9% required delivery within 7 days. The median time-to-delivery was 8 days. Basal models were better predicted by K-nearest-neighbor in D1, which had a diagnostic precision of 0.68 ± 0.09, with 63.6% sensitivity (Sn), 71.4% specificity (Sp), 70% positive predictive value (PPV), and 65.2% negative predictive value (NPV) using 13 variables and HA1 of 0.77 ± 0.09, 60.4% Sn, 80% Sp, 50% PPV, and 87.9% NPV. Models at diagnosis were better developed by support vector machine (SVM) using 18 variables, where D2's precision improved to 0.79 ± 0.05 with 77.3% Sn, 80.1% Sp, 81.5% PPV, and 76.2% NPV, and HA2 had a precision of 0.79 ± 0.08 with 66.7% Sn, 82.8% Sp, 51.6% PPV, and 90.3% NPV.ConclusionAt the time of diagnosis of eoPE, SVM with evolutionary feature selection process provides good predictive information of the need for delivery within 7 days and development of HELLP/abruptio placentae, using maternal characteristics and markers that can be obtained routinely. This information could be of value when assessing hospitalization and timing of antenatal corticosteroid administration.

Dataset Information

Prediction of Preeclampsia and Intrauterine Growth Restriction: Development of Machine Learning Models on a Prospective Cohort.

Publications

Prediction of Preeclampsia and Intrauterine Growth Restriction: Development of Machine Learning Models on a Prospective Cohort.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets