
Dataset Information


Biomarker discovery for predicting spontaneous preterm birth from gene expression data by regularized logistic regression.


ABSTRACT: In this work, we provide a computational method based on regularized logistic regression for discovering biomarkers of spontaneous preterm birth (SPTB) from gene expression data. The successful identification of SPTB biomarkers will greatly benefit the interference of infant gestational age for reducing the risks to pregnant women and preemies. In recent years, various approaches have been proposed for feature selection, that is, for identifying the subset of meaningful genes that can accurately classify disease samples against controls. Here, we comprehensively summarize regularized logistic regression with seven effective penalties developed for selecting strongly indicative genes of SPTB from microarray data. We compare their properties and assess their classification performance on multiple datasets. The results show that the elastic net, lasso, L1/2 and SCAD penalties perform better than the others and can be successfully used to identify biomarkers of SPTB. In particular, we perform a functional enrichment analysis on these biomarkers and construct a logistic regression classifier based on them. The classifier generates a preterm risk score (PRS) as an indicator for predicting SPTB. Based on the trained predictor, we verify the identified biomarkers on an independent dataset. The biomarkers achieve an AUC of 0.933 in SPTB classification. The results demonstrate the effectiveness and efficiency of the proposed strategy of biomarker discovery with regularized logistic regression. The proposed method of discovering biomarkers for SPTB can readily be extended to other complex diseases.
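As an illustration of the general technique named in the abstract, the following is a minimal sketch of logistic regression with a lasso (L1) penalty, fitted by proximal gradient descent, followed by a risk score and an AUC computation. It is not the authors' implementation: the data are synthetic, only one of the seven penalties is shown, and every name and parameter value (fit_lasso_logistic, risk_score, lam, step, n_iter) is a hypothetical choice made for this example.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def soft_threshold(v, t):
    # Proximal operator of the L1 penalty: shrink v toward zero by t.
    if v > t:
        return v - t
    if v < -t:
        return v + t
    return 0.0


def fit_lasso_logistic(X, y, lam=0.05, step=0.1, n_iter=500):
    # Proximal gradient descent for L1-penalized logistic regression.
    # lam, step and n_iter are illustrative values, not tuned settings.
    n, p = len(X), len(X[0])
    w, b = [0.0] * p, 0.0
    for _ in range(n_iter):
        grad_w, grad_b = [0.0] * p, 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(b + sum(wj * xj for wj, xj in zip(w, xi))) - yi
            grad_b += err / n
            for j in range(p):
                grad_w[j] += err * xi[j] / n
        b -= step * grad_b  # the intercept is left unpenalized
        w = [soft_threshold(wj - step * gj, step * lam)
             for wj, gj in zip(w, grad_w)]
    return w, b


def risk_score(w, b, x):
    # Predicted probability of the positive class for one sample.
    return sigmoid(b + sum(wj * xj for wj, xj in zip(w, x)))


def auc(scores, labels):
    # AUC as the fraction of (positive, negative) pairs ranked correctly,
    # counting ties as half.
    pos = [s for s, l in zip(scores, labels) if l == 1]
    neg = [s for s, l in zip(scores, labels) if l == 0]
    wins = sum((sp > sn) + 0.5 * (sp == sn) for sp in pos for sn in neg)
    return wins / (len(pos) * len(neg))


# Synthetic "expression" data (an assumption for this sketch): 6 features,
# of which only features 0 and 1 carry signal about the class label.
rng = random.Random(0)


def make_sample(label):
    x = [rng.gauss(0.0, 1.0) for _ in range(6)]
    x[0] += 2.0 * label
    x[1] -= 2.0 * label
    return x


y = [i % 2 for i in range(80)]
X = [make_sample(label) for label in y]

w, b = fit_lasso_logistic(X, y)
selected = [j for j, wj in enumerate(w) if wj != 0.0]
scores = [risk_score(w, b, x) for x in X]
print("selected features:", selected)
print("training AUC:", round(auc(scores, y), 3))
```

The soft-thresholding step is what sets weak coefficients exactly to zero, which is why an L1-type penalty can act as a feature selector. The AUC printed here is computed on the training data, so it is optimistic; the abstract's AUC of 0.933 refers to an independent dataset.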

SUBMITTER: Li L 

PROVIDER: S-EPMC7689379 | biostudies-literature | 2020

REPOSITORIES: biostudies-literature


Publications

Biomarker discovery for predicting spontaneous preterm birth from gene expression data by regularized logistic regression.

Li Lingyu, Liu Zhi-Ping

Computational and Structural Biotechnology Journal, 2020-11-10



Similar Datasets

| S-EPMC8596493 | biostudies-literature
| S-EPMC4769543 | biostudies-literature
| S-EPMC8244906 | biostudies-literature
| S-EPMC9844919 | biostudies-literature
| S-EPMC5098220 | biostudies-literature
2021-08-10 | GSE120480 | GEO
2021-12-06 | PXD028343 | Pride
| S-EPMC5834399 | biostudies-literature
| S-EPMC4672891 | biostudies-literature
| S-EPMC8633292 | biostudies-literature