Dataset Information

A biological network-based regularized artificial neural network model for robust phenotype prediction from gene expression data.


ABSTRACT: BACKGROUND: Stratification of patient subpopulations that respond favorably to treatment or experience an adverse reaction is an essential step toward the development of new personalized therapies and diagnostics. It is currently feasible to generate omic-scale biological measurements for all patients in a study, providing an opportunity for machine learning models to identify molecular markers for disease diagnosis and progression. However, the high variability of genetic background in human populations hampers the reproducibility of omic-scale markers. In this paper, we develop a biological network-based regularized artificial neural network model for prediction of phenotype from transcriptomic measurements in clinical trials. To improve model sparsity and the overall reproducibility of the model, we incorporate regularization for simultaneous shrinkage of gene sets based on active upstream regulatory mechanisms into the model. RESULTS: We benchmark our method against various regression, support vector machine, and artificial neural network models and demonstrate its ability to predict clinical outcomes using clinical trial data on acute rejection in kidney transplantation and response to Infliximab in ulcerative colitis. We show that integration of prior biological knowledge into the classification, as developed in this paper, significantly improves the robustness and generalizability of predictions to independent datasets. We provide Java code for our algorithm along with a parsed version of the STRING database. CONCLUSION: In summary, we present a method for prediction of clinical phenotypes using baseline genome-wide expression data that makes use of prior biological knowledge on gene-regulatory interactions in order to increase the robustness and reproducibility of omic-scale markers. The integrated group-wise regularization method increases the interpretability of biological signatures and gives stable performance estimates across independent test sets.

SUBMITTER: Kang T 

PROVIDER: S-EPMC5735940 | biostudies-literature | 2017 Dec

REPOSITORIES: biostudies-literature

Publications

A biological network-based regularized artificial neural network model for robust phenotype prediction from gene expression data.

Tianyu Kang, Wei Ding, Luoyan Zhang, Daniel Ziemek, Kourosh Zarringhalam

BMC Bioinformatics, 2017 Dec 19; issue 1


Similar Datasets

| S-EPMC9038085 | biostudies-literature
| S-EPMC8062617 | biostudies-literature
| S-EPMC7643315 | biostudies-literature
| S-EPMC6028566 | biostudies-literature
| S-EPMC140555 | biostudies-literature
| S-EPMC6482337 | biostudies-literature
| S-EPMC8994362 | biostudies-literature
| S-EPMC5909924 | biostudies-literature
| S-EPMC7039514 | biostudies-literature
| S-EPMC7178452 | biostudies-literature