Unknown

Dataset Information

0

Bayesian Neural Networks for Selection of Drug Sensitive Genes.


ABSTRACT: Recent advances in high-throughput biotechnologies have provided an unprecedented opportunity for biomarker discovery, which, from a statistical point of view, can be cast as a variable selection problem. This problem is challenging due to the high-dimensional and non-linear nature of omics data and, in general, it suffers three difficulties: (i) an unknown functional form of the nonlinear system, (ii) variable selection consistency, and (iii) high-demanding computation. To circumvent the first difficulty, we employ a feed-forward neural network to approximate the unknown nonlinear function motivated by its universal approximation ability. To circumvent the second difficulty, we conduct structure selection for the neural network, which induces variable selection, by choosing appropriate prior distributions that lead to the consistency of variable selection. To circumvent the third difficulty, we implement the population stochastic approximation Monte Carlo algorithm, a parallel adaptive Markov Chain Monte Carlo (MCMC) algorithm, on the OpenMP platform which provides a linear speedup for the simulation with the number of cores of the computer. The numerical results indicate that the proposed method can work very well for identification of relevant variables for high-dimensional nonlinear systems. The proposed method is successfully applied to identification of the genes that are associated with anticancerdrug sensitivities based on the data collected in the cancer cell line encyclopedia (CCLE) study.

SUBMITTER: Liang F 

PROVIDER: S-EPMC6660200 | biostudies-literature | 2018

REPOSITORIES: biostudies-literature

altmetric image

Publications

Bayesian Neural Networks for Selection of Drug Sensitive Genes.

Liang Faming F   Li Qizhai Q   Zhou Lei L  

Journal of the American Statistical Association 20180628 523


Recent advances in high-throughput biotechnologies have provided an unprecedented opportunity for biomarker discovery, which, from a statistical point of view, can be cast as a variable selection problem. This problem is challenging due to the high-dimensional and non-linear nature of omics data and, in general, it suffers three difficulties: (i) an unknown functional form of the nonlinear system, (ii) variable selection consistency, and (iii) high-demanding computation. To circumvent the first  ...[more]

Similar Datasets

| S-EPMC8557386 | biostudies-literature
| S-EPMC9234235 | biostudies-literature
| S-EPMC4575256 | biostudies-literature
| S-EPMC9708898 | biostudies-literature
| S-EPMC4752975 | biostudies-literature
| S-EPMC9021150 | biostudies-literature
2024-08-06 | MODEL2408060002 | BioModels
| S-EPMC6868054 | biostudies-literature
| S-EPMC10438902 | biostudies-literature
| S-EPMC4256933 | biostudies-literature