Dataset Information

Reconstruction of gene networks using prior knowledge.


ABSTRACT: Reconstructing gene regulatory networks (GRNs) from expression data is a challenging task that has become essential to understanding complex regulatory mechanisms in cells. The major issues are the usually very high ratio of the number of genes to the sample size, and the noise in the available data. Integrating biological prior knowledge into the learning process is a natural and promising way to partially compensate for the lack of reliable expression data and to increase the accuracy of network reconstruction algorithms. In this manuscript, we present PriorPC, a new algorithm based on the PC algorithm. The PC algorithm is one of the most popular methods for Bayesian network reconstruction. The result of PC is known to depend on the order in which conditional independence tests are processed, especially for large networks. PriorPC uses prior knowledge to exclude unlikely edges from network estimation and introduces a particular ordering for the conditional independence tests. We show on synthetic data that the structural accuracy of networks obtained with PriorPC is greatly improved compared to PC. PriorPC improves the structural accuracy of inferred gene networks by using soft priors, which assign to each edge a probability of existence. It is robust to false prior knowledge, which is unavoidable in the context of biological data. PriorPC is also fast and scales well to large networks, which is important for its applicability to real data.
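
The two ideas named in the abstract (excluding unlikely edges and ordering the conditional independence tests by prior) can be illustrated with a minimal sketch of a PC-style skeleton search. This is not the authors' implementation. The function name prior_ordered_skeleton, the threshold min_prior, the "least likely edge first" ordering, the cap max_cond on conditioning-set size, and the use of a Fisher z-test on partial correlations are all assumptions made for illustration. The prior is assumed to be a symmetric matrix of edge probabilities.

```python
import itertools
import math

import numpy as np


def partial_corr(corr, i, j, cond):
    """Partial correlation of i and j given cond, read off the inverse
    of the correlation sub-matrix over (i, j, *cond)."""
    idx = [i, j, *cond]
    prec = np.linalg.pinv(corr[np.ix_(idx, idx)])
    return -prec[0, 1] / math.sqrt(prec[0, 0] * prec[1, 1])


def is_independent(corr, n, i, j, cond, z_crit):
    """Fisher z-test: True if i and j look independent given cond."""
    r = max(min(partial_corr(corr, i, j, cond), 0.999999), -0.999999)
    z = 0.5 * math.log((1 + r) / (1 - r))
    return math.sqrt(n - len(cond) - 3) * abs(z) < z_crit


def prior_ordered_skeleton(data, prior, min_prior=0.1, max_cond=2, z_crit=1.96):
    """Hypothetical prior-guided skeleton search (illustrative only).

    data:  (n_samples, n_genes) expression matrix
    prior: symmetric (n_genes, n_genes) matrix of edge probabilities
    Returns the set of surviving undirected edges as (i, j) with i < j.
    """
    n, p = data.shape
    corr = np.corrcoef(data, rowvar=False)

    # Assumption 1: edges whose prior falls below min_prior are dropped
    # before any test is run.
    adj = {i: {j for j in range(p) if j != i and prior[i, j] >= min_prior}
           for i in range(p)}

    for size in range(max_cond + 1):
        # Assumption 2: edges are tested in order of increasing prior, so
        # the least supported edges are challenged first.
        edges = sorted(((i, j) for i in adj for j in adj[i] if i < j),
                       key=lambda e: prior[e])
        for i, j in edges:
            # Conditioning sets come from the current neighbours of each
            # endpoint, as in the order-dependent PC algorithm.
            for a, b in ((i, j), (j, i)):
                nbrs = sorted(adj[a] - {b})
                if any(is_independent(corr, n, i, j, c, z_crit)
                       for c in itertools.combinations(nbrs, size)):
                    adj[i].discard(j)
                    adj[j].discard(i)
                    break

    return {(i, j) for i in adj for j in adj[i] if i < j}
```

Calling prior_ordered_skeleton(data, prior) on an expression matrix and a prior matrix returns the estimated undirected skeleton; edge orientation, which the full PC algorithm performs afterwards, is left out of this sketch. The default z_crit of 1.96 corresponds to a two-sided 5% significance level.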

SUBMITTER: Ghanbari M 

PROVIDER: S-EPMC4654848 | biostudies-other | 2015

REPOSITORIES: biostudies-other

Publications

Reconstruction of gene networks using prior knowledge.

Mahsa Ghanbari, Julia Lasserre, Martin Vingron

BMC Systems Biology, 2015-11-20


Similar Datasets

| S-EPMC5760079 | biostudies-literature
2016-11-08 | GSE73638 | GEO
| S-EPMC4461287 | biostudies-literature
| S-EPMC4067568 | biostudies-literature
2016-11-08 | GSE73551 | GEO
| S-EPMC4905609 | biostudies-literature
2016-11-08 | GSE73637 | GEO
| S-EPMC4447287 | biostudies-literature
| S-EPMC6584077 | biostudies-literature
| PRJNA297464 | ENA