Unknown

Dataset Information

0

Integration of Multiple Data Sources for Gene Network Inference Using Genetic Perturbation Data.


ABSTRACT: The inference of gene networks from large-scale human genomic data is challenging due to the difficulty in identifying correct regulators for each gene in a high-dimensional search space. We present a Bayesian approach integrating external data sources with knockdown data from human cell lines to infer gene regulatory networks. In particular, we assemble multiple data sources, including gene expression data, genome-wide binding data, gene ontology, and known pathways, and use a supervised learning framework to compute prior probabilities of regulatory relationships. We show that our integrated method improves the accuracy of inferred gene networks as well as extends some previous Bayesian frameworks both in theory and applications. We apply our method to two different human cell lines, namely skin melanoma cell line A375 and lung cancer cell line A549, to illustrate the capabilities of our method. Our results show that the improvement in performance could vary from cell line to cell line and that we might need to choose different external data sources serving as prior knowledge if we hope to obtain better accuracy for different cell lines.

SUBMITTER: Liang X 

PROVIDER: S-EPMC6786343 | biostudies-literature | 2019 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

Integration of Multiple Data Sources for Gene Network Inference Using Genetic Perturbation Data.

Liang Xiao X   Young William Chad WC   Hung Ling-Hong LH   Raftery Adrian E AE   Yeung Ka Yee KY  

Journal of computational biology : a journal of computational molecular cell biology 20190422 10


<b>The inference of gene networks from large-scale human genomic data is challenging due to the difficulty in identifying correct regulators for each gene in a high-dimensional search space. We present a Bayesian approach integrating external data sources with knockdown data from human cell lines to infer gene regulatory networks. In particular, we assemble multiple data sources, including gene expression data, genome-wide binding data, gene ontology, and known pathways, and use a supervised lea  ...[more]

Similar Datasets

| S-EPMC4750075 | biostudies-literature
| S-EPMC4079061 | biostudies-literature
| S-EPMC6956787 | biostudies-literature
| S-EPMC9756951 | biostudies-literature
| S-EPMC5286517 | biostudies-literature
| S-EPMC2585105 | biostudies-literature
| S-EPMC7447758 | biostudies-literature
| S-EPMC10245711 | biostudies-literature
| S-EPMC2268002 | biostudies-literature
| S-EPMC9946838 | biostudies-literature