Dataset Information

NetGen: a novel network-based probabilistic generative model for gene set functional enrichment analysis.


ABSTRACT: High-throughput experimental techniques have improved dramatically and been widely applied over the past decades. However, the biological interpretation of high-throughput experimental results, such as differentially expressed gene sets derived from microarray or RNA-seq experiments, remains a challenging task. Gene Ontology (GO) is commonly used in functional enrichment studies. The GO terms identified by current functional enrichment analysis tools often include direct parent or descendant terms in the GO hierarchy, and such highly redundant terms make it difficult for users to analyze the underlying biological processes. In this paper, a novel network-based probabilistic generative model, NetGen, is proposed to perform functional enrichment analysis. An additional protein-protein interaction (PPI) network is explicitly used to assist the identification of significantly enriched GO terms. NetGen outperformed existing methods in simulation studies. The effectiveness of NetGen was further explored on four real datasets. Notably, several GO terms that were not directly linked with the active gene list for each disease were identified, and the curated literature indicates that these terms are closely related to the corresponding diseases. NetGen has been implemented in the R package CopTea, publicly available on GitHub (http://github.com/wulingyun/CopTea/). Our procedure leads to more reasonable and interpretable functional enrichment results. As a novel term combination-based functional enrichment analysis method, NetGen is complementary to current individual term-based methods and can help to explore the underlying pathogenesis of complex diseases.
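For readers unfamiliar with the individual term-based methods that the abstract contrasts with NetGen, the following is a minimal sketch of the conventional approach: a one-sided hypergeometric test applied to each GO term independently. It is not NetGen and does not use the CopTea package; the function names, gene identifiers, and term labels (`GO:toy_A`, `GO:toy_B`) are hypothetical and chosen only for illustration.

```python
from math import comb


def hypergeom_enrichment_pvalue(population_size, term_size, active_size, overlap):
    """Return P(X >= overlap) for X ~ Hypergeometric(N, K, n).

    N = population_size (background genes), K = term_size (genes annotated
    to the term), n = active_size (genes in the active list).
    """
    upper = min(term_size, active_size)
    total = comb(population_size, active_size)
    tail = sum(
        comb(term_size, k) * comb(population_size - term_size, active_size - k)
        for k in range(overlap, upper + 1)
    )
    return tail / total


def enrich_terms(background, active, term_annotations):
    """Score every term independently; return (term, p-value) sorted by p-value."""
    active = active & background
    results = []
    for term, genes in term_annotations.items():
        genes = genes & background
        p = hypergeom_enrichment_pvalue(
            len(background), len(genes), len(active), len(genes & active)
        )
        results.append((term, p))
    return sorted(results, key=lambda item: item[1])


# Toy data (hypothetical): 20 background genes, 5 active genes, 2 terms.
background = {f"g{i}" for i in range(1, 21)}
active = {"g1", "g2", "g3", "g4", "g5"}
term_annotations = {
    "GO:toy_A": {"g1", "g2", "g3", "g4", "g10"},
    "GO:toy_B": {"g5", "g11", "g12", "g13", "g14"},
}

ranked = enrich_terms(background, active, term_annotations)
for term, p in ranked:
    print(f"{term}\t{p:.6f}")
```

Because each term is tested in isolation, a parent term and its descendants can all score as significant on largely the same genes, which is the redundancy the abstract describes. A term combination-based method instead evaluates sets of terms jointly.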

SUBMITTER: Sun D 

PROVIDER: S-EPMC5615262 | biostudies-literature | 2017 Sep

REPOSITORIES: biostudies-literature

Publications

NetGen: a novel network-based probabilistic generative model for gene set functional enrichment analysis.

Sun Duanchen, Liu Yinliang, Zhang Xiang-Sun, Wu Ling-Yun

BMC Systems Biology, 2017-09-21, Suppl 4


Similar Datasets

| S-EPMC2553574 | biostudies-literature
| S-EPMC3436816 | biostudies-other
| S-EPMC6117355 | biostudies-literature
| S-EPMC2981572 | biostudies-literature
| S-EPMC3572115 | biostudies-literature
| S-EPMC3505158 | biostudies-literature
| S-EPMC3278830 | biostudies-literature
| S-EPMC2440424 | biostudies-literature
| S-EPMC4039382 | biostudies-literature