Unknown

Dataset Information

0

CLIC, a tool for expanding biological pathways based on co-expression across thousands of datasets.


ABSTRACT: In recent years, there has been a huge rise in the number of publicly available transcriptional profiling datasets. These massive compendia comprise billions of measurements and provide a special opportunity to predict the function of unstudied genes based on co-expression to well-studied pathways. Such analyses can be very challenging, however, since biological pathways are modular and may exhibit co-expression only in specific contexts. To overcome these challenges we introduce CLIC, CLustering by Inferred Co-expression. CLIC accepts as input a pathway consisting of two or more genes. It then uses a Bayesian partition model to simultaneously partition the input gene set into coherent co-expressed modules (CEMs), while assigning the posterior probability for each dataset in support of each CEM. CLIC then expands each CEM by scanning the transcriptome for additional co-expressed genes, quantified by an integrated log-likelihood ratio (LLR) score weighted for each dataset. As a byproduct, CLIC automatically learns the conditions (datasets) within which a CEM is operative. We implemented CLIC using a compendium of 1774 mouse microarray datasets (28628 microarrays) or 1887 human microarray datasets (45158 microarrays). CLIC analysis reveals that of 910 canonical biological pathways, 30% consist of strongly co-expressed gene modules for which new members are predicted. For example, CLIC predicts a functional connection between protein C7orf55 (FMC1) and the mitochondrial ATP synthase complex that we have experimentally validated. CLIC is freely available at www.gene-clic.org. We anticipate that CLIC will be valuable both for revealing new components of biological pathways as well as the conditions in which they are active.

SUBMITTER: Li Y 

PROVIDER: S-EPMC5546725 | biostudies-literature | 2017 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

CLIC, a tool for expanding biological pathways based on co-expression across thousands of datasets.

Li Yang Y   Jourdain Alexis A AA   Calvo Sarah E SE   Liu Jun S JS   Mootha Vamsi K VK  

PLoS computational biology 20170718 7


In recent years, there has been a huge rise in the number of publicly available transcriptional profiling datasets. These massive compendia comprise billions of measurements and provide a special opportunity to predict the function of unstudied genes based on co-expression to well-studied pathways. Such analyses can be very challenging, however, since biological pathways are modular and may exhibit co-expression only in specific contexts. To overcome these challenges we introduce CLIC, CLusterin  ...[more]

Similar Datasets

| S-EPMC2896182 | biostudies-literature
| S-EPMC6378939 | biostudies-literature
| S-EPMC6309236 | biostudies-literature
| S-EPMC7020396 | biostudies-literature
| S-EPMC9974061 | biostudies-literature
| S-EPMC2978222 | biostudies-literature
| S-EPMC9421690 | biostudies-literature
| S-EPMC4581526 | biostudies-literature
| S-EPMC6853649 | biostudies-literature
| S-EPMC3469350 | biostudies-literature