Unknown

Dataset Information

0

WeGET: predicting new genes for molecular systems by weighted co-expression.


ABSTRACT: We have developed the Weighted Gene Expression Tool and database (WeGET, http://weget.cmbi.umcn.nl) for the prediction of new genes of a molecular system by correlated gene expression. WeGET utilizes a compendium of 465 human and 560 murine gene expression datasets that have been collected from multiple tissues under a wide range of experimental conditions. It exploits this abundance of expression data by assigning a high weight to datasets in which the known genes of a molecular system are harmoniously up- and down-regulated. WeGET ranks new candidate genes by calculating their weighted co-expression with that system. A weighted rank is calculated for human genes and their mouse orthologs. Then, an integrated gene rank and p-value is computed using a rank-order statistic. We applied our method to predict novel genes that have a high degree of co-expression with Gene Ontology terms and pathways from KEGG and Reactome. For each query set we provide a list of predicted novel genes, computed weights for transcription datasets used and cell and tissue types that contributed to the final predictions. The performance for each query set is assessed by 10-fold cross-validation. Finally, users can use the WeGET to predict novel genes that co-express with a custom query set.

SUBMITTER: Szklarczyk R 

PROVIDER: S-EPMC4702868 | biostudies-literature | 2016 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

WeGET: predicting new genes for molecular systems by weighted co-expression.

Szklarczyk Radek R   Megchelenbrink Wout W   Cizek Pavel P   Ledent Marie M   Velemans Gonny G   Szklarczyk Damian D   Huynen Martijn A MA  

Nucleic acids research 20151117 D1


We have developed the Weighted Gene Expression Tool and database (WeGET, http://weget.cmbi.umcn.nl) for the prediction of new genes of a molecular system by correlated gene expression. WeGET utilizes a compendium of 465 human and 560 murine gene expression datasets that have been collected from multiple tissues under a wide range of experimental conditions. It exploits this abundance of expression data by assigning a high weight to datasets in which the known genes of a molecular system are harm  ...[more]

Similar Datasets

| S-EPMC10116020 | biostudies-literature
| S-EPMC6023727 | biostudies-literature
| S-EPMC7887323 | biostudies-literature
| S-EPMC7609726 | biostudies-literature
| S-EPMC7463304 | biostudies-literature
| S-EPMC7815844 | biostudies-literature
| S-EPMC3431293 | biostudies-literature
| S-EPMC5122405 | biostudies-literature
| S-EPMC9046402 | biostudies-literature
| S-EPMC5937478 | biostudies-literature