Unknown

Dataset Information

0

Combining evidence of preferential gene-tissue relationships from multiple sources.


ABSTRACT: An important challenge in drug discovery and disease prognosis is to predict genes that are preferentially expressed in one or a few tissues, i.e. showing a considerably higher expression in one tissue(s) compared to the others. Although several data sources and methods have been published explicitly for this purpose, they often disagree and it is not evident how to retrieve these genes and how to distinguish true biological findings from those that are due to choice-of-method and/or experimental settings. In this work we have developed a computational approach that combines results from multiple methods and datasets with the aim to eliminate method/study-specific biases and to improve the predictability of preferentially expressed human genes. A rule-based score is used to merge and assign support to the results. Five sets of genes with known tissue specificity were used for parameter pruning and cross-validation. In total we identify 3434 tissue-specific genes. We compare the genes of highest scores with the public databases: PaGenBase (microarray), TiGER (EST) and HPA (protein expression data). The results have 85% overlap to PaGenBase, 71% to TiGER and only 28% to HPA. 99% of our predictions have support from at least one of these databases. Our approach also performs better than any of the databases on identifying drug targets and biomarkers with known tissue-specificity.

SUBMITTER: Guo J 

PROVIDER: S-EPMC3741196 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

altmetric image

Publications

Combining evidence of preferential gene-tissue relationships from multiple sources.

Guo Jing J   Hammar Mårten M   Oberg Lisa L   Padmanabhuni Shanmukha S SS   Bjäreland Marcus M   Dalevi Daniel D  

PloS one 20130812 8


An important challenge in drug discovery and disease prognosis is to predict genes that are preferentially expressed in one or a few tissues, i.e. showing a considerably higher expression in one tissue(s) compared to the others. Although several data sources and methods have been published explicitly for this purpose, they often disagree and it is not evident how to retrieve these genes and how to distinguish true biological findings from those that are due to choice-of-method and/or experimenta  ...[more]

Similar Datasets

| S-EPMC7571608 | biostudies-literature
| S-EPMC3969754 | biostudies-literature
| S-EPMC3964681 | biostudies-literature
| S-EPMC10150643 | biostudies-literature
| S-EPMC1255806 | biostudies-literature
| S-EPMC8782526 | biostudies-literature
| S-EPMC7324951 | biostudies-literature
| S-EPMC3668276 | biostudies-literature
| S-EPMC3123338 | biostudies-literature
| S-EPMC3873956 | biostudies-literature