Unknown

Dataset Information

0

Combining guilt-by-association and guilt-by-profiling to predict Saccharomyces cerevisiae gene function.


ABSTRACT: Learning the function of genes is a major goal of computational genomics. Methods for inferring gene function have typically fallen into two categories: 'guilt-by-profiling', which exploits correlation between function and other gene characteristics; and 'guilt-by-association', which transfers function from one gene to another via biological relationships.We have developed a strategy ('Funckenstein') that performs guilt-by-profiling and guilt-by-association and combines the results. Using a benchmark set of functional categories and input data for protein-coding genes in Saccharomyces cerevisiae, Funckenstein was compared with a previous combined strategy. Subsequently, we applied Funckenstein to 2,455 Gene Ontology terms. In the process, we developed 2,455 guilt-by-profiling classifiers based on 8,848 gene characteristics and 12 functional linkage graphs based on 23 biological relationships.Funckenstein outperforms a previous combined strategy using a common benchmark dataset. The combination of 'guilt-by-profiling' and 'guilt-by-association' gave significant improvement over the component classifiers, showing the greatest synergy for the most specific functions. Performance was evaluated by cross-validation and by literature examination of the top-scoring novel predictions. These quantitative predictions should help prioritize experimental study of yeast gene functions.

SUBMITTER: Tian W 

PROVIDER: S-EPMC2447541 | biostudies-literature | 2008

REPOSITORIES: biostudies-literature

altmetric image

Publications

Combining guilt-by-association and guilt-by-profiling to predict Saccharomyces cerevisiae gene function.

Tian Weidong W   Zhang Lan V LV   Taşan Murat M   Gibbons Francis D FD   King Oliver D OD   Park Julie J   Wunderlich Zeba Z   Cherry J Michael JM   Roth Frederick P FP  

Genome biology 20080627


<h4>Background</h4>Learning the function of genes is a major goal of computational genomics. Methods for inferring gene function have typically fallen into two categories: 'guilt-by-profiling', which exploits correlation between function and other gene characteristics; and 'guilt-by-association', which transfers function from one gene to another via biological relationships.<h4>Results</h4>We have developed a strategy ('Funckenstein') that performs guilt-by-profiling and guilt-by-association and  ...[more]

Similar Datasets

| S-EPMC166232 | biostudies-literature
| S-EPMC102388 | biostudies-literature
| S-EPMC3096706 | biostudies-literature
| S-EPMC3598864 | biostudies-literature
| S-EPMC3953483 | biostudies-literature
| S-EPMC6395859 | biostudies-literature
| S-EPMC5328829 | biostudies-literature
| S-EPMC7363925 | biostudies-literature
| S-EPMC2213690 | biostudies-literature
| S-EPMC2573129 | biostudies-literature