
Dataset Information


Integrating protein-protein interactions and text mining for protein function prediction.


ABSTRACT: BACKGROUND: Functional annotation of proteins remains a challenging task. Currently the scientific literature serves as the main source for functional annotations that have not yet been curated, but curation work is slow and expensive. Automatic techniques that support this work still lack reliability. We developed a method to identify conserved protein interaction graphs and to predict missing protein functions from orthologs in these graphs. To enhance the precision of the results, we furthermore implemented a procedure that validates all predictions based on findings reported in the literature. RESULTS: Using this procedure, more than 80% of the GO annotations for proteins with highly conserved orthologs that are available in UniProtKB/Swiss-Prot could be verified automatically. For a subset of proteins we predicted new GO annotations that were not available in UniProtKB/Swiss-Prot. All predictions were correct (100% precision) according to verification by a trained curator. CONCLUSION: Our method of integrating conserved and connected subgraphs (CCSs) and literature mining is thus a highly reliable approach to predict GO annotations for weakly characterized proteins with orthologs.

SUBMITTER: Jaeger S 

PROVIDER: S-EPMC2500093 | biostudies-literature | 2008

REPOSITORIES: biostudies-literature


Publications

Integrating protein-protein interactions and text mining for protein function prediction.

Samira Jaeger, Sylvain Gaudan, Ulf Leser, Dietrich Rebholz-Schuhmann

BMC Bioinformatics, 2008-07-22



Similar Datasets

| S-EPMC3290545 | biostudies-literature
| S-EPMC6649004 | biostudies-literature
| S-EPMC1869015 | biostudies-literature
| S-EPMC4331678 | biostudies-literature
| S-EPMC4674139 | biostudies-literature
| S-EPMC2216687 | biostudies-literature
| S-EPMC11009020 | biostudies-literature
| S-EPMC3932451 | biostudies-literature
| S-EPMC3584913 | biostudies-literature
| S-EPMC11374024 | biostudies-literature