Unknown

Dataset Information

0

Combining modularity, conservation, and interactions of proteins significantly increases precision and coverage of protein function prediction.


ABSTRACT:

Background

While the number of newly sequenced genomes and genes is constantly increasing, elucidation of their function still is a laborious and time-consuming task. This has led to the development of a wide range of methods for predicting protein functions in silico. We report on a new method that predicts function based on a combination of information about protein interactions, orthology, and the conservation of protein networks in different species.

Results

We show that aggregation of these independent sources of evidence leads to a drastic increase in number and quality of predictions when compared to baselines and other methods reported in the literature. For instance, our method generates more than 12,000 novel protein functions for human with an estimated precision of ~76%, among which are 7,500 new functional annotations for 1,973 human proteins that previously had zero or only one function annotated. We also verified our predictions on a set of genes that play an important role in colorectal cancer (MLH1, PMS2, EPHB4 ) and could confirm more than 73% of them based on evidence in the literature.

Conclusions

The combination of different methods into a single, comprehensive prediction method infers thousands of protein functions for every species included in the analysis at varying, yet always high levels of precision and very good coverage.

SUBMITTER: Jaeger S 

PROVIDER: S-EPMC3017542 | biostudies-literature | 2010 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Combining modularity, conservation, and interactions of proteins significantly increases precision and coverage of protein function prediction.

Jaeger Samira S   Sers Christine T CT   Leser Ulf U  

BMC genomics 20101220


<h4>Background</h4>While the number of newly sequenced genomes and genes is constantly increasing, elucidation of their function still is a laborious and time-consuming task. This has led to the development of a wide range of methods for predicting protein functions in silico. We report on a new method that predicts function based on a combination of information about protein interactions, orthology, and the conservation of protein networks in different species.<h4>Results</h4>We show that aggre  ...[more]

Similar Datasets

| S-EPMC6299973 | biostudies-literature
| S-EPMC2718497 | biostudies-literature
| S-EPMC7890638 | biostudies-literature
| S-EPMC5133940 | biostudies-literature
| 2014357 | ecrin-mdr-crc
| S-EPMC3424558 | biostudies-literature
| S-EPMC4814079 | biostudies-literature
| S-EPMC2430716 | biostudies-literature
| S-EPMC3302896 | biostudies-other
| S-EPMC390347 | biostudies-literature