Unknown

Dataset Information

0

Predicting protein function by machine learning on amino acid sequences--a critical evaluation.


ABSTRACT:

Background

Predicting the function of newly discovered proteins by simply inspecting their amino acid sequence is one of the major challenges of post-genomic computational biology, especially when done without recourse to experimentation or homology information. Machine learning classifiers are able to discriminate between proteins belonging to different functional classes. Until now, however, it has been unclear if this ability would be transferable to proteins of unknown function, which may show distinct biases compared to experimentally more tractable proteins.

Results

Here we show that proteins with known and unknown function do indeed differ significantly. We then show that proteins from different bacterial species also differ to an even larger and very surprising extent, but that functional classifiers nonetheless generalize successfully across species boundaries. We also show that in the case of highly specialized proteomes classifiers from a different, but more conventional, species may in fact outperform the endogenous species-specific classifier.

Conclusion

We conclude that there is very good prospect of successfully predicting the function of yet uncharacterized proteins using machine learning classifiers trained on proteins of known function.

SUBMITTER: Al-Shahib A 

PROVIDER: S-EPMC1847686 | biostudies-literature | 2007 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Predicting protein function by machine learning on amino acid sequences--a critical evaluation.

Al-Shahib Ali A   Breitling Rainer R   Gilbert David R DR  

BMC genomics 20070320


<h4>Background</h4>Predicting the function of newly discovered proteins by simply inspecting their amino acid sequence is one of the major challenges of post-genomic computational biology, especially when done without recourse to experimentation or homology information. Machine learning classifiers are able to discriminate between proteins belonging to different functional classes. Until now, however, it has been unclear if this ability would be transferable to proteins of unknown function, whic  ...[more]

Similar Datasets

| S-EPMC9241370 | biostudies-literature
| S-EPMC4047675 | biostudies-literature
| S-EPMC7016212 | biostudies-literature
2021-07-09 | GSE163896 | GEO
| S-EPMC6555512 | biostudies-literature
| S-EPMC9249596 | biostudies-literature
| S-EPMC7373184 | biostudies-literature
| S-EPMC6855455 | biostudies-literature
| S-EPMC8743549 | biostudies-literature
| S-EPMC6151554 | biostudies-literature