Unknown

Dataset Information

0

'Unite and conquer': enhanced prediction of protein subcellular localization by integrating multiple specialized tools.


ABSTRACT:

Background

Knowing the subcellular location of proteins provides clues to their function as well as the interconnectivity of biological processes. Dozens of tools are available for predicting protein location in the eukaryotic cell. Each tool performs well on certain data sets, but their predictions often disagree for a given protein. Since the individual tools each have particular strengths, we set out to integrate them in a way that optimally exploits their potential. The method we present here is applicable to various subcellular locations, but tailored for predicting whether or not a protein is localized in mitochondria. Knowledge of the mitochondrial proteome is relevant to understanding the role of this organelle in global cellular processes.

Results

In order to develop a method for enhanced prediction of subcellular localization, we integrated the outputs of available localization prediction tools by several strategies, and tested the performance of each strategy with known mitochondrial proteins. The accuracy obtained (up to 92%) surpasses by far the individual tools. The method of integration proved crucial to the performance. For the prediction of mitochondrion-located proteins, integration via a two-layer decision tree clearly outperforms simpler methods, as it allows emphasis of biologically relevant features such as the mitochondrial targeting peptide and transmembrane domains.

Conclusion

We developed an approach that enhances the prediction accuracy of mitochondrial proteins by uniting the strength of specialized tools. The combination of machine-learning based integration with biological expert knowledge leads to improved performance. This approach also alleviates the conundrum of how to choose between conflicting predictions. Our approach is easy to implement, and applicable to predicting subcellular locations other than mitochondria, as well as other biological features. For a trial of our approach, we provide a webservice for mitochondrial protein prediction (named YimLOC), which can be accessed through the AnaBench suite at http://anabench.bcm.umontreal.ca/anabench/. The source code is provided in the Additional File 2.

SUBMITTER: Shen YQ 

PROVIDER: S-EPMC2176073 | biostudies-literature | 2007 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

'Unite and conquer': enhanced prediction of protein subcellular localization by integrating multiple specialized tools.

Shen Yao Qing YQ   Burger Gertraud G  

BMC bioinformatics 20071029


<h4>Background</h4>Knowing the subcellular location of proteins provides clues to their function as well as the interconnectivity of biological processes. Dozens of tools are available for predicting protein location in the eukaryotic cell. Each tool performs well on certain data sets, but their predictions often disagree for a given protein. Since the individual tools each have particular strengths, we set out to integrate them in a way that optimally exploits their potential. The method we pre  ...[more]

Similar Datasets

| S-EPMC2685389 | biostudies-literature
| S-EPMC2745392 | biostudies-literature
| S-EPMC1182350 | biostudies-literature
| S-EPMC2582614 | biostudies-literature
| S-EPMC7764902 | biostudies-literature
| S-EPMC3000424 | biostudies-literature
| S-EPMC7604748 | biostudies-literature
| S-EPMC6680503 | biostudies-literature