Dataset Information

TESTLoc: protein subcellular localization prediction from EST data.


ABSTRACT:

Background

The eukaryotic cell has an intricate architecture with compartments and substructures dedicated to particular biological processes. Knowing the subcellular location of proteins not only indicates how biological processes are organized across cellular compartments, but also helps unravel the function of individual proteins. Computational localization prediction is possible based on sequence information alone, and has been successfully applied to proteins from virtually all subcellular compartments and all domains of life. However, we found that current prediction tools do not perform well on partial protein sequences such as those inferred from Expressed Sequence Tag (EST) data, which limits the exploitation of the largest and taxonomically most comprehensive body of sequence information from eukaryotes.

Results

We developed a new predictor, TESTLoc, suited for subcellular localization prediction of proteins based on their partial sequences conceptually translated from ESTs (EST-peptides). A Support Vector Machine (SVM) is used as the computational method, and EST-peptides are represented by features such as amino acid composition and physicochemical properties. When TESTLoc was applied to the most challenging test case (plant data), it yielded high accuracy (~85%).
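
To make the feature representation concrete, the following is a minimal sketch of how an amino acid composition vector could be computed for a peptide. It is not TESTLoc's implementation: the function name `amino_acid_composition`, the handling of non-standard residues, and the example peptide are all assumptions made for illustration. A vector like this is the kind of fixed-length input an SVM classifier could be trained on.

```python
from collections import Counter

# The 20 standard amino acids, in a fixed order so every vector is comparable.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def amino_acid_composition(peptide):
    """Return the fraction of each standard amino acid in `peptide`.

    Assumption for this sketch: residues outside the 20 standard letters
    (e.g. 'X' from ambiguous translation) are ignored.
    """
    counts = Counter(ch for ch in peptide.upper() if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        # No usable residues: return an all-zero vector of the same length.
        return [0.0] * len(AMINO_ACIDS)
    return [counts[aa] / total for aa in AMINO_ACIDS]


# Hypothetical EST-peptide fragment, used only to show the output shape.
features = amino_acid_composition("MKKLLAXA")
```

The result is a 20-element list of fractions that sum to 1 whenever the peptide contains at least one standard residue, regardless of peptide length, which is why composition features are convenient for partial sequences.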

Conclusions

TESTLoc is a localization prediction tool tailored for EST data. It provides a variety of models for users to choose from, and is available for download at http://megasun.bch.umontreal.ca/~shenyq/TESTLoc/TESTLoc.html.

SUBMITTER: Shen YQ 

PROVIDER: S-EPMC3000424 | biostudies-literature | 2010 Nov

REPOSITORIES: biostudies-literature

Publications

TESTLoc: protein subcellular localization prediction from EST data.

Shen Yao-Qing, Burger Gertraud

BMC Bioinformatics, 2010 Nov 15


Similar Datasets

| S-EPMC7214030 | biostudies-literature
| S-EPMC7764902 | biostudies-literature
| S-EPMC2648781 | biostudies-literature
| S-EPMC7604748 | biostudies-literature
| S-EPMC1182350 | biostudies-literature
| S-EPMC2788359 | biostudies-literature
| S-EPMC9252801 | biostudies-literature
| S-EPMC2859129 | biostudies-literature
| S-EPMC1289393 | biostudies-literature
| S-EPMC2040162 | biostudies-literature