Dataset Information

PPero, a Computational Model for Plant PTS1 Type Peroxisomal Protein Prediction.


ABSTRACT: Well-defined motifs often make it easy to investigate protein function and localization. In plants, peroxisomal proteins are guided to peroxisomes mainly by a conserved type 1 (PTS1) or type 2 (PTS2) targeting signal, and the PTS1 motif is commonly used to predict peroxisome-targeted proteins. Current computational prediction of peroxisome-targeted PTS1-type proteins is mostly based on the three-amino-acid PTS1 motif and an adjacent sequence of fewer than 14 amino acid residues. The potential contribution of the adjacent sequences beyond this short region has not been well investigated in plants. In this work, we develop a bi-profile Bayesian SVM method to extract and learn position-based amino acid features for both PTS1 motifs and their extended adjacent sequences in plants. Our proposed model outperformed other implementations with similar applications and achieved the highest accuracy, 93.6% for Arabidopsis and 92.6% for other plant species. A large-scale analysis of the Arabidopsis, rice, maize, potato, wheat, and soybean proteomes was conducted using the proposed model, and a batch of candidate PTS1 proteins was predicted. The DNA segments corresponding to the C-terminal sequences of 9 selected candidates were cloned and transformed into Arabidopsis for experimental validation, and 5 of them demonstrated peroxisome targeting.
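
The position-based feature extraction named in the abstract can be illustrated with a minimal Python sketch of bi-profile Bayes encoding as it is commonly described: each residue of a C-terminal window is replaced by its position-specific frequency in a positive training set and in a negative training set, and the two vectors are concatenated before being passed to an SVM. This is not the PPero implementation. The function names, the pseudocount, the 5-residue window, and the toy sequences are all assumptions made for illustration, and the SVM training step is omitted.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def position_profile(seqs, pseudocount=1.0):
    """Per-position amino acid frequencies for equal-length sequences.

    The pseudocount is an illustrative smoothing choice, not a value
    taken from the paper.
    """
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("all sequences must have the same length")
    total = len(seqs) + pseudocount * len(AMINO_ACIDS)
    profile = []
    for i in range(length):
        counts = Counter(s[i] for s in seqs)
        profile.append(
            {aa: (counts[aa] + pseudocount) / total for aa in AMINO_ACIDS}
        )
    return profile


def bi_profile_features(seq, pos_profile, neg_profile):
    """Encode seq as [positive-profile values] + [negative-profile values]."""
    if len(seq) != len(pos_profile) or len(seq) != len(neg_profile):
        raise ValueError("sequence length must match the profile length")
    # Residues outside the 20 standard amino acids score 0.0.
    pos = [pos_profile[i].get(aa, 0.0) for i, aa in enumerate(seq)]
    neg = [neg_profile[i].get(aa, 0.0) for i, aa in enumerate(seq)]
    return pos + neg


# Hypothetical toy windows (last 5 residues), not data from the study.
positives = ["AKSKL", "GRSRL", "PLSKL"]
negatives = ["AKDEL", "GTPVA", "LLKQE"]

pos_prof = position_profile(positives)
neg_prof = position_profile(negatives)
features = bi_profile_features("QASKL", pos_prof, neg_prof)
print(len(features))
```

The resulting vector has twice as many entries as the window has residues, so extending the window to cover longer adjacent sequences only lengthens the vector. Any standard SVM implementation could then be trained on such vectors.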

SUBMITTER: Wang J 

PROVIDER: S-EPMC5207514 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC20933 | biostudies-literature
| S-EPMC6584649 | biostudies-literature
| S-EPMC3427985 | biostudies-literature
| S-EPMC7037794 | biostudies-literature
| S-EPMC1794383 | biostudies-literature
| S-EPMC168944 | biostudies-literature
| S-EPMC3599296 | biostudies-literature
| S-EPMC1325227 | biostudies-literature
| S-EPMC3566003 | biostudies-literature
| S-EPMC6767901 | biostudies-literature