Unknown

Dataset Information

0

Rice_Phospho 1.0: a new rice-specific SVM predictor for protein phosphorylation sites.


ABSTRACT: Experimentally-determined or computationally-predicted protein phosphorylation sites for distinctive species are becoming increasingly common. In this paper, we compare the predictive performance of a novel classification algorithm with different encoding schemes to develop a rice-specific protein phosphorylation site predictor. Our results imply that the combination of Amino acid occurrence Frequency with Composition of K-Spaced Amino Acid Pairs (AF-CKSAAP) provides the best description of relevant sequence features that surround a phosphorylation site. A support vector machine (SVM) using AF-CKSAAP achieves the best performance in classifying rice protein phophorylation sites when compared to the other algorithms. We have used SVM with AF-CKSAAP to construct a rice-specific protein phosphorylation sites predictor, Rice_Phospho 1.0 (http://bioinformatics.fafu.edu.cn/rice_phospho1.0). We measure the Accuracy (ACC) and Matthews Correlation Coefficient (MCC) of Rice_Phospho 1.0 to be 82.0% and 0.64, significantly higher than those measures for other predictors such as Scansite, Musite, PlantPhos and PhosphoRice. Rice_Phospho 1.0 also successfully predicted the experimentally identified phosphorylation sites in LOC_Os03g51600.1, a protein sequence which did not appear in the training dataset. In summary, Rice_phospho 1.0 outputs reliable predictions of protein phosphorylation sites in rice, and will serve as a useful tool to the community.

SUBMITTER: Lin S 

PROVIDER: S-EPMC4493637 | biostudies-literature | 2015 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Rice_Phospho 1.0: a new rice-specific SVM predictor for protein phosphorylation sites.

Lin Shoukai S   Song Qi Q   Tao Huan H   Wang Wei W   Wan Weifeng W   Huang Jian J   Xu Chaoqun C   Chebii Vivien V   Kitony Justine J   Que Shufu S   Harrison Andrew A   He Huaqin H  

Scientific reports 20150707


Experimentally-determined or computationally-predicted protein phosphorylation sites for distinctive species are becoming increasingly common. In this paper, we compare the predictive performance of a novel classification algorithm with different encoding schemes to develop a rice-specific protein phosphorylation site predictor. Our results imply that the combination of Amino acid occurrence Frequency with Composition of K-Spaced Amino Acid Pairs (AF-CKSAAP) provides the best description of rele  ...[more]

Similar Datasets

| S-EPMC3395875 | biostudies-other
| S-EPMC6062547 | biostudies-literature
| S-EPMC1160232 | biostudies-literature
| S-EPMC4767366 | biostudies-literature
| S-EPMC5860208 | biostudies-literature
| S-EPMC2981550 | biostudies-literature
| S-EPMC1764470 | biostudies-literature
| S-EPMC3739059 | biostudies-literature
| S-EPMC11009412 | biostudies-literature
| S-EPMC4401752 | biostudies-literature