Dataset Information

Prediction of protein S-nitrosylation sites based on adapted normal distribution bi-profile Bayes and Chou's pseudo amino acid composition.

ABSTRACT: Protein S-nitrosylation is a reversible post-translational modification by covalent modification on the thiol group of cysteine residues by nitric oxide. Growing evidence shows that protein S-nitrosylation plays an important role in normal cellular function as well as in various pathophysiologic conditions. Because of the inherent chemical instability of the S-NO bond and the low abundance of endogenous S-nitrosylated proteins, the unambiguous identification of S-nitrosylation sites by commonly used proteomic approaches remains challenging. Therefore, computational prediction of S-nitrosylation sites has been considered as a powerful auxiliary tool. In this work, we mainly adopted an adapted normal distribution bi-profile Bayes (ANBPB) feature extraction model to characterize the distinction of position-specific amino acids in 784 S-nitrosylated and 1568 non-S-nitrosylated peptide sequences. We developed a support vector machine prediction model, iSNO-ANBPB, by incorporating ANBPB with the Chou's pseudo amino acid composition. In jackknife cross-validation experiments, iSNO-ANBPB yielded an accuracy of 65.39% and a Matthew's correlation coefficient (MCC) of 0.3014. When tested on an independent dataset, iSNO-ANBPB achieved an accuracy of 63.41% and a MCC of 0.2984, which are much higher than the values achieved by the existing predictors SNOSite, iSNO-PseAAC, the Li et al. algorithm, and iSNO-AAPair. On another training dataset, iSNO-ANBPB also outperformed GPS-SNO and iSNO-PseAAC in the 10-fold crossvalidation test.

SUBMITTER: Jia C

PROVIDER: S-EPMC4100159 | biostudies-literature | 2014 Jun

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Prediction of protein S-nitrosylation sites based on adapted normal distribution bi-profile Bayes and Chou's pseudo amino acid composition.

Jia Cangzhi C Lin Xin X Wang Zhiping Z

International journal of molecular sciences 20140610 6

Protein S-nitrosylation is a reversible post-translational modification by covalent modification on the thiol group of cysteine residues by nitric oxide. Growing evidence shows that protein S-nitrosylation plays an important role in normal cellular function as well as in various pathophysiologic conditions. Because of the inherent chemical instability of the S-NO bond and the low abundance of endogenous S-nitrosylated proteins, the unambiguous identification of S-nitrosylation sites by commonly ...[more]

PMID: 24918295

Dataset Information

Prediction of protein S-nitrosylation sites based on adapted normal distribution bi-profile Bayes and Chou's pseudo amino acid composition.

Publications

Prediction of protein S-nitrosylation sites based on adapted normal distribution bi-profile Bayes and Chou's pseudo amino acid composition.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Identify Lysine Neddylation Sites Using Bi-profile Bayes Feature Extraction <i>via</i> the Chou's 5-steps Rule and General Pseudo Components.
| S-EPMC7290059 | biostudies-literature

Computational identification of protein methylation sites through bi-profile Bayes feature extraction.
| S-EPMC2654709 | biostudies-literature

iSNO-PseAAC: predict cysteine S-nitrosylation sites in proteins by incorporating position specific amino acid propensity into pseudo amino acid composition.
| S-EPMC3567014 | biostudies-literature

PSNO: predicting cysteine S-nitrosylation sites by incorporating various sequence-derived features into the general form of Chou's PseAAC.
| S-EPMC4139777 | biostudies-literature

iSulfoTyr-PseAAC: Identify Tyrosine Sulfation Sites by Incorporating Statistical Moments <i>via</i> Chou's 5-steps Rule and Pseudo Components.
| S-EPMC6983959 | biostudies-literature

iMethyl-PseAAC: identification of protein methylation sites via a pseudo amino acid composition approach.
| S-EPMC4054830 | biostudies-literature

iNitro-Tyr: prediction of nitrotyrosine sites in proteins with general pseudo amino acid composition.
| S-EPMC4133382 | biostudies-literature

iSNO-AAPair: incorporating amino acid pairwise coupling into PseAAC for predicting cysteine S-nitrosylation sites in proteins.
| S-EPMC3792191 | biostudies-literature

A counting renaissance: combining stochastic mapping and empirical Bayes to quickly detect amino acid sites under positive selection.
| S-EPMC3579240 | biostudies-literature

iSUMOK-PseAAC: prediction of lysine sumoylation sites using statistical moments and Chou's PseAAC.
| S-EPMC8349168 | biostudies-literature