Dataset Information

Prediction of protein modification sites of pyrrolidone carboxylic acid using mRMR feature selection and analysis.


ABSTRACT: Pyrrolidone carboxylic acid (PCA) is formed during a common post-translational modification (PTM) of extracellular and multi-pass membrane proteins. In this study, we developed a new predictor of PCA modification sites based on maximum relevance minimum redundancy (mRMR) and incremental feature selection (IFS). We incorporated 727 features belonging to 7 kinds of protein properties: sequence conservation, residual disorder, amino acid factor, secondary structure and solvent accessibility, gain/loss of amino acid during evolution, propensity of amino acid to be conserved at protein-protein interface and protein surface, and deviation of side chain carbon atom number. Of these 727 features, 244 were selected by mRMR and IFS as the optimized feature set, with which the prediction model achieved a maximum Matthews correlation coefficient (MCC) of 0.7812. Feature analysis showed that all feature types contributed to the modification process. Further site-specific feature analysis showed that features derived from the sites surrounding PCA contributed more to the determination of PCA sites than features from other sites. The detailed feature analysis in this paper might provide important clues for understanding the mechanism of PCA formation and guide relevant experimental validations.
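The mRMR-then-IFS procedure named in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration, not the authors' implementation: the function names are hypothetical, the features are treated as discrete so that mutual information can be estimated by counting, mRMR is written in its common "relevance minus mean redundancy" form, the classifier used inside IFS is a leave-one-out 1-nearest-neighbour rule (the abstract does not name the classifier), and the four-feature toy data set is invented.

```python
from collections import Counter
from math import log2, sqrt


def mutual_information(xs, ys):
    """Mutual information, in bits, between two equal-length discrete sequences."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    return sum((c / n) * log2(c * n / (px[x] * py[y])) for (x, y), c in pxy.items())


def mrmr_rank(columns, labels):
    """Greedy mRMR ranking: repeatedly pick the feature with the highest
    relevance to the labels minus mean redundancy with features already picked."""
    relevance = [mutual_information(col, labels) for col in columns]
    remaining, ranked = list(range(len(columns))), []

    def score(j):
        if not ranked:
            return relevance[j]
        redundancy = sum(mutual_information(columns[j], columns[k]) for k in ranked)
        return relevance[j] - redundancy / len(ranked)

    while remaining:
        best = max(remaining, key=score)
        ranked.append(best)
        remaining.remove(best)
    return ranked


def mcc(y_true, y_pred):
    """Matthews correlation coefficient for binary labels (1 = positive)."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    tn = sum(t == 0 and p == 0 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0


def loo_nearest_neighbour(rows, labels):
    """Leave-one-out 1-NN with Hamming distance (assumed classifier, see lead-in)."""
    preds = []
    for i, row in enumerate(rows):
        others = (j for j in range(len(rows)) if j != i)
        nearest = min(others, key=lambda j: sum(a != b for a, b in zip(row, rows[j])))
        preds.append(labels[nearest])
    return preds


def incremental_feature_selection(columns, labels, ranked):
    """IFS: evaluate the top-1, top-2, ... ranked features and keep the
    smallest prefix that reaches the highest MCC."""
    best_k, best_score = 0, -2.0
    for k in range(1, len(ranked) + 1):
        rows = list(zip(*(columns[j] for j in ranked[:k])))
        score = mcc(labels, loo_nearest_neighbour(rows, labels))
        if score > best_score:
            best_k, best_score = k, score
    return ranked[:best_k], best_score


# Invented toy data: 8 candidate sites, 4 binary features stored column-wise.
labels = [1, 1, 1, 1, 0, 0, 0, 0]
columns = [
    [1, 1, 1, 1, 0, 0, 0, 1],  # informative, one error
    [1, 1, 1, 1, 0, 0, 0, 1],  # exact duplicate of the first feature
    [1, 1, 0, 1, 0, 0, 1, 0],  # weaker, but less redundant
    [0, 1, 0, 1, 0, 1, 0, 1],  # unrelated to the labels
]

ranked = mrmr_rank(columns, labels)
selected, best_mcc = incremental_feature_selection(columns, labels, ranked)
print("mRMR order:", ranked)
print("IFS selection:", selected, "MCC:", round(best_mcc, 4))
```

On this toy input the duplicated column should be ranked below the weaker but less redundant one, which is the behaviour the redundancy term is meant to produce. In the study itself the same idea is applied to 727 features, of which 244 were retained.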

SUBMITTER: Zheng LL 

PROVIDER: S-EPMC3235115 | biostudies-literature | 2011

REPOSITORIES: biostudies-literature

Publications

Prediction of protein modification sites of pyrrolidone carboxylic acid using mRMR feature selection and analysis.

Zheng LL, Niu S, Hao P, Feng K, Cai YD, Li Y

PLoS ONE, 2011-12-09, issue 12


Similar Datasets

| S-EPMC3376124 | biostudies-other
| S-EPMC5133563 | biostudies-literature
| S-EPMC4029432 | biostudies-literature
| S-EPMC4164654 | biostudies-literature
| S-EPMC4145740 | biostudies-literature
| S-EPMC3429425 | biostudies-literature
| S-EPMC7512587 | biostudies-literature
| S-EPMC4731830 | biostudies-literature
| S-EPMC7287073 | biostudies-literature
| S-EPMC5410141 | biostudies-literature