Unknown

Dataset Information

0

DeepCSO: A Deep-Learning Network Approach to Predicting Cysteine S-Sulphenylation Sites.


ABSTRACT: Cysteine S-sulphenylation (CSO), as a novel post-translational modification (PTM), has emerged as a potential mechanism to regulate protein functions and affect signal networks. Because of its functional significance, several prediction approaches have been developed. Nevertheless, they are based on a limited dataset from Homo sapiens and there is a lack of prediction tools for the CSO sites of other species. Recently, this modification has been investigated at the proteomics scale for a few species and the number of identified CSO sites has significantly increased. Thus, it is essential to explore the characteristics of this modification across different species and construct prediction models with better performances based on the enlarged dataset. In this study, we constructed several classifiers and found that the long short-term memory model with the word-embedding encoding approach, dubbed LSTM WE , performs favorably to the traditional machine-learning models and other deep-learning models across different species, in terms of cross-validation and independent test. The area under the receiver operating characteristic (ROC) curve for LSTM WE ranged from 0.82 to 0.85 for different organisms, which was superior to the reported CSO predictors. Moreover, we developed the general model based on the integrated data from different species and it showed great universality and effectiveness. We provided the on-line prediction service called DeepCSO that included both species-specific and general models, which is accessible through http://www.bioinfogo.org/DeepCSO.

SUBMITTER: Lyu X 

PROVIDER: S-EPMC7736615 | biostudies-literature | 2020

REPOSITORIES: biostudies-literature

altmetric image

Publications

DeepCSO: A Deep-Learning Network Approach to Predicting Cysteine S-Sulphenylation Sites.

Lyu Xiaru X   Li Shuhao S   Jiang Chunyang C   He Ningning N   Chen Zhen Z   Zou Yang Y   Li Lei L  

Frontiers in cell and developmental biology 20201201


Cysteine S-sulphenylation (CSO), as a novel post-translational modification (PTM), has emerged as a potential mechanism to regulate protein functions and affect signal networks. Because of its functional significance, several prediction approaches have been developed. Nevertheless, they are based on a limited dataset from <i>Homo sapiens</i> and there is a lack of prediction tools for the CSO sites of other species. Recently, this modification has been investigated at the proteomics scale for a  ...[more]

Similar Datasets

| S-EPMC8599298 | biostudies-literature
| S-EPMC5870772 | biostudies-literature
| S-EPMC6411950 | biostudies-literature
| S-EPMC8915183 | biostudies-literature
| S-EPMC8830423 | biostudies-literature
| S-EPMC10775615 | biostudies-literature
| S-EPMC8264445 | biostudies-literature
| S-EPMC8151217 | biostudies-literature
| S-EPMC9778682 | biostudies-literature
| S-EPMC8576272 | biostudies-literature