
Dataset Information


Prediction of N-linked glycosylation sites using position relative features and statistical moments.


ABSTRACT: Glycosylation is one of the most complex post-translational modifications in eukaryotic cells. Almost 50% of the human proteome is glycosylated, as glycosylation plays a vital role in various biological functions such as antigen recognition, cell-cell communication, gene expression and protein folding. Identifying glycosylation sites in protein sequences is a significant challenge because experimental methods are time-consuming and expensive, so a reliable computational method is desirable. In this study, a comprehensive technique for the identification of N-linked glycosylation sites is proposed using machine learning. The proposed predictor was trained on an up-to-date dataset using the backpropagation algorithm for a multilayer neural network. The results of ten-fold cross-validation and other performance measures such as accuracy, sensitivity, specificity and the Matthews correlation coefficient indicate that the proposed tool is far more accurate than existing systems such as GlycoMine, GlycoEP, Ensemble SVM and GPP.
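The four performance measures named in the abstract can all be derived from a binary confusion matrix. The following is a minimal sketch using the standard textbook definitions, not the authors' evaluation code. The function name `binary_metrics` and the example counts are hypothetical and are not taken from the study.

```python
import math


def binary_metrics(tp: int, tn: int, fp: int, fn: int) -> dict:
    """Compute accuracy, sensitivity, specificity and MCC from confusion counts.

    tp/fn count true glycosylation sites predicted as positive/negative;
    tn/fp count non-sites predicted as negative/positive.
    """
    total = tp + tn + fp + fn
    if total == 0:
        raise ValueError("confusion matrix is empty")

    accuracy = (tp + tn) / total
    # Sensitivity (recall): fraction of true sites that were recovered.
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    # Specificity: fraction of non-sites that were correctly rejected.
    specificity = tn / (tn + fp) if (tn + fp) else 0.0

    # Matthews correlation coefficient; by convention 0.0 when any
    # marginal sum is zero and the denominator vanishes.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0

    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "mcc": mcc,
    }


if __name__ == "__main__":
    # Made-up counts for illustration only; not results from the paper.
    for name, value in binary_metrics(tp=90, tn=80, fp=20, fn=10).items():
        print(f"{name}: {value:.3f}")
```

In a ten-fold cross-validation setting, one would typically compute these values per fold, or on the pooled confusion counts, and report the aggregate. MCC is often preferred over accuracy when positive and negative sites are imbalanced.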

SUBMITTER: Akmal MA 

PROVIDER: S-EPMC5552137 | biostudies-literature | 2017

REPOSITORIES: biostudies-literature


Publications

Prediction of N-linked glycosylation sites using position relative features and statistical moments.

Akmal MA, Rasool N, Khan YD

PLoS ONE, 2017-08-10, issue 8



Similar Datasets

| S-EPMC8349168 | biostudies-literature
| S-EPMC2651179 | biostudies-literature
| S-EPMC3770009 | biostudies-literature
| S-EPMC4494626 | biostudies-literature
| S-EPMC1150083 | biostudies-other
2021-04-07 | GSE171636 | GEO
| S-EPMC3053258 | biostudies-literature