Unknown

Dataset Information

0

Analysis and prediction of human acetylation using a cascade classifier based on support vector machine.


ABSTRACT: BACKGROUND:Acetylation on lysine is a widespread post-translational modification which is reversible and plays a crucial role in some biological activities. To better understand the mechanism, it is necessary to identify acetylation sites in proteins accurately. Computational methods are popular because they are more convenient and faster than experimental methods. In this study, we proposed a new computational method to predict acetylation sites in human by combining sequence features and structural features including physicochemical property (PCP), position specific score matrix (PSSM), auto covariation (AC), residue composition (RC), secondary structure (SS) and accessible surface area (ASA), which can well characterize the information of acetylated lysine sites. Besides, a two-step feature selection was applied, which combined mRMR and IFS. It finally trained a cascade classifier based on SVM, which successfully solved the imbalance between positive samples and negative samples and covered all negative sample information. RESULTS:The performance of this method is measured with a specificity of 72.19% and a sensibility of 76.71% on independent dataset which shows that a cascade SVM classifier outperforms single SVM classifier. CONCLUSIONS:In addition to the analysis of experimental results, we also made a systematic and comprehensive analysis of the acetylation data.

SUBMITTER: Ning Q 

PROVIDER: S-EPMC6580503 | biostudies-literature | 2019 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Analysis and prediction of human acetylation using a cascade classifier based on support vector machine.

Ning Qiao Q   Yu Miao M   Ji Jinchao J   Ma Zhiqiang Z   Zhao Xiaowei X  

BMC bioinformatics 20190617 1


<h4>Background</h4>Acetylation on lysine is a widespread post-translational modification which is reversible and plays a crucial role in some biological activities. To better understand the mechanism, it is necessary to identify acetylation sites in proteins accurately. Computational methods are popular because they are more convenient and faster than experimental methods. In this study, we proposed a new computational method to predict acetylation sites in human by combining sequence features a  ...[more]

Similar Datasets

| S-EPMC5627885 | biostudies-literature
| S-EPMC8647811 | biostudies-literature
| S-EPMC2220009 | biostudies-literature
| S-EPMC1594580 | biostudies-literature
| S-EPMC8382032 | biostudies-literature
| S-EPMC5788611 | biostudies-literature
| S-EPMC4308892 | biostudies-literature
| S-EPMC2627892 | biostudies-other
| S-EPMC5410141 | biostudies-literature