Unknown

Dataset Information

0

Enhancing protein-vitamin binding residues prediction by multiple heterogeneous subspace SVMs ensemble.


ABSTRACT:

Background

Vitamins are typical ligands that play critical roles in various metabolic processes. The accurate identification of the vitamin-binding residues solely based on a protein sequence is of significant importance for the functional annotation of proteins, especially in the post-genomic era, when large volumes of protein sequences are accumulating quickly without being functionally annotated.

Results

In this paper, a new predictor called TargetVita is designed and implemented for predicting protein-vitamin binding residues using protein sequences. In TargetVita, features derived from the position-specific scoring matrix (PSSM), predicted protein secondary structure, and vitamin binding propensity are combined to form the original feature space; then, several feature subspaces are selected by performing different feature selection methods. Finally, based on the selected feature subspaces, heterogeneous SVMs are trained and then ensembled for performing prediction.

Conclusions

The experimental results obtained with four separate vitamin-binding benchmark datasets demonstrate that the proposed TargetVita is superior to the state-of-the-art vitamin-specific predictor, and an average improvement of 10% in terms of the Matthews correlation coefficient (MCC) was achieved over independent validation tests. The TargetVita web server and the datasets used are freely available for academic use at http://csbio.njust.edu.cn/bioinf/TargetVita or http://www.csbio.sjtu.edu.cn/bioinf/TargetVita.

SUBMITTER: Yu DJ 

PROVIDER: S-EPMC4261549 | biostudies-literature | 2014 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

Enhancing protein-vitamin binding residues prediction by multiple heterogeneous subspace SVMs ensemble.

Yu Dong-Jun DJ   Hu Jun J   Yan Hui H   Yang Xi-Bei XB   Yang Jing-Yu JY   Shen Hong-Bin HB  

BMC bioinformatics 20140905


<h4>Background</h4>Vitamins are typical ligands that play critical roles in various metabolic processes. The accurate identification of the vitamin-binding residues solely based on a protein sequence is of significant importance for the functional annotation of proteins, especially in the post-genomic era, when large volumes of protein sequences are accumulating quickly without being functionally annotated.<h4>Results</h4>In this paper, a new predictor called TargetVita is designed and implement  ...[more]

Similar Datasets

| S-EPMC3577447 | biostudies-literature
| S-EPMC6712585 | biostudies-literature
| S-EPMC5773889 | biostudies-literature
| S-EPMC4980076 | biostudies-literature
| S-EPMC3098787 | biostudies-literature
| S-EPMC3380730 | biostudies-literature
| S-EPMC4897909 | biostudies-literature
| S-EPMC2709252 | biostudies-literature