Unknown

Dataset Information

0

JPPRED: Prediction of Types of J-Proteins from Imbalanced Data Using an Ensemble Learning Method.


ABSTRACT: Different types of J-proteins perform distinct functions in chaperone processes and diseases development. Accurate identification of types of J-proteins will provide significant clues to reveal the mechanism of J-proteins and contribute to developing drugs for diseases. In this study, an ensemble predictor called JPPRED for J-protein prediction is proposed with hybrid features, including split amino acid composition (SAAC), pseudo amino acid composition (PseAAC), and position specific scoring matrix (PSSM). To deal with the imbalanced benchmark dataset, the synthetic minority oversampling technique (SMOTE) and undersampling technique are applied. The average sensitivity of JPPRED based on above-mentioned individual feature spaces lies in the range of 0.744-0.851, indicating the discriminative power of these features. In addition, JPPRED yields the highest average sensitivity of 0.875 using the hybrid feature spaces of SAAC, PseAAC, and PSSM. Compared to individual base classifiers, JPPRED obtains more balanced and better performance for each type of J-proteins. To evaluate the prediction performance objectively, JPPRED is compared with previous study. Encouragingly, JPPRED obtains balanced performance for each type of J-proteins, which is significantly superior to that of the existing method. It is anticipated that JPPRED can be a potential candidate for J-protein prediction.

SUBMITTER: Zhang L 

PROVIDER: S-EPMC4637456 | biostudies-literature | 2015

REPOSITORIES: biostudies-literature

altmetric image

Publications

JPPRED: Prediction of Types of J-Proteins from Imbalanced Data Using an Ensemble Learning Method.

Zhang Lina L   Zhang Chengjin C   Gao Rui R   Yang Runtao R  

BioMed research international 20151026


Different types of J-proteins perform distinct functions in chaperone processes and diseases development. Accurate identification of types of J-proteins will provide significant clues to reveal the mechanism of J-proteins and contribute to developing drugs for diseases. In this study, an ensemble predictor called JPPRED for J-protein prediction is proposed with hybrid features, including split amino acid composition (SAAC), pseudo amino acid composition (PseAAC), and position specific scoring ma  ...[more]

Similar Datasets

| S-EPMC7354782 | biostudies-literature
| S-EPMC4713117 | biostudies-literature
| S-EPMC8019903 | biostudies-literature
| S-EPMC8515573 | biostudies-literature
| S-EPMC7911732 | biostudies-literature
| S-EPMC2832827 | biostudies-literature
| S-EPMC8281595 | biostudies-literature
| S-EPMC2808167 | biostudies-literature
| S-EPMC5752022 | biostudies-literature
| S-EPMC4783950 | biostudies-literature