Unknown

Dataset Information

0

Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm.


ABSTRACT:

Background

Because a priori knowledge about function of G protein-coupled receptors (GPCRs) can provide useful information to pharmaceutical research, the determination of their function is a quite meaningful topic in protein science. However, with the rapid increase of GPCRs sequences entering into databanks, the gap between the number of known sequence and the number of known function is widening rapidly, and it is both time-consuming and expensive to determine their function based only on experimental techniques. Therefore, it is vitally significant to develop a computational method for quick and accurate classification of GPCRs.

Results

In this study, a novel three-layer predictor based on support vector machine (SVM) and feature selection is developed for predicting and classifying GPCRs directly from amino acid sequence data. The maximum relevance minimum redundancy (mRMR) is applied to pre-evaluate features with discriminative information while genetic algorithm (GA) is utilized to find the optimized feature subsets. SVM is used for the construction of classification models. The overall accuracy with three-layer predictor at levels of superfamily, family and subfamily are obtained by cross-validation test on two non-redundant dataset. The results are about 0.5% to 16% higher than those of GPCR-CA and GPCRPred.

Conclusion

The results with high success rates indicate that the proposed predictor is a useful automated tool in predicting GPCRs. GPCR-SVMFS, a corresponding executable program for GPCRs prediction and classification, can be acquired freely on request from the authors.

SUBMITTER: Li Z 

PROVIDER: S-EPMC2905366 | biostudies-literature | 2010 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm.

Li Zhanchao Z   Zhou Xuan X   Dai Zong Z   Zou Xiaoyong X  

BMC bioinformatics 20100616


<h4>Background</h4>Because a priori knowledge about function of G protein-coupled receptors (GPCRs) can provide useful information to pharmaceutical research, the determination of their function is a quite meaningful topic in protein science. However, with the rapid increase of GPCRs sequences entering into databanks, the gap between the number of known sequence and the number of known function is widening rapidly, and it is both time-consuming and expensive to determine their function based onl  ...[more]

Similar Datasets

| S-EPMC4756144 | biostudies-literature
| S-EPMC5209828 | biostudies-literature
| S-EPMC4029432 | biostudies-literature
| S-EPMC4537225 | biostudies-literature
| S-EPMC8382032 | biostudies-literature
| S-EPMC4057401 | biostudies-literature
| S-EPMC6457544 | biostudies-literature
| S-EPMC4395415 | biostudies-other
| S-EPMC4183366 | biostudies-literature
| S-EPMC6377146 | biostudies-literature