Dataset Information

Non-Alignment Features Based Enzyme/Non-Enzyme Classification Using an Ensemble Method.


ABSTRACT: As a growing number of protein structures are resolved without known functions, computational methods that help predict protein function from structure are becoming increasingly important. Some computational methods predict protein function by aligning a protein to homologous proteins with known functions, but they fail if no such homology can be identified. In this paper we classify enzymes and non-enzymes using non-alignment features. We propose a new ensemble method that combines three support vector machines (SVMs) and two k-nearest neighbor (k-NN) classifiers under a simple majority voting rule. On a data set of 697 enzymes and 480 non-enzymes adapted from Dobson and Doig, the method achieves 85.59% accuracy in 10-fold cross-validation and 86.49% accuracy in leave-one-out validation. This accuracy is much better than that of other methods based on non-alignment features and even slightly better than that of methods based on alignment features. To our knowledge, this is the first use of an ensemble method to classify enzymes and non-enzymes, and it outperforms a single classifier.
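The simple majority voting rule mentioned in the abstract can be illustrated with a minimal sketch. The code below is not the authors' implementation: the function names (`majority_vote`, `ensemble_predict`), the label strings, and the per-classifier outputs are all hypothetical, and the five base classifiers are represented only by made-up prediction lists rather than trained SVM or k-NN models. It shows only how five binary votes per protein could be combined.

```python
from collections import Counter
from typing import List, Sequence


def majority_vote(votes: Sequence[str]) -> str:
    """Return the label cast by the most voters.

    With an odd number of voters and two labels, a tie cannot occur.
    """
    if not votes:
        raise ValueError("need at least one vote")
    return Counter(votes).most_common(1)[0][0]


def ensemble_predict(per_classifier: Sequence[Sequence[str]]) -> List[str]:
    """Combine predictions, where per_classifier[i][j] is the label
    that classifier i assigns to protein j."""
    return [majority_vote(column) for column in zip(*per_classifier)]


# Hypothetical outputs of three SVMs and two k-NN classifiers
# for three proteins (illustrative values, not data from the paper).
svm_a = ["enzyme", "enzyme", "non-enzyme"]
svm_b = ["enzyme", "non-enzyme", "non-enzyme"]
svm_c = ["non-enzyme", "enzyme", "non-enzyme"]
knn_a = ["enzyme", "non-enzyme", "enzyme"]
knn_b = ["enzyme", "non-enzyme", "non-enzyme"]

combined = ensemble_predict([svm_a, svm_b, svm_c, knn_a, knn_b])
print(combined)  # ['enzyme', 'non-enzyme', 'non-enzyme']
```

Using an odd number of voters (five here, matching the three SVMs plus two k-NN classifiers in the abstract) guarantees that a two-class vote always has a strict majority, so no tie-breaking rule is needed.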

SUBMITTER: Davidson NJ 

PROVIDER: S-EPMC3091888 | biostudies-literature | 2010 Dec

REPOSITORIES: biostudies-literature

Publications

Non-Alignment Features Based Enzyme/Non-Enzyme Classification Using an Ensemble Method.

Davidson Nicholas J, Wang Xueyi

Proceedings of the ... International Conference on Machine Learning and Applications. International Conference on Machine Learning and Applications, 2010-12-01


Similar Datasets

| S-EPMC8177489 | biostudies-literature
| S-EPMC10148686 | biostudies-literature
| S-EPMC7556384 | biostudies-literature
| S-EPMC6480413 | biostudies-literature
| S-EPMC3568092 | biostudies-literature
| S-EPMC6862210 | biostudies-literature
| S-EPMC6550425 | biostudies-literature
| S-EPMC3623732 | biostudies-literature
| S-EPMC4165735 | biostudies-literature
| S-EPMC8320732 | biostudies-literature