Unknown

Dataset Information

0

A framework for protein structure classification and identification of novel protein structures.


ABSTRACT: Protein structure classification plays a central role in understanding the function of a protein molecule with respect to all known proteins in a structure database. With the rapid increase in the number of new protein structures, the need for automated and accurate methods for protein classification is increasingly important.In this paper we present a unified framework for protein structure classification and identification of novel protein structures. The framework consists of a set of components for comparing, classifying, and clustering protein structures. These components allow us to accurately classify proteins into known folds, to detect new protein folds, and to provide a way of clustering the new folds. In our evaluation with SCOP 1.69, our method correctly classifies 86.0%, 87.7%, and 90.5% of new domains at family, superfamily, and fold levels. Furthermore, for protein domains that belong to new domain families, our method is able to produce clusters that closely correspond to the new families in SCOP 1.69. As a result, our method can also be used to suggest new classification groups that contain novel folds.We have developed a method called proCC for automatically classifying and clustering domains. The method is effective in classifying new domains and suggesting new domain families, and it is also very efficient. A web site offering access to proCC is freely available at http://www.eecs.umich.edu/periscope/procc.

SUBMITTER: Kim YJ 

PROVIDER: S-EPMC1622760 | biostudies-literature | 2006 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

A framework for protein structure classification and identification of novel protein structures.

Kim You Jung YJ   Patel Jignesh M JM  

BMC bioinformatics 20061016


<h4>Background</h4>Protein structure classification plays a central role in understanding the function of a protein molecule with respect to all known proteins in a structure database. With the rapid increase in the number of new protein structures, the need for automated and accurate methods for protein classification is increasingly important.<h4>Results</h4>In this paper we present a unified framework for protein structure classification and identification of novel protein structures. The fra  ...[more]

Similar Datasets

| S-EPMC5980557 | biostudies-literature
| S-EPMC10991292 | biostudies-literature
| S-EPMC8172868 | biostudies-literature
| S-EPMC6969538 | biostudies-literature
| S-EPMC2654728 | biostudies-literature
| S-EPMC2877116 | biostudies-literature
| S-EPMC1526735 | biostudies-literature
| S-EPMC6031167 | biostudies-literature
| S-EPMC8896646 | biostudies-literature
| S-EPMC9079056 | biostudies-literature