Dataset Information

A gold standard set of mechanistically diverse enzyme superfamilies.


ABSTRACT: Superfamily and family analyses provide an effective tool for the functional classification of proteins, but must be automated for use on large datasets. We describe a 'gold standard' set of enzyme superfamilies, clustered according to specific sequence, structure, and functional criteria, for use in the validation of family and superfamily clustering methods. The gold standard set represents four fold classes and differing clustering difficulties, and includes five superfamilies, 91 families, 4,887 sequences and 282 structures.

SUBMITTER: Brown SD 

PROVIDER: S-EPMC1431709 | biostudies-literature | 2006

REPOSITORIES: biostudies-literature

Publications

A gold standard set of mechanistically diverse enzyme superfamilies.

Shoshana D Brown, John A Gerlt, Jennifer L Seffernick, Patricia C Babbitt

Genome Biology, 2006-01-31 (1)


Similar Datasets

| S-EPMC3551608 | biostudies-literature
| S-EPMC7820859 | biostudies-literature
| S-EPMC2536635 | biostudies-literature
| S-EPMC5575885 | biostudies-literature
| S-EPMC2453236 | biostudies-literature
| S-EPMC2631154 | biostudies-literature
| S-EPMC10251251 | biostudies-literature
| S-EPMC3291543 | biostudies-literature
| S-EPMC2781113 | biostudies-literature
| S-EPMC2721411 | biostudies-literature