Unknown

Dataset Information

0

When a domain is not a domain, and why it is important to properly filter proteins in databases: conflicting definitions and fold classification systems for structural domains make filtering of such databases imperative.


ABSTRACT: Membership in a protein domain database does not a domain make; a feature we realized when generating a consensus view of protein fold space with our consensus domain dictionary (CDD). This dictionary was used to select representative structures for characterization of the protein dynameome: the Dynameomics initiative. Through this endeavor we rejected a surprising 40% of the 1,695 folds in the CDD as being non-autonomous folding units. Although some of this was due to the challenges of grouping similar fold topologies, the dissonance between the cataloguing and structural qualification of protein domains remains surprising. Another potential factor is previously overlooked intrinsic disorder; predictions suggest that 40% of proteins have either local or global disorder. One thing is clear, filtering a structural database and ensuring a consistent definition for protein domains is crucial, and caution is prescribed when generalizations of globular domains are drawn from unfiltered protein domain datasets.

SUBMITTER: Towse CL 

PROVIDER: S-EPMC3576730 | biostudies-literature | 2012 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

When a domain is not a domain, and why it is important to properly filter proteins in databases: conflicting definitions and fold classification systems for structural domains make filtering of such databases imperative.

Towse Clare-Louise CL   Daggett Valerie V  

BioEssays : news and reviews in molecular, cellular and developmental biology 20121026 12


Membership in a protein domain database does not a domain make; a feature we realized when generating a consensus view of protein fold space with our consensus domain dictionary (CDD). This dictionary was used to select representative structures for characterization of the protein dynameome: the Dynameomics initiative. Through this endeavor we rejected a surprising 40% of the 1,695 folds in the CDD as being non-autonomous folding units. Although some of this was due to the challenges of grouping  ...[more]

Similar Datasets

| S-EPMC10785526 | biostudies-literature
| S-EPMC7809278 | biostudies-literature
| S-EPMC3534397 | biostudies-literature
| S-EPMC4643269 | biostudies-other
| S-EPMC5878146 | biostudies-literature
| S-EPMC8259355 | biostudies-literature
| S-EPMC6461583 | biostudies-literature
| S-EPMC6589474 | biostudies-literature
| S-EPMC8571870 | biostudies-literature
| S-EPMC153506 | biostudies-literature