(φ,ψ)₂ motifs: a purely conformation-based fine-grained enumeration of protein parts at the two-residue level.
ABSTRACT: A deep understanding of protein structure benefits from the use of a variety of classification strategies that enhance our ability to effectively describe local patterns of conformation. Here, we use a clustering algorithm to analyze 76,533 all-trans segments from protein structures solved at 1.2 Å resolution or better to create a purely φ,ψ-based comprehensive empirical categorization of common conformations adopted by two adjacent φ,ψ pairs (i.e., (φ,ψ)₂ motifs). The clustering algorithm works in an origin-shifted four-dimensional space based on the two φ,ψ pairs to yield a parameter-dependent list of (φ,ψ)₂ motifs, in order of their prominence. The results are remarkably distinct from and complementary to the standard hydrogen-bond-centered view of secondary structure. New insights include an unprecedented level of precision in describing the φ,ψ angles of both previously known and novel motifs, ordering of these motifs by their population density, a data-driven recommendation that the standard Cα(i)…Cα(i+3) < 7 Å criteria for defining turns be changed to 6.5 Å, identification of β-strand and turn capping motifs, and identification of conformational capping by residues in polypeptide II conformation. We further document that the conformational preferences of a residue are substantially influenced by the conformation of its neighbors, and we suggest that accounting for these dependencies will improve protein modeling accuracy. Although the CUEVAS-4D(r₁₀ε₁₄) 'parts list' presented here is only an initial exploration of the complex (φ,ψ)₂ landscape of proteins, it shows that there is value to be had from this approach, and it opens the door to more in-depth characterizations at the (φ,ψ)₂ level and at higher dimensions.
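ILLUSTRATION: The two geometric quantities named in the abstract can be sketched in a few lines of Python. This is only a minimal sketch under stated assumptions, not the CUEVAS algorithm itself: the function names (`angle_diff`, `phipsi2_distance`, `is_turn`) are hypothetical, angles are assumed to be in degrees, and periodicity is handled by a wraparound difference, which is a swapped-in simplification rather than the origin shifting the abstract describes. The 6.5 Å default reflects the cutoff the abstract recommends; 7 Å is the standard value it proposes to replace.

```python
import math


def angle_diff(a, b):
    """Smallest signed difference a - b between two angles, in degrees.

    The result lies in [-180, 180), so 170 and -170 are 20 degrees apart.
    """
    return (a - b + 180.0) % 360.0 - 180.0


def phipsi2_distance(p, q):
    """Distance between two (phi_i, psi_i, phi_i+1, psi_i+1) points.

    Assumption: a Euclidean distance over wrapped per-angle differences.
    The source does not specify the metric used by its clustering step.
    """
    return math.sqrt(sum(angle_diff(a, b) ** 2 for a, b in zip(p, q)))


def is_turn(ca_i, ca_i3, cutoff=6.5):
    """Apply the C-alpha(i)...C-alpha(i+3) distance criterion for a turn.

    ca_i and ca_i3 are (x, y, z) coordinates in angstroms.
    """
    return math.dist(ca_i, ca_i3) < cutoff
```

A residue pair whose Cα(i)…Cα(i+3) distance is 6.8 Å would count as a turn under the standard 7 Å cutoff (`is_turn(a, b, cutoff=7.0)`) but not under the recommended 6.5 Å default.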
SUBMITTER: Hollingsworth SA
PROVIDER: S-EPMC3268948 | biostudies-literature | 2012 Feb
REPOSITORIES: biostudies-literature