Dataset Information

MolProbity's ultimate rotamer-library distributions for model validation.


ABSTRACT: Here we describe the updated MolProbity rotamer-library distributions derived from an order-of-magnitude larger and more stringently quality-filtered dataset of about 8000 (vs. 500) protein chains, and we explain the resulting changes and improvements to model validation as seen by users. To include only side-chains with satisfactory justification for their given conformation, we added residue-specific filters for electron-density value and model-to-density fit. The combined new protocol retains a million residues of data, while cleaning up false-positive noise in the multi-χ datapoint distributions. It enables unambiguous characterization of conformational clusters nearly 1000-fold less frequent than the most common ones. We describe examples of local interactions that favor these rare conformations, including the role of authentic covalent bond-angle deviations in enabling presumably strained side-chain conformations. Further, along with favored and outlier, an allowed category (0.3-2.0% occurrence in reference data) has been added, analogous to Ramachandran validation categories. The new rotamer distributions are used for current rotamer validation in MolProbity and PHENIX, and for rotamer choice in PHENIX model-building and refinement. The multi-dimensional χ distributions and Top8000 reference dataset are freely available on GitHub. These rotamers are termed "ultimate" because data sampling and quality are now fully adequate for this task, and also because we believe the future of conformational validation should integrate side-chain with backbone criteria. Proteins 2016; 84:1177-1189. © 2016 Wiley Periodicals, Inc.
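
The three validation categories named in the abstract can be illustrated with a minimal sketch. It assumes a side-chain conformation has already been assigned a percent occurrence in the reference data, and it takes the 0.3% and 2.0% cutoffs from the abstract. The function name and the treatment of values falling exactly on a cutoff are assumptions made for illustration, not MolProbity's or PHENIX's actual code.

```python
# Hypothetical helper: maps a percent-occurrence score to a category.
# Cutoffs come from the abstract (allowed = 0.3-2.0% occurrence);
# treating each cutoff as the lower bound of the higher category
# is an assumption.
OUTLIER_CUTOFF = 0.3   # percent; below this is an outlier
ALLOWED_CUTOFF = 2.0   # percent; at or above this is favored


def classify_rotamer(percent_occurrence: float) -> str:
    """Return 'outlier', 'allowed', or 'favored' for a score in percent."""
    if not 0.0 <= percent_occurrence <= 100.0:
        raise ValueError("percent_occurrence must be between 0 and 100")
    if percent_occurrence < OUTLIER_CUTOFF:
        return "outlier"
    if percent_occurrence < ALLOWED_CUTOFF:
        return "allowed"
    return "favored"


if __name__ == "__main__":
    for score in (0.1, 1.0, 35.0):
        print(f"{score:5.1f}% -> {classify_rotamer(score)}")
```

The split mirrors the favored/allowed/outlier scheme that the abstract describes as analogous to Ramachandran validation categories.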

SUBMITTER: Hintze BJ 

PROVIDER: S-EPMC4983197 | biostudies-literature | 2016 Sep

REPOSITORIES: biostudies-literature

Publications

MolProbity's ultimate rotamer-library distributions for model validation.

Hintze BJ, Lewis SM, Richardson JS, Richardson DC

Proteins, 2016-06-23, issue 9


Similar Datasets

| S-EPMC3764097 | biostudies-literature
| S-EPMC2947618 | biostudies-literature
| S-EPMC4227732 | biostudies-literature
| S-EPMC3079439 | biostudies-literature
| S-EPMC3118414 | biostudies-literature
| S-EPMC2286725 | biostudies-literature
| S-EPMC3601743 | biostudies-literature
| S-EPMC3302646 | biostudies-literature
| S-EPMC5732036 | biostudies-literature
| S-EPMC6761354 | biostudies-literature