Unknown

Dataset Information

0

Accuracy issues involved in modeling in vivo protein structures using PM7.


ABSTRACT: Using the semiempirical method PM7, an attempt has been made to quantify the error in prediction of the in vivo structure of proteins relative to X-ray structures. Three important contributory factors are the experimental limitations of X-ray structures, the difference between the crystal and solution environments, and the errors due to PM7. The geometries of 19 proteins from the Protein Data Bank that had small R values, that is, high accuracy structures, were optimized and the resulting drop in heat of formation was calculated. Analysis of the changes showed that about 10% of this decrease in heat of formation was caused by faults in PM7, the balance being attributable to the X-ray structure and the difference between the crystal and solution environments. A previously unknown fault in PM7 was revealed during tests to validate the geometries generated using PM7. Clashscores generated by the Molprobity molecular mechanics structure validation program showed that PM7 was predicting unrealistically close contacts between nonbonding atoms in regions where the local geometry is dominated by very weak noncovalent interactions. The origin of this fault was traced to an underestimation of the core-core repulsion between atoms at distances smaller than the equilibrium distance.

SUBMITTER: Martin BP 

PROVIDER: S-EPMC4744657 | biostudies-literature | 2015 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

Accuracy issues involved in modeling in vivo protein structures using PM7.

Martin Benjamin P BP   Brandon Christopher J CJ   Stewart James J P JJ   Braun-Sand Sonja B SB  

Proteins 20150606 8


Using the semiempirical method PM7, an attempt has been made to quantify the error in prediction of the in vivo structure of proteins relative to X-ray structures. Three important contributory factors are the experimental limitations of X-ray structures, the difference between the crystal and solution environments, and the errors due to PM7. The geometries of 19 proteins from the Protein Data Bank that had small R values, that is, high accuracy structures, were optimized and the resulting drop i  ...[more]

Similar Datasets

| S-EPMC5011913 | biostudies-literature
| S-EPMC3806451 | biostudies-literature
2010-11-10 | PRD000105 | Pride
| S-EPMC8725162 | biostudies-literature
| S-EPMC7230021 | biostudies-literature
| S-EPMC2144714 | biostudies-other
| S-EPMC3672994 | biostudies-literature
| S-EPMC3415962 | biostudies-literature
| S-EPMC4637837 | biostudies-literature
| S-EPMC9278006 | biostudies-literature