Dataset Information

Universal distribution of protein evolution rates as a consequence of protein folding physics.


ABSTRACT: The hypothesis that folding robustness is the primary determinant of the evolution rate of proteins is explored using a coarse-grained off-lattice model. The simplicity of the model allows rapid computation of the folding probability of a sequence to any folded conformation. For each robust folder, the network of sequences that share its native structure is identified. The fitness of a sequence is postulated to be a simple function of the number of misfolded molecules that have to be produced to reach a characteristic protein abundance. After fixation probabilities of mutants are computed under a simple population dynamics model, a Markov chain on the fold network is constructed, and the fold-averaged evolution rate is computed. The distribution of the logarithm of the evolution rates across distinct networks exhibits a peak with a long tail on the low rate side and resembles the universal empirical distribution of the evolutionary rates more closely than either distribution resembles the log-normal distribution. The results suggest that the universal distribution of the evolutionary rates of protein-coding genes is a direct consequence of the basic physics of protein folding.
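The pipeline outlined in the abstract (misfolding cost, then fitness, then fixation probabilities, then a Markov chain on the fold network, then a fold-averaged rate) can be illustrated with a minimal sketch. This is not the authors' implementation. The exponential fitness form, the Kimura-style fixation formula, the parameters `abundance`, `cost`, `pop_size`, and `n_mutations`, and the three-sequence toy network are all assumptions chosen for illustration; the folding probabilities are made-up inputs rather than outputs of the off-lattice model.

```python
import math

def misfolded_copies(p_fold, abundance):
    # Expected misfolded molecules made while accumulating `abundance`
    # correctly folded ones; p_fold must lie in (0, 1].
    return abundance * (1.0 - p_fold) / p_fold

def fitness(p_fold, abundance=1000.0, cost=1e-4):
    # ASSUMPTION: fitness decays exponentially with the misfolded count.
    return math.exp(-cost * misfolded_copies(p_fold, abundance))

def fixation_probability(f_old, f_new, pop_size):
    # ASSUMPTION: Kimura-style diffusion formula for a haploid population.
    s = f_new / f_old - 1.0
    if abs(s) < 1e-12:
        return 1.0 / pop_size          # neutral limit
    exponent = -2.0 * pop_size * s
    if exponent > 700.0:               # avoid overflow; fixation is negligible
        return 0.0
    return (1.0 - math.exp(-2.0 * s)) / (1.0 - math.exp(exponent))

def fold_averaged_rate(p_folds, neighbors, n_mutations, pop_size=100, iters=5000):
    n = len(p_folds)
    fit = [fitness(p) for p in p_folds]
    # Substitution rate i -> j in units of the neutral rate.
    # Mutations leaving the network are treated as never fixing (assumption).
    q = [[0.0] * n for _ in range(n)]
    for i, nbrs in neighbors.items():
        for j in nbrs:
            u = fixation_probability(fit[i], fit[j], pop_size)
            q[i][j] = pop_size * u / n_mutations
    exit_rate = [sum(row) for row in q]
    scale = 2.0 * max(exit_rate)       # keeps the discretized chain aperiodic
    # Power iteration for the stationary distribution of P = I + Q / scale.
    pi = [1.0 / n] * n
    for _ in range(iters):
        new = [pi[i] * (1.0 - exit_rate[i] / scale) for i in range(n)]
        for i in range(n):
            for j in range(n):
                new[j] += pi[i] * q[i][j] / scale
        pi = new
    # Stationary average of the total substitution rate out of each sequence.
    rate = sum(pi[i] * exit_rate[i] for i in range(n))
    return rate, pi

# Toy network: three sequences on a path, with hypothetical folding probabilities.
p_folds = [0.95, 0.90, 0.80]
neighbors = {0: [1], 1: [0, 2], 2: [1]}
rate, pi = fold_averaged_rate(p_folds, neighbors, n_mutations=3)
print(rate, pi)
```

In this toy setting the stationary distribution concentrates on the most robust folder, so the averaged rate is dominated by how readily that sequence accepts substitutions. Repeating the calculation over many networks would give a distribution of rates of the kind the abstract discusses.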

SUBMITTER: Lobkovsky AE 

PROVIDER: S-EPMC2840281 | biostudies-literature | 2010 Feb

REPOSITORIES: biostudies-literature

Publications

Universal distribution of protein evolution rates as a consequence of protein folding physics.

Alexander E. Lobkovsky, Yuri I. Wolf, Eugene V. Koonin

Proceedings of the National Academy of Sciences of the United States of America, 2010-01-26, issue 7


Similar Datasets

| S-EPMC1891811 | biostudies-literature
| S-EPMC8120156 | biostudies-literature
2011-12-01 | E-GEOD-27062 | biostudies-arrayexpress
| S-EPMC4380988 | biostudies-literature
| S-EPMC4918426 | biostudies-literature
| S-EPMC6660724 | biostudies-literature
| S-EPMC124919 | biostudies-literature
2011-12-01 | GSE27062 | GEO
| S-EPMC2863059 | biostudies-literature
| S-EPMC4143136 | biostudies-literature