Unknown

Dataset Information

0

Amino Acid metabolism conflicts with protein diversity.


ABSTRACT: The 20 protein-coding amino acids are found in proteomes with different relative abundances. The most abundant amino acid, leucine, is nearly an order of magnitude more prevalent than the least abundant amino acid, cysteine. Amino acid metabolic costs differ similarly, constraining their incorporation into proteins. On the other hand, a diverse set of protein sequences is necessary to build functional proteomes. Here, we present a simple model for a cost-diversity trade-off postulating that natural proteomes minimize amino acid metabolic flux while maximizing sequence entropy. The model explains the relative abundances of amino acids across a diverse set of proteomes. We found that the data are remarkably well explained when the cost function accounts for amino acid chemical decay. More than 100 organisms reach comparable solutions to the trade-off by different combinations of proteome cost and sequence diversity. Quantifying the interplay between proteome size and entropy shows that proteomes can get optimally large and diverse.

SUBMITTER: Krick T 

PROVIDER: S-EPMC4209132 | biostudies-literature | 2014 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Amino Acid metabolism conflicts with protein diversity.

Krick Teresa T   Verstraete Nina N   Alonso Leonardo G LG   Shub David A DA   Ferreiro Diego U DU   Shub Michael M   Sánchez Ignacio E IE  

Molecular biology and evolution 20140801 11


The 20 protein-coding amino acids are found in proteomes with different relative abundances. The most abundant amino acid, leucine, is nearly an order of magnitude more prevalent than the least abundant amino acid, cysteine. Amino acid metabolic costs differ similarly, constraining their incorporation into proteins. On the other hand, a diverse set of protein sequences is necessary to build functional proteomes. Here, we present a simple model for a cost-diversity trade-off postulating that natu  ...[more]

Similar Datasets

2023-05-09 | GSE231759 | GEO
| S-EPMC2582520 | biostudies-literature
2023-05-09 | GSE231757 | GEO
2023-05-09 | GSE231756 | GEO
2015-09-09 | E-GEOD-70948 | biostudies-arrayexpress
| S-EPMC7909243 | biostudies-literature
2021-10-13 | GSE126024 | GEO
2023-05-09 | GSE231758 | GEO
| S-EPMC5429979 | biostudies-literature
| S-EPMC5685199 | biostudies-literature