Unknown

Dataset Information

0

An evolutionarily structured universe of protein architecture.


ABSTRACT: Protein structural diversity encompasses a finite set of architectural designs. Embedded in these topologies are evolutionary histories that we here uncover using cladistic principles and measurements of protein-fold usage and sharing. The reconstructed phylogenies are inherently rooted and depict histories of protein and proteome diversification. Proteome phylogenies showed two monophyletic sister-groups delimiting Bacteria and Archaea, and a topology rooted in Eucarya. This suggests three dramatic evolutionary events and a common ancestor with a eukaryotic-like, gene-rich, and relatively modern organization. Conversely, a general phylogeny of protein architectures showed that structural classes of globular proteins appeared early in evolution and in defined order, the alpha/beta class being the first. Although most ancestral folds shared a common architecture of barrels or interleaved beta-sheets and alpha-helices, many were clearly derived, such as polyhedral folds in the all-alpha class and beta-sandwiches, beta-propellers, and beta-prisms in all-beta proteins. We also describe transformation pathways of architectures that are prevalently used in nature. For example, beta-barrels with increased curl and stagger were favored evolutionary outcomes in the all-beta class. Interestingly, we found cases where structural change followed the alpha-to-beta tendency uncovered in the tree of architectures. Lastly, we traced the total number of enzymatic functions associated with folds in the trees and show that there is a general link between structure and enzymatic function.

SUBMITTER: Caetano-Anolles G 

PROVIDER: S-EPMC403752 | biostudies-literature | 2003 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

An evolutionarily structured universe of protein architecture.

Caetano-Anollés Gustavo G   Caetano-Anollés Derek D  

Genome research 20030701 7


Protein structural diversity encompasses a finite set of architectural designs. Embedded in these topologies are evolutionary histories that we here uncover using cladistic principles and measurements of protein-fold usage and sharing. The reconstructed phylogenies are inherently rooted and depict histories of protein and proteome diversification. Proteome phylogenies showed two monophyletic sister-groups delimiting Bacteria and Archaea, and a topology rooted in Eucarya. This suggests three dram  ...[more]

Similar Datasets

| S-EPMC2698892 | biostudies-literature
| S-EPMC3118404 | biostudies-literature
| S-EPMC4136566 | biostudies-literature
| S-EPMC4845728 | biostudies-literature
| S-EPMC2744874 | biostudies-literature
| S-EPMC5127300 | biostudies-literature
| S-EPMC6481876 | biostudies-literature
| S-EPMC2973819 | biostudies-literature
| S-EPMC3557196 | biostudies-literature
| S-EPMC10133388 | biostudies-literature