Unknown

Dataset Information

0

Testing Empirical Support for Evolutionary Models that Root the Tree of Life.


ABSTRACT: Trees of life (ToLs) can only be rooted with direct methods that seek optimization of character state information in ingroup taxa. This involves optimizing phylogenetic tree, model and data in an exercise of reciprocal illumination. Rooted ToLs have been built from a census of protein structural domains in proteomes using two kinds of models. Fully-reversible models use standard-ordered (additive) characters and Wagner parsimony to generate unrooted trees of proteomes that are then rooted with Weston's generality criterion. Non-reversible models directly build rooted trees with unordered characters and asymmetric stepmatrices of transformation costs that penalize gain over loss of domains. Here, we test the empirical support for the evolutionary models with character state reconstruction methods using two published proteomic datasets. We show that the reversible models match reconstructed frequencies of character change and are faithful to the distribution of serial homologies in trees. In contrast, the non-reversible models go counter to trends in the data they must explain, attracting organisms with large proteomes to the base of the rooted trees while violating the triangle inequality of distances. This can lead to serious reconstruction inconsistencies that show model inadequacy. Our study highlights the aprioristic perils of disposing of countering evidence in natural history reconstruction.

SUBMITTER: Caetano-Anolles D 

PROVIDER: S-EPMC6443624 | biostudies-literature | 2019 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

Testing Empirical Support for Evolutionary Models that Root the Tree of Life.

Caetano-Anollés Derek D   Nasir Arshan A   Kim Kyung Mo KM   Caetano-Anollés Gustavo G  

Journal of molecular evolution 20190318 2-3


Trees of life (ToLs) can only be rooted with direct methods that seek optimization of character state information in ingroup taxa. This involves optimizing phylogenetic tree, model and data in an exercise of reciprocal illumination. Rooted ToLs have been built from a census of protein structural domains in proteomes using two kinds of models. Fully-reversible models use standard-ordered (additive) characters and Wagner parsimony to generate unrooted trees of proteomes that are then rooted with W  ...[more]

Similar Datasets

| S-EPMC6837999 | biostudies-literature
| S-EPMC4911941 | biostudies-literature
| S-EPMC9366629 | biostudies-literature
| S-EPMC3323970 | biostudies-literature
| S-EPMC6942926 | biostudies-literature
| S-EPMC4024910 | biostudies-literature
| S-EPMC9303462 | biostudies-literature
| S-EPMC9952988 | biostudies-literature
| S-EPMC9260635 | biostudies-literature
| S-EPMC42233 | biostudies-other