
Dataset Information


Whole-proteome tree of life suggests a deep burst of organism diversity.


ABSTRACT: An organism tree of life (organism ToL) is a conceptual and metaphorical tree to capture a simplified narrative of the evolutionary course and kinship among the extant organisms. Such a tree cannot be experimentally validated but may be reconstructed based on characteristics associated with the organisms. Since the whole-genome sequence of an organism is, at present, the most comprehensive descriptor of the organism, a whole-genome sequence-based ToL can be an empirically derivable surrogate for the organism ToL. However, experimentally determining the whole-genome sequences of many diverse organisms was practically impossible until recently. We have constructed three types of ToLs for diversely sampled organisms using whole-genome, whole-transcriptome, and whole-proteome sequences. Of the three, the whole-proteome sequence-based ToL (whole-proteome ToL), constructed by applying an information theory-based feature frequency profile method, an "alignment-free" method, gave the most topologically stable ToL. Here, we describe the main features of a whole-proteome ToL for 4,023 species with known complete or almost complete genome sequences, focusing on grouping and kinship among the groups at deep evolutionary levels. The ToL reveals that 1) all extant organisms of this study can be grouped into 2 "Supergroups," 6 "Major Groups," or 35+ "Groups"; 2) the order of emergence of the "founders" of all of the groups may be assigned on an evolutionary progression scale; 3) all of the founders of the groups emerged in a "deep burst" at the very beginning, near the root of the ToL: an explosive birth of life's diversity.
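The abstract names the feature frequency profile (FFP) method without describing it. As a rough illustration only, the sketch below counts overlapping k-mers ("features") across a set of protein sequences, normalizes them into a frequency profile, and compares two profiles with the Jensen-Shannon divergence. The choice of Jensen-Shannon divergence, the feature length k = 3, the function names, and the toy sequences are all assumptions made for this example; the abstract says only that the method is information theory-based and alignment-free, and this is not the authors' implementation.

```python
from collections import Counter
from math import log2


def feature_frequency_profile(proteins, k):
    """Relative frequencies of overlapping k-mers pooled over all proteins.

    `proteins` is a list of amino-acid strings; k-mers never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = sum(counts.values())
    return {feature: n / total for feature, n in counts.items()}


def jensen_shannon_divergence(p, q):
    """Jensen-Shannon divergence (base 2) between two sparse profiles.

    Assumed here as the profile distance; it is 0 for identical profiles
    and 1 for profiles that share no features.
    """
    def kl_to_mixture(a, b):
        # Kullback-Leibler divergence from a to the mixture (a + b) / 2,
        # summed over the features that occur in a.
        return sum(
            pa * log2(pa / (0.5 * (pa + b.get(f, 0.0))))
            for f, pa in a.items()
        )

    return 0.5 * kl_to_mixture(p, q) + 0.5 * kl_to_mixture(q, p)


# Toy "proteomes" (hypothetical sequences, for illustration only).
proteome_a = ["MKTAYIAKQR", "MKTAYLAKQR"]
proteome_b = ["MKTAYIAKQR", "GGSWWPLLNN"]

profile_a = feature_frequency_profile(proteome_a, k=3)
profile_b = feature_frequency_profile(proteome_b, k=3)
distance_ab = jensen_shannon_divergence(profile_a, profile_b)
```

In a full pipeline, pairwise distances of this kind would fill a distance matrix over all sampled organisms, and a distance-based tree-building step would turn that matrix into a ToL; that step is omitted here.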

SUBMITTER: Choi J 

PROVIDER: S-EPMC7035600 | biostudies-literature | 2020 Feb

REPOSITORIES: biostudies-literature


Publications

Whole-proteome tree of life suggests a deep burst of organism diversity.

Choi JaeJin, Kim Sung-Hou

Proceedings of the National Academy of Sciences of the United States of America, 2020-02-04, issue 7



Similar Datasets

2019-02-18 | GSE112636 | GEO
| S-EPMC7940616 | biostudies-literature
| S-EPMC4157354 | biostudies-literature
| S-EPMC5681887 | biostudies-literature
| S-EPMC8795533 | biostudies-literature
| S-EPMC6533610 | biostudies-literature
| PRJNA524399 | ENA
| PRJNA432042 | ENA
| S-EPMC2629814 | biostudies-literature
| S-EPMC1274291 | biostudies-literature