Dataset Information

Bacterial Genes Outnumber Archaeal Genes in Eukaryotic Genomes.


ABSTRACT: Eukaryotes are typically depicted as descendants of archaea, but their genomes are evolutionary chimeras with genes stemming from archaea and bacteria. Which prokaryotic heritage predominates? Here, we have clustered 19,050,992 protein sequences from 5,443 bacteria and 212 archaea with 3,420,731 protein sequences from 150 eukaryotes spanning six eukaryotic supergroups. By downsampling, we obtain estimates for the bacterial and archaeal proportions. Eukaryotic genomes possess a bacterial majority of genes. On average, the bacterial majority is 56% overall, 53% in eukaryotes that never possessed plastids, and 61% in photosynthetic eukaryotic lineages, where the cyanobacterial ancestor of plastids contributed additional genes to the eukaryotic lineage. Intracellular parasites, which undergo reductive evolution in adaptation to the nutrient-rich environment of the cells that they infect, relinquish bacterial genes for metabolic processes. Such adaptive gene loss is most pronounced in the human parasite Encephalitozoon intestinalis, with 86% archaeal-derived and 14% bacterial-derived genes. The most bacterial eukaryote genome sampled is rice, with 67% bacterial and 33% archaeal genes. The functional dichotomy, initially described for yeast, of archaeal genes being involved in genetic information processing and bacterial genes being involved in metabolic processes is conserved across all eukaryotic supergroups.
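The abstract mentions downsampling without describing the procedure. The sample contains far more bacteria (5,443) than archaea (212), so one plausible reading is that bacterial genomes are repeatedly subsampled to the archaeal count before the proportions are computed. The Python sketch below illustrates that idea only. The function name, the input layout, the classification rule (a cluster counts as bacterial if it has hits only in sampled bacteria, archaeal if only in archaea), and the toy data are all assumptions made for illustration, not the authors' pipeline.

```python
import random


def downsampled_bacterial_fraction(clusters, bacterial_genomes, n_archaea,
                                   n_iter=100, seed=0):
    """Estimate the bacterial share of eukaryotic gene clusters (hypothetical helper).

    clusters: list of (bacterial_genome_ids, archaeal_genome_ids) set pairs,
              one pair per eukaryotic gene cluster.
    bacterial_genomes: list of all bacterial genome ids.
    n_archaea: number of archaeal genomes; bacteria are subsampled to this size.
    """
    rng = random.Random(seed)
    fractions = []
    for _ in range(n_iter):
        # Draw as many bacterial genomes as there are archaeal genomes.
        sample = set(rng.sample(bacterial_genomes, n_archaea))
        bacterial = archaeal = 0
        for bac_hits, arc_hits in clusters:
            has_bac = bool(bac_hits & sample)
            has_arc = bool(arc_hits)
            # Assumed rule: count only clusters with an exclusive affiliation.
            if has_bac and not has_arc:
                bacterial += 1
            elif has_arc and not has_bac:
                archaeal += 1
        total = bacterial + archaeal
        if total:
            fractions.append(bacterial / total)
    # Average over iterations; NaN if no cluster could ever be assigned.
    return sum(fractions) / len(fractions) if fractions else float("nan")


# Toy data (invented): three clusters, two bacterial genomes, two archaeal genomes.
toy_clusters = [
    ({"b1", "b2"}, set()),   # bacterial hits only
    (set(), {"a1"}),         # archaeal hits only
    ({"b1"}, {"a2"}),        # hits in both domains, left unassigned
]
toy_fraction = downsampled_bacterial_fraction(
    toy_clusters, ["b1", "b2"], n_archaea=2, n_iter=10
)
```

With the toy input, every draw contains both bacterial genomes, so one cluster is counted as bacterial and one as archaeal in each iteration. On real data the averaged fraction would vary with the random draws, which is why the sketch repeats the sampling.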

SUBMITTER: Brueckner J 

PROVIDER: S-EPMC7151554 | biostudies-literature | 2020 Apr

REPOSITORIES: biostudies-literature

Publications

Bacterial Genes Outnumber Archaeal Genes in Eukaryotic Genomes.

Brueckner J, Martin WF

Genome Biology and Evolution, 2020-04-01, issue 4


Similar Datasets

| S-EPMC6938587 | biostudies-literature
| S-EPMC3636077 | biostudies-literature
| S-EPMC2573894 | biostudies-other
| S-EPMC152858 | biostudies-literature
| S-EPMC1824705 | biostudies-literature
| S-EPMC1129124 | biostudies-literature
| S-EPMC4581349 | biostudies-literature
| S-EPMC4844061 | biostudies-literature
| S-EPMC2144420 | biostudies-other
| S-EPMC1847833 | biostudies-other