
Dataset Information


Pangenomic Definition of Prokaryotic Species and the Phylogenetic Structure of Prochlorococcus spp.


ABSTRACT: The pangenome is the collection of all groups of orthologous genes (OGGs) from a set of genomes. We apply pangenome analysis to propose a definition of prokaryotic species based on the identification of lineage-specific gene sets. Although similar to the classical biological definition based on allele flow, this definition does not rely on DNA similarity levels and does not require analysis of homologous recombination. Hence it is relatively objective and independent of arbitrary thresholds. A systematic analysis of the 110 accepted species with the largest numbers of sequenced strains yields results largely consistent with the existing nomenclature. However, it reveals that the abundant marine cyanobacterium Prochlorococcus marinus should be divided into two species. As a control, we confirm the paraphyletic origin of Yersinia pseudotuberculosis (with embedded, monophyletic Y. pestis) and Burkholderia pseudomallei (with B. mallei). We also demonstrate that, by our definition and in accordance with recent studies, Escherichia coli and Shigella spp. are one species.
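
The two notions in the abstract, the pangenome as a union of OGGs and a lineage-specific gene set, can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function names, the toy strain and OGG identifiers, and the strict criterion used here (an OGG counts as lineage-specific when it is present in every genome of the lineage and absent from every genome outside it) are assumptions made only for illustration.

```python
from typing import Dict, Iterable, Set


def pangenome(genomes: Dict[str, Set[str]]) -> Set[str]:
    """Return the union of all OGG identifiers found in any genome."""
    return set().union(*genomes.values())


def lineage_specific_oggs(
    genomes: Dict[str, Set[str]], lineage: Iterable[str]
) -> Set[str]:
    """Return OGGs shared by all genomes in `lineage` and absent elsewhere.

    Assumed criterion (hypothetical, for illustration only).
    """
    members = set(lineage)
    inside = [oggs for name, oggs in genomes.items() if name in members]
    outside = [oggs for name, oggs in genomes.items() if name not in members]
    if not inside:
        return set()
    shared = set.intersection(*inside)   # present in every lineage member
    elsewhere = set().union(*outside)    # present in any outside genome
    return shared - elsewhere


# Toy presence/absence data; all names are hypothetical.
toy_genomes = {
    "strain_A1": {"ogg1", "ogg2", "ogg3"},
    "strain_A2": {"ogg1", "ogg2", "ogg4"},
    "strain_B1": {"ogg1", "ogg5"},
}

toy_pangenome = pangenome(toy_genomes)
toy_specific = lineage_specific_oggs(toy_genomes, ["strain_A1", "strain_A2"])
```

In this toy input, `ogg1` occurs in every strain, so it cannot mark the A lineage, while `ogg3` and `ogg4` are each missing from one A strain; only `ogg2` satisfies the assumed criterion.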

SUBMITTER: Moldovan MA 

PROVIDER: S-EPMC5857598 | biostudies-literature | 2018

REPOSITORIES: biostudies-literature


Publications

Pangenomic Definition of Prokaryotic Species and the Phylogenetic Structure of Prochlorococcus spp.

Moldovan Mikhail A, Gelfand Mikhail S

Frontiers in Microbiology, 2018-03-12



Similar Datasets

| S-EPMC2776425 | biostudies-literature
| S-EPMC5982828 | biostudies-literature
| S-EPMC7174425 | biostudies-literature
| S-EPMC3019706 | biostudies-literature
| S-EPMC6762002 | biostudies-literature
| S-EPMC9122939 | biostudies-literature
| S-EPMC4686806 | biostudies-literature
| S-EPMC7311419 | biostudies-literature
| S-EPMC4810814 | biostudies-literature
| S-EPMC3372372 | biostudies-literature