Dataset Information

Microbial species delineation using whole genome sequences.


ABSTRACT: Increased sequencing of microbial genomes has revealed that prevailing prokaryotic species assignments can be inconsistent with whole genome information for a significant number of species. The long-standing need for a systematic and scalable species assignment technique can be met by the genome-wide Average Nucleotide Identity (gANI) metric, which is widely acknowledged as a robust measure of genomic relatedness. In this work, we demonstrate that the combination of gANI and the alignment fraction (AF) between two genomes accurately reflects their genomic relatedness. We introduce an efficient implementation of AF,gANI and discuss its successful application to 86.5M genome pairs between 13,151 prokaryotic genomes assigned to 3032 species. Subsequently, by comparing the genome clusters obtained from complete linkage clustering of these pairs to existing taxonomy, we observed that nearly 18% of all prokaryotic species suffer from anomalies in species definition. Our results can be used to explore central questions such as whether microorganisms form a continuum of genetic diversity or distinct species represented by distinct genetic signatures. We propose that this precise and objective AF,gANI-based species definition: the MiSI (Microbial Species Identifier) method, be used to address previous inconsistencies in species classification and as the primary guide for new taxonomic species assignment, supplemented by the traditional polyphasic approach, as required.

SUBMITTER: Varghese NJ 

PROVIDER: S-EPMC4538840 | biostudies-literature | 2015 Aug

REPOSITORIES: biostudies-literature

Publications

Microbial species delineation using whole genome sequences.

Varghese NJ, Mukherjee S, Ivanova N, Konstantinidis KT, Mavrommatis K, Kyrpides NC, Pati A

Nucleic Acids Research, 2015 Jul 6, issue 14


Similar Datasets

| S-EPMC3583267 | biostudies-literature
| S-EPMC11302064 | biostudies-literature
| PRJNA816270 | ENA
| S-EPMC5277532 | biostudies-literature
| S-EPMC2998322 | biostudies-literature
| S-EPMC3674514 | biostudies-literature
| S-EPMC8111133 | biostudies-literature
| S-EPMC5038937 | biostudies-literature
| S-EPMC8510864 | biostudies-literature
| S-EPMC4564799 | biostudies-literature