Dataset Information

Genome-Based Comparison of Clostridioides difficile: Average Amino Acid Identity Analysis of Core Genomes.

ABSTRACT: Infections due to Clostridioides difficile (previously known as Clostridium difficile) are a major problem in hospitals, where cases can be caused by community-acquired strains as well as by nosocomial spread. Whole-genome sequences from clinical samples contain a wealth of information, but it needs to be analyzed and compared in such a way that the outcome is useful for clinicians or epidemiologists. Here, we compare 663 publicly available complete genome sequences of C. difficile using average amino acid identity (AAI) scores. This analysis revealed that most of these genomes (640, 96.5%) clearly belong to the same species, while the remaining 23 genomes form four distinct clusters within the Clostridioides genus. The main C. difficile cluster can be further divided into sub-clusters, depending on the chosen cutoff. We demonstrate that MLST, based on either partial or full gene length, results in biased estimates of genetic differences and does not capture the true degree of similarity or difference between complete genomes. The presence of genes coding for C. difficile toxins A and B (ToxA/B), as well as the binary C. difficile toxin (CDT), was deduced from their unique PfamA domain architectures. Of the 663 C. difficile genomes, 535 (80.7%) contained at least one copy of ToxA or ToxB, while these genes were missing from 128 genomes. Although some clusters were enriched for toxin presence, these genes are variably present in a given genetic background. The CDT genes were found in 191 genomes, which were restricted to a few clusters, and only one cluster consistently lacked the toxin A/B genes. A total of 310 genomes (47%) contained ToxA/B without CDT. Further, published metagenomic data from stools were used to assess the presence of C. difficile sequences in blinded cases of C. difficile infection (CDI) and controls, to test whether metagenomic analysis is sensitive enough to detect the pathogen, and to establish strain relationships between cases from the same hospital. We conclude that metagenomics can contribute to the identification of CDI and can assist in characterizing the most probable causative strain in CDI patients.
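
The abstract notes that the genomes fall into clusters, and that the number of sub-clusters depends on the chosen AAI cutoff. The record does not describe how the AAI scores were computed or how clusters were formed, so the following is only a minimal Python sketch under stated assumptions: AAI is taken as the mean percent identity over proteins shared by two genomes, and clusters are built by single linkage at a cutoff. The function names, the toy identity values, and the cutoffs are all hypothetical and are not taken from the study.

```python
from itertools import combinations


def average_amino_acid_identity(hit_identities):
    """Mean percent identity over the proteins shared by two genomes.

    Assumption: the caller has already paired up shared proteins
    (for example by reciprocal best hits) and supplies their identities.
    """
    if not hit_identities:
        raise ValueError("no shared proteins to average over")
    return sum(hit_identities) / len(hit_identities)


def cluster_by_cutoff(genomes, aai, cutoff):
    """Single-linkage clusters: two genomes are joined if AAI >= cutoff.

    Assumption: single linkage is used here for illustration only.
    """
    parent = {g: g for g in genomes}

    def find(g):
        # Follow parent links to the cluster representative.
        while parent[g] != g:
            parent[g] = parent[parent[g]]
            g = parent[g]
        return g

    for a, b in combinations(genomes, 2):
        if aai[frozenset((a, b))] >= cutoff:
            parent[find(a)] = find(b)

    clusters = {}
    for g in genomes:
        clusters.setdefault(find(g), []).append(g)
    return sorted(sorted(members) for members in clusters.values())


# Toy identities (hypothetical values, not data from the study).
hits = {
    frozenset(("g1", "g2")): [99.0, 98.0, 100.0],
    frozenset(("g1", "g3")): [90.0, 88.0, 92.0],
    frozenset(("g2", "g3")): [91.0, 89.0, 90.0],
}
genomes = ["g1", "g2", "g3"]
aai = {pair: average_amino_acid_identity(ids) for pair, ids in hits.items()}

print(cluster_by_cutoff(genomes, aai, cutoff=95.0))
print(cluster_by_cutoff(genomes, aai, cutoff=85.0))
```

With the toy values, a strict cutoff separates g3 from the other two genomes, while a looser cutoff merges all three, which illustrates why the sub-cluster count in such an analysis depends on the threshold.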

SUBMITTER: Cabal A

PROVIDER: S-EPMC6132499 | biostudies-literature | 2018 Oct

REPOSITORIES: biostudies-literature


Publications

Genome-Based Comparison of Clostridioides difficile: Average Amino Acid Identity Analysis of Core Genomes.

Cabal A, Jun SR, Jenjaroenpun P, Wanchai V, Nookaew I, Wongsurawat T, Burgess MJ, Kothari A, Wassenaar TM, Ussery DW

Microbial Ecology, 2018-02-14, issue 3


PMID: 29445826

Similar Datasets

Different CprABC amino acid sequences affect nisin A susceptibility in Clostridioides difficile isolates.
| S-EPMC9858009 | biostudies-literature

Core genome multilocus sequence typing of Clostridioides difficile to investigate transmission in the hospital setting.
| S-EPMC10651541 | biostudies-literature

Bile acid-independent protection against Clostridioides difficile infection.
| S-EPMC8555850 | biostudies-literature

Clostridioides difficile WalRK
2022-04-09 | GSE200346 | GEO

Comparison of Whole-Genome Sequence-Based Methods and PCR Ribotyping for Subtyping of Clostridioides difficile.
| S-EPMC8849210 | biostudies-literature

Application of a core genome sequence typing (cgMLST) pipeline for surveillance of Clostridioides difficile in China.
| S-EPMC10040748 | biostudies-literature

Systems biology analysis of the Clostridioides difficile core-genome contextualizes microenvironmental evolutionary pressures leading to genotypic and phenotypic divergence.
| S-EPMC7576604 | biostudies-literature

Using average nucleotide identity to improve taxonomic assignments in prokaryotic genomes at the NCBI.
| S-EPMC6978984 | biostudies-literature

Enterococci enhance Clostridioides difficile pathogenesis
2022-09-17 | GSE165751 | GEO

Clostridioides difficile responses to calprotectin
2019-10-24 | GSE135912 | GEO