Unknown

Dataset Information

0

Phylogenetically typing bacterial strains from partial SNP genotypes observed from direct sequencing of clinical specimen metagenomic data.


ABSTRACT: We describe an approach for genotyping bacterial strains from low coverage genome datasets, including metagenomic data from complex samples. Sequence reads from unknown samples are aligned to a reference genome where the allele states of known SNPs are determined. The Whole Genome Focused Array SNP Typing (WG-FAST) pipeline can identify unknown strains with much less read data than is needed for genome assembly. To test WG-FAST, we resampled SNPs from real samples to understand the relationship between low coverage metagenomic data and accurate phylogenetic placement. WG-FAST can be downloaded from https://github.com/jasonsahl/wgfast.

SUBMITTER: Sahl JW 

PROVIDER: S-EPMC4487561 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC4166679 | biostudies-literature
| S-EPMC10793249 | biostudies-literature
| S-EPMC3182885 | biostudies-other
| S-EPMC3230589 | biostudies-literature
| S-EPMC9530715 | biostudies-literature
| S-EPMC6814786 | biostudies-literature
| S-EPMC4579273 | biostudies-literature
| S-EPMC3911330 | biostudies-literature
| S-EPMC10714970 | biostudies-literature
| S-EPMC3421839 | biostudies-literature