
Dataset Information


Sequencing of E. coli strain UTI89 on multiple sequencing platforms.


ABSTRACT: OBJECTIVES: The availability of matched sequencing data for the same sample across different sequencing platforms is a necessity for validation and effective comparison of sequencing platforms. A commonly sequenced sample is the lab-adapted MG1655 strain of Escherichia coli; however, this strain is not fully representative of the more complex and dynamic genomes of pathogenic E. coli strains. DATA DESCRIPTION: We present six new sequencing data sets for another E. coli strain, UTI89, which is an extraintestinal pathogenic strain isolated from a patient suffering from a urinary tract infection. We now provide matched whole-genome sequencing data generated using the PacBio RSII, Oxford Nanopore MinION R9.4, Ion Torrent, ABI SOLiD, and Illumina NextSeq sequencers. Together with other publicly available datasets, UTI89 has a nearly complete suite of data generated on most second- and third-generation sequencers. These data can be used as an additional validation set for new sequencing technologies and analytical methods. More than being another E. coli strain, however, UTI89 is pathogenic, with a 10% larger genome, additional pathogenicity islands, and a large plasmid, features that are common among other naturally occurring and disease-causing E. coli isolates. These data therefore provide a more medically relevant test set for the development of algorithms.

SUBMITTER: Fenlon SN 

PROVIDER: S-EPMC7576692 | biostudies-literature | 2020 Oct

REPOSITORIES: biostudies-literature



Similar Datasets

| S-EPMC3592442 | biostudies-literature
| S-EPMC8173916 | biostudies-literature
| S-EPMC7589345 | biostudies-literature
| S-EPMC4575192 | biostudies-literature
| S-EPMC3431719 | biostudies-literature
| S-EPMC7296898 | biostudies-literature
| S-EPMC4076012 | biostudies-literature
2005-04-12 | GSE2512 | GEO
| S-EPMC2691003 | biostudies-literature
2010-06-10 | E-GEOD-2512 | biostudies-arrayexpress