ABSTRACT:
SUBMITTER: Arredondo-Alonso S
PROVIDER: S-EPMC6321875 | biostudies-literature | 2018 Nov
REPOSITORIES: biostudies-literature
Sergio Arredondo-Alonso, Malbert R. C. Rogers, Johanna C. Braat, Tess D. Verschuuren, Janetta Top, Jukka Corander, Rob J. L. Willems, Anita C. Schürch
Microbial Genomics, 2018-11-01, issue 11
Assembly of bacterial short-read whole-genome sequencing data frequently results in hundreds of contigs for which the origin, plasmid or chromosome, is unclear. Complete genomes resolved by long-read sequencing can be used to generate and label short-read contigs. These were used to train several popular machine learning methods to classify the origin of contigs from Enterococcus faecium, Klebsiella pneumoniae and Escherichia coli using pentamer frequencies. We selected support-vector machine (S ...
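
The pentamer-frequency features mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function name `pentamer_frequencies`, the fixed A/C/G/T alphabet, the choice to skip windows containing ambiguous bases, and the toy sequence are all assumptions made for illustration. It only shows how a contig could be turned into a fixed-length vector of 1024 relative 5-mer frequencies that a classifier might take as input.

```python
from itertools import product


def pentamer_frequencies(sequence, k=5):
    """Return relative frequencies of all 4**k DNA k-mers found in `sequence`.

    Hypothetical helper for illustration; k-mers are ordered lexicographically
    over the alphabet A, C, G, T, so the vector always has 4**k entries.
    """
    alphabet = "ACGT"
    kmers = ["".join(p) for p in product(alphabet, repeat=k)]
    counts = dict.fromkeys(kmers, 0)

    seq = sequence.upper()
    for i in range(len(seq) - k + 1):
        window = seq[i:i + k]
        # Assumption: windows with N or other ambiguity codes are skipped.
        if window in counts:
            counts[window] += 1

    total = sum(counts.values())
    if total == 0:
        # Sequence shorter than k, or no unambiguous window at all.
        return [0.0] * len(kmers)
    return [counts[kmer] / total for kmer in kmers]


# Toy contig, invented for this example.
features = pentamer_frequencies("ACGTACGTACGTNACGTA")
```

Each labelled contig (plasmid or chromosome) would yield one such vector, and the resulting matrix could then be passed to a classifier of the reader's choice.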