Dataset Information

Validation of a Bioinformatics Workflow for Routine Analysis of Whole-Genome Sequencing Data and Related Challenges for Pathogen Typing in a European National Reference Center: Neisseria meningitidis as a Proof-of-Concept.


ABSTRACT: Although whole-genome sequencing (WGS) is a well-established research method, its use for routine molecular typing and pathogen characterization remains a substantial challenge due to the required bioinformatics resources and/or expertise. Moreover, many national reference laboratories and centers, as well as other laboratories working under a quality system, require extensive validation to demonstrate that employed methods are "fit-for-purpose" and provide high-quality results. A harmonized framework with guidelines for the validation of WGS workflows does not yet exist, however, despite several recent case studies highlighting the urgent need for one. We present a validation strategy focusing specifically on the exhaustive characterization of the bioinformatics analysis of a WGS workflow designed to replace conventionally employed molecular typing methods for microbial isolates in a representative small-scale laboratory, using the pathogen Neisseria meningitidis as a proof-of-concept. We adapted several classically employed performance metrics to three different bioinformatics assays: resistance gene characterization (based on the ARG-ANNOT, ResFinder, CARD, and NDARO databases), several commonly employed typing schemas (including, among others, core genome multilocus sequence typing), and serogroup determination. We analyzed a core validation dataset of 67 well-characterized samples that were sequenced in-house and had been typed by classical genotypic and/or phenotypic methods, allowing us to evaluate the repeatability, reproducibility, accuracy, precision, sensitivity, and specificity of the different bioinformatics assays. We also analyzed an extended validation dataset composed of publicly available WGS data for 64 samples by comparing the results of the different bioinformatics assays against results obtained from commonly used bioinformatics tools.
We demonstrate high performance, with values for all performance metrics >87%, >97%, and >90% for the resistance gene characterization, sequence typing, and serogroup determination assays, respectively, for both validation datasets. Our WGS workflow has been made publicly available as a "push-button" pipeline for Illumina data at https://galaxy.sciensano.be to showcase its implementation for non-profit and/or academic usage. Our validation strategy can be adapted to other WGS workflows for other pathogens of interest and demonstrates the added value and feasibility of integrating WGS into routine use in an applied public health setting.
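
As a rough illustration of four of the performance metrics named in the abstract, the sketch below computes accuracy, precision, sensitivity, and specificity from confusion-matrix counts. It uses the standard textbook definitions; the abstract states that the authors adapted the classical metrics to each assay, so these formulas are an assumption and may differ from the definitions used in the study. The `Counts` class, the `performance_metrics` function, and the example numbers are hypothetical and not taken from the paper.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Counts:
    """Confusion-matrix counts for one assay (hypothetical container)."""
    tp: int  # true positives
    fp: int  # false positives
    tn: int  # true negatives
    fn: int  # false negatives


def _ratio(numerator: int, denominator: int) -> float:
    # Return NaN instead of raising when a metric is undefined.
    return numerator / denominator if denominator else float("nan")


def performance_metrics(c: Counts) -> dict[str, float]:
    """Textbook definitions; assumed, not confirmed to match the study."""
    total = c.tp + c.fp + c.tn + c.fn
    return {
        "accuracy": _ratio(c.tp + c.tn, total),
        "precision": _ratio(c.tp, c.tp + c.fp),
        "sensitivity": _ratio(c.tp, c.tp + c.fn),
        "specificity": _ratio(c.tn, c.tn + c.fp),
    }


if __name__ == "__main__":
    # Made-up counts, for illustration only.
    example = Counts(tp=90, fp=5, tn=95, fn=10)
    for name, value in performance_metrics(example).items():
        print(f"{name}: {value:.1%}")
```

Repeatability and reproducibility are not covered here, since they compare results across repeated runs rather than against a reference classification.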

SUBMITTER: Bogaerts B 

PROVIDER: S-EPMC6414443 | biostudies-literature | 2019

REPOSITORIES: biostudies-literature

Publications

Validation of a Bioinformatics Workflow for Routine Analysis of Whole-Genome Sequencing Data and Related Challenges for Pathogen Typing in a European National Reference Center: Neisseria meningitidis as a Proof-of-Concept.

Bogaerts B, Winand R, Fu Q, Van Braekel J, Ceyssens PJ, Mattheus W, Bertrand S, De Keersmaecker SCJ, Roosens NHC, Vanneste K

Frontiers in Microbiology, 2019-03-06


Similar Datasets

| S-EPMC2762044 | biostudies-literature
| S-EPMC96214 | biostudies-literature
| S-EPMC5405183 | biostudies-literature
| S-EPMC6930190 | biostudies-literature
| S-EPMC7685892 | biostudies-literature
| S-EPMC149584 | biostudies-literature
| S-EPMC3372157 | biostudies-literature
2012-10-05 | GSE38033 | GEO
2009-11-11 | GSE18951 | GEO
| S-EPMC2958009 | biostudies-literature