Project description:Modest target sequencing yield improvements using Nanopore adaptive sampling to reduce human DNA contamination from clinical tissue samples
Project description:Bacterial plasmids play a major role in the spread of antibiotic resistance genes. However, their characterization via DNA sequencing suffers from the low abundance of plasmid DNA in those samples. Although sample preparation methods can enrich the proportion of plasmid DNA before sequencing, these methods are expensive and laborious, and they might introduce a bias by enriching only for specific plasmid DNA sequences. Nanopore adaptive sampling could overcome these issues by rejecting uninteresting DNA molecules during the sequencing process. In this study, we assess the application of adaptive sampling for the enrichment of low-abundant plasmids in known bacterial isolates using two different adaptive sampling tools. We show that a significant enrichment can be achieved even on expired flow cells. By applying adaptive sampling, we also improve the quality of de novo plasmid assemblies and reduce the sequencing time. However, our experiments also highlight issues with adaptive sampling if target and non-target sequences span similar regions.ImportanceAntimicrobial resistance causes millions of deaths every year. Mobile genetic elements like bacterial plasmids are key drivers for the dissemination of antimicrobial resistance genes. This makes the characterization of plasmids via DNA sequencing an important tool for clinical microbiologists. Since plasmids are often underrepresented in bacterial samples, plasmid sequencing can be challenging and laborious. To accelerate the sequencing process, we evaluate nanopore adaptive sampling as an in silico method for the enrichment of low-abundant plasmids. Our results show the potential of this cost-efficient method for future plasmid research but also indicate issues that arise from using reference sequences.
Project description:DNA sequencing using nanopore technologies with the affordable MinION device is useful for the identification and characterization of structural variants, long haplotypes, sequencing of repetitive regions and identification of epigenetic modifications. The main limitation of this approach is the low coverage obtained, which might be avoided by adaptive sampling, a computationally controlled method of enrichment for targeted genomic regions. This study dissects the factors involved in the enrichment by adaptive sampling of a panel of 18 human genome regions containing 20 genes implicated in breast cancer sequenced in 16 patients with familial breast cancer negative for NGS screening. An average coverage of 2.0x was obtained for the whole genome and 5.1x for selected regions. Sequencing time was the main factor improving coverage. The selection of long reads (>1 Kb) did not improve the enrichment. The length of the selected region, which in our study ranged from 126 to 565 Kb, did not play a significant role in enrichment. However, the region containing PMS2 showed significantly lower coverage, which could be explained by the high number of PMS2 pseudogenes (N = 14), which were also enriched. Our study shows new evidence of enrichment obtained by adaptive sampling in a panel of genomic regions and shows parameters, the relevance of sequencing time and the role of pseudogenes, that improve the enrichment yield with no library reloading or GPU use, data useful for a more efficient application of this procedure in future studies.
Project description:MotivationNanopore sequencers allow targeted sequencing of interesting nucleotide sequences by rejecting other sequences from individual pores. This feature facilitates the enrichment of low-abundant sequences by depleting overrepresented ones in-silico. Existing tools for adaptive sampling either apply signal alignment, which cannot handle human-sized reference sequences, or apply read mapping in sequence space relying on fast graphical processing units (GPU) base callers for real-time read rejection. Using nanopore long-read mapping tools is also not optimal when mapping shorter reads as usually analyzed in adaptive sampling applications.ResultsHere, we present a new approach for nanopore adaptive sampling that combines fast CPU and GPU base calling with read classification based on Interleaved Bloom Filters. ReadBouncer improves the potential enrichment of low abundance sequences by its high read classification sensitivity and specificity, outperforming existing tools in the field. It robustly removes even reads belonging to large reference sequences while running on commodity hardware without GPUs, making adaptive sampling accessible for in-field researchers. Readbouncer also provides a user-friendly interface and installer files for end-users without a bioinformatics background.Availability and implementationThe C++ source code is available at https://gitlab.com/dacs-hpi/readbouncer.Supplementary informationSupplementary data are available at Bioinformatics online.
Project description:Pharmacogenomics (PGx) studies the impact of interindividual genomic variation on drug response, allowing the opportunity to tailor the dosing regimen for each patient. Current targeted PGx testing platforms are mainly based on microarray, polymerase chain reaction, or short-read sequencing. Despite demonstrating great value for the identification of single nucleotide variants (SNVs) and insertion/deletions (INDELs), these assays do not permit identification of large structural variants, nor do they allow unambiguous haplotype phasing for star-allele assignment. Here, we used Oxford Nanopore Technologies' adaptive sampling to enrich a panel of 1,036 genes with well-documented PGx relevance extracted from the Pharmacogenomics Knowledge Base (PharmGKB). By evaluating concordance with existing truth sets, we demonstrate accurate variant and star-allele calling for five Genome in a Bottle reference samples. We show that up to three samples can be multiplexed on one PromethION flow cell without a significant drop in variant calling performance, resulting in 99.35% and 99.84% recall and precision for the targeted variants, respectively. This work advances the use of nanopore sequencing in clinical PGx settings.