Unknown

Dataset Information

0

Characterization and mitigation of fragmentation enzyme-induced dual stranded artifacts.


ABSTRACT: High-throughput short-read sequencing relies on fragmented DNA for optimal sampling of input nucleic acid. Several vendors now offer proprietary enzyme cocktails as a cheaper and more streamlined method of fragmentation when compared to acoustic shearing. We have discovered that these enzymes induce the formation of library molecules containing regions of nearby DNA from opposite strands. Sequencing reads derived from these molecules can lead to artifact-derived variant calls appearing at variant allele frequencies <5%. We present Fragmentation Artifact Detection and Elimination (FADE), software to remove these artifacts from mapped reads and mitigate artifact-related effects on downstream analysis. We find that the artifacts principally affect downstream analyses that are sensitive to a 1-3% artifact bias in the sequencing reads, such as targeted resequencing and rare variant discovery.

SUBMITTER: Gregory T 

PROVIDER: S-EPMC7531576 | biostudies-literature | 2020 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Characterization and mitigation of fragmentation enzyme-induced dual stranded artifacts.

Gregory Thomas T   Ngankeu Apollinaire A   Orwick Shelley S   Kautto Esko A EA   Woyach Jennifer A JA   Byrd John C JC   Blachly James S JS  

NAR genomics and bioinformatics 20201002 4


High-throughput short-read sequencing relies on fragmented DNA for optimal sampling of input nucleic acid. Several vendors now offer proprietary enzyme cocktails as a cheaper and more streamlined method of fragmentation when compared to acoustic shearing. We have discovered that these enzymes induce the formation of library molecules containing regions of nearby DNA from opposite strands. Sequencing reads derived from these molecules can lead to artifact-derived variant calls appearing at varian  ...[more]

Similar Datasets

| S-EPMC10908179 | biostudies-literature
| S-EPMC6941819 | biostudies-literature
| S-EPMC3322347 | biostudies-literature
2019-01-05 | GSE124686 | GEO
| S-EPMC5562503 | biostudies-other
| S-EPMC6820691 | biostudies-literature
| S-EPMC8727365 | biostudies-literature
| S-EPMC5059735 | biostudies-literature
| S-EPMC4574305 | biostudies-literature
| S-EPMC3507990 | biostudies-literature