Unknown

Dataset Information

0

SASI-Seq: sample assurance Spike-Ins, and highly differentiating 384 barcoding for Illumina sequencing.


ABSTRACT: A minor but significant fraction of samples subjected to next-generation sequencing methods are either mixed-up or cross-contaminated. These events can lead to false or inconclusive results. We have therefore developed SASI-Seq; a process whereby a set of uniquely barcoded DNA fragments are added to samples destined for sequencing. From the final sequencing data, one can verify that all the reads derive from the original sample(s) and not from contaminants or other samples.By adding a mixture of three uniquely barcoded amplicons, of different sizes spanning the range of insert sizes one would normally use for Illumina sequencing, at a spike-in level of approximately 0.1%, we demonstrate that these fragments remain intimately associated with the sample. They can be detected following even the tightest size selection regimes or exome enrichment and can report the occurrence of sample mix-ups and cross-contamination.As a consequence of this work, we have designed a set of 384 eleven-base Illumina barcode sequences that are at least 5 changes apart from each other, allowing for single-error correction and very low levels of barcode misallocation due to sequencing error.SASI-Seq is a simple, inexpensive and flexible tool that enables sample assurance, allows deconvolution of sample mix-ups and reports levels of cross-contamination between samples throughout NGS workflows.

SUBMITTER: Quail MA 

PROVIDER: S-EPMC4008303 | biostudies-literature | 2014 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

SASI-Seq: sample assurance Spike-Ins, and highly differentiating 384 barcoding for Illumina sequencing.

Quail Michael A MA   Smith Miriam M   Jackson David D   Leonard Steven S   Skelly Thomas T   Swerdlow Harold P HP   Gu Yong Y   Ellis Peter P  

BMC genomics 20140207


<h4>Background</h4>A minor but significant fraction of samples subjected to next-generation sequencing methods are either mixed-up or cross-contaminated. These events can lead to false or inconclusive results. We have therefore developed SASI-Seq; a process whereby a set of uniquely barcoded DNA fragments are added to samples destined for sequencing. From the final sequencing data, one can verify that all the reads derive from the original sample(s) and not from contaminants or other samples.<h4  ...[more]

Similar Datasets

| S-EPMC8923058 | biostudies-literature
| S-EPMC6428340 | biostudies-literature
| S-EPMC5807891 | biostudies-literature
| S-EPMC9942688 | biostudies-literature
| S-EPMC10048438 | biostudies-literature
2010-04-18 | E-GEOD-20580 | biostudies-arrayexpress
| S-EPMC9498777 | biostudies-literature
2011-06-01 | GSE29505 | GEO
| S-EPMC6281648 | biostudies-literature
| PRJNA490867 | ENA