Unknown

Dataset Information

0

Quality filtering of Illumina index reads mitigates sample cross-talk.


ABSTRACT: Multiplexing multiple samples during Illumina sequencing is a common practice and is rapidly growing in importance as the throughput of the platform increases. Misassignments during de-multiplexing, where sequences are associated with the wrong sample, are an overlooked error mode on the Illumina sequencing platform. This results in a low rate of cross-talk among multiplexed samples and can cause detrimental effects in studies requiring the detection of rare variants or when multiplexing a large number of samples.We observed rates of cross-talk averaging 0.24 % when multiplexing 14 different samples with unique i5 and i7 index sequences. This cross-talk rate corresponded to 254,632 misassigned reads on a single lane of the Illumina HiSeq 2500. Notably, all types of misassignment occur at similar rates: incorrect i5, incorrect i7, and incorrect sequence reads. We demonstrate that misassignments can be nearly eliminated by quality filtering of index reads while preserving about 90 % of the original sequences.Cross-talk among multiplexed samples is a significant error mode on the Illumina platform, especially if samples are only separated by a single unique index. Quality filtering of index sequences offers an effective solution to minimizing cross-talk among samples. Furthermore, we propose a straightforward method for verifying the extent of cross-talk between samples and optimizing quality score thresholds that does not require additional control samples and can even be performed post hoc on previous runs.

SUBMITTER: Wright ES 

PROVIDER: S-EPMC5097354 | biostudies-literature | 2016 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Quality filtering of Illumina index reads mitigates sample cross-talk.

Wright Erik Scott ES   Vetsigian Kalin Horen KH  

BMC genomics 20161104 1


<h4>Background</h4>Multiplexing multiple samples during Illumina sequencing is a common practice and is rapidly growing in importance as the throughput of the platform increases. Misassignments during de-multiplexing, where sequences are associated with the wrong sample, are an overlooked error mode on the Illumina sequencing platform. This results in a low rate of cross-talk among multiplexed samples and can cause detrimental effects in studies requiring the detection of rare variants or when m  ...[more]

Similar Datasets

| S-EPMC3684618 | biostudies-literature
| S-EPMC3531572 | biostudies-literature
| S-EPMC10320065 | biostudies-literature
| S-EPMC11222498 | biostudies-literature
| S-EPMC4471408 | biostudies-literature
| S-EPMC3605598 | biostudies-literature
| S-EPMC4191382 | biostudies-literature
| S-EPMC7071698 | biostudies-literature
| S-EPMC6858638 | biostudies-literature
| S-EPMC3822393 | biostudies-literature