Unknown

Dataset Information

0

Detection and removal of barcode swapping in single-cell RNA-seq data.


ABSTRACT: Barcode swapping results in the mislabelling of sequencing reads between multiplexed samples on patterned flow-cell Illumina sequencing machines. This may compromise the validity of numerous genomic assays; however, the severity and consequences of barcode swapping remain poorly understood. We have used two statistical approaches to robustly quantify the fraction of swapped reads in two plate-based single-cell RNA-sequencing datasets. We found that approximately 2.5% of reads were mislabelled between samples on the HiSeq 4000, which is lower than previous reports. We observed no correlation between the swapped fraction of reads and the concentration of free barcode across plates. Furthermore, we have demonstrated that barcode swapping may generate complex but artefactual cell libraries in droplet-based single-cell RNA-sequencing studies. To eliminate these artefacts, we have developed an algorithm to exclude individual molecules that have swapped between samples in 10x Genomics experiments, allowing the continued use of cutting-edge sequencing machines for these assays.

SUBMITTER: Griffiths JA 

PROVIDER: S-EPMC6039488 | biostudies-literature | 2018 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Detection and removal of barcode swapping in single-cell RNA-seq data.

Griffiths Jonathan A JA   Richard Arianne C AC   Bach Karsten K   Lun Aaron T L ATL   Marioni John C JC  

Nature communications 20180710 1


Barcode swapping results in the mislabelling of sequencing reads between multiplexed samples on patterned flow-cell Illumina sequencing machines. This may compromise the validity of numerous genomic assays; however, the severity and consequences of barcode swapping remain poorly understood. We have used two statistical approaches to robustly quantify the fraction of swapped reads in two plate-based single-cell RNA-sequencing datasets. We found that approximately 2.5% of reads were mislabelled be  ...[more]

Similar Datasets

2018-06-25 | E-MTAB-6854 | biostudies-arrayexpress
| PRJEB27451 | ENA
| S-EPMC7714356 | biostudies-literature
| S-EPMC10769270 | biostudies-literature
2018-06-21 | E-MTAB-6843 | biostudies-arrayexpress
| S-EPMC7541488 | biostudies-literature
| S-EPMC6501316 | biostudies-literature
| S-EPMC10241145 | biostudies-literature
| S-EPMC8215916 | biostudies-literature
| S-EPMC10409753 | biostudies-literature