Unknown

Dataset Information

0

BFF and cellhashR: analysis tools for accurate demultiplexing of cell hashing data.


ABSTRACT:

Motivation

Single-cell sequencing methods provide previously impossible resolution into the transcriptome of individual cells. Cell hashing reduces single-cell sequencing costs by increasing capacity on droplet-based platforms. Cell hashing methods rely on demultiplexing algorithms to accurately classify droplets; however, assumptions underlying these algorithms limit accuracy of demultiplexing, ultimately impacting the quality of single-cell sequencing analyses.

Results

We present Bimodal Flexible Fitting (BFF) demultiplexing algorithms BFFcluster and BFFraw, a novel class of algorithms that rely on the single inviolable assumption that barcode count distributions are bimodal. We integrated these and other algorithms into cellhashR, a new R package that provides integrated QC and a single command to execute and compare multiple demultiplexing algorithms. We demonstrate that BFFcluster demultiplexing is both tunable and insensitive to issues with poorly behaved data that can confound other algorithms. Using two well-characterized reference datasets, we demonstrate that demultiplexing with BFF algorithms is accurate and consistent for both well-behaved and poorly behaved input data.

Availability and implementation

cellhashR is available as an R package at https://github.com/BimberLab/cellhashR. cellhashR version 1.0.3 was used for the analyses in this manuscript and is archived on Zenodo at https://www.doi.org/10.5281/zenodo.6402477.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Boggy GJ 

PROVIDER: S-EPMC9113275 | biostudies-literature | 2022 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

BFF and cellhashR: analysis tools for accurate demultiplexing of cell hashing data.

Boggy Gregory J GJ   McElfresh G W GW   Mahyari Eisa E   Ventura Abigail B AB   Hansen Scott G SG   Picker Louis J LJ   Bimber Benjamin N BN  

Bioinformatics (Oxford, England) 20220501 10


<h4>Motivation</h4>Single-cell sequencing methods provide previously impossible resolution into the transcriptome of individual cells. Cell hashing reduces single-cell sequencing costs by increasing capacity on droplet-based platforms. Cell hashing methods rely on demultiplexing algorithms to accurately classify droplets; however, assumptions underlying these algorithms limit accuracy of demultiplexing, ultimately impacting the quality of single-cell sequencing analyses.<h4>Results</h4>We presen  ...[more]

Similar Datasets

| S-EPMC11604057 | biostudies-literature
| S-EPMC10104010 | biostudies-literature
| S-EPMC8340999 | biostudies-literature
| S-EPMC10689312 | biostudies-literature
| S-EPMC10985673 | biostudies-literature
| S-EPMC10192724 | biostudies-literature
| S-EPMC9901175 | biostudies-literature
| S-EPMC10412409 | biostudies-literature
| S-EPMC11671100 | biostudies-literature
| S-EPMC6909514 | biostudies-literature