Dataset Information

BFF and cellhashR: analysis tools for accurate demultiplexing of cell hashing data.

ABSTRACT:

Motivation

Single-cell sequencing methods provide previously impossible resolution into the transcriptome of individual cells. Cell hashing reduces single-cell sequencing costs by increasing capacity on droplet-based platforms. Cell hashing methods rely on demultiplexing algorithms to accurately classify droplets; however, assumptions underlying these algorithms limit accuracy of demultiplexing, ultimately impacting the quality of single-cell sequencing analyses.

Results

We present Bimodal Flexible Fitting (BFF) demultiplexing algorithms BFFcluster and BFFraw, a novel class of algorithms that rely on the single inviolable assumption that barcode count distributions are bimodal. We integrated these and other algorithms into cellhashR, a new R package that provides integrated QC and a single command to execute and compare multiple demultiplexing algorithms. We demonstrate that BFFcluster demultiplexing is both tunable and insensitive to issues with poorly behaved data that can confound other algorithms. Using two well-characterized reference datasets, we demonstrate that demultiplexing with BFF algorithms is accurate and consistent for both well-behaved and poorly behaved input data.

Availability and implementation

cellhashR is available as an R package at https://github.com/BimberLab/cellhashR. cellhashR version 1.0.3 was used for the analyses in this manuscript and is archived on Zenodo at https://www.doi.org/10.5281/zenodo.6402477.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Boggy GJ

PROVIDER: S-EPMC9113275 | biostudies-literature | 2022 May

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

BFF and cellhashR: analysis tools for accurate demultiplexing of cell hashing data.

Boggy Gregory J GJ McElfresh G W GW Mahyari Eisa E Ventura Abigail B AB Hansen Scott G SG Picker Louis J LJ Bimber Benjamin N BN

Bioinformatics (Oxford, England) 20220501 10

<h4>Motivation</h4>Single-cell sequencing methods provide previously impossible resolution into the transcriptome of individual cells. Cell hashing reduces single-cell sequencing costs by increasing capacity on droplet-based platforms. Cell hashing methods rely on demultiplexing algorithms to accurately classify droplets; however, assumptions underlying these algorithms limit accuracy of demultiplexing, ultimately impacting the quality of single-cell sequencing analyses.<h4>Results</h4>We presen ...[more]

PMID: 35561167

Dataset Information

BFF and cellhashR: analysis tools for accurate demultiplexing of cell hashing data.

Motivation

Results

Availability and implementation

Supplementary information

Publications

BFF and cellhashR: analysis tools for accurate demultiplexing of cell hashing data.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

demuxSNP: supervised demultiplexing single-cell RNA sequencing using cell hashing and SNPs.
| S-EPMC11604057 | biostudies-literature

A hybrid demultiplexing strategy that improves performance and robustness of cell hashing.
| S-EPMC10104010 | biostudies-literature

CONSULT: accurate contamination removal using locality-sensitive hashing.
| S-EPMC8340999 | biostudies-literature

An integrative protocol for one-step PCR amplicon library construction and accurate demultiplexing of pooled sequencing data.
| S-EPMC10689312 | biostudies-literature

CONSULT-II: accurate taxonomic identification and profiling using locality-sensitive hashing.
| S-EPMC10985673 | biostudies-literature

Evaluation of genetic demultiplexing of single-cell sequencing data from model species.
| S-EPMC10192724 | biostudies-literature

demuxmix: Demultiplexing oligonucleotide-barcoded single-cell RNA sequencing data with regression mixture models.
| S-EPMC9901175 | biostudies-literature

demuxmix: demultiplexing oligonucleotide-barcoded single-cell RNA sequencing data with regression mixture models.
| S-EPMC10412409 | biostudies-literature

MitoSort: Robust Demultiplexing of Pooled Single-cell Genomic Data Using Endogenous Mitochondrial Variants.
| S-EPMC11671100 | biostudies-literature

Vireo: Bayesian demultiplexing of pooled single-cell RNA-seq data without genotype reference.
| S-EPMC6909514 | biostudies-literature