Unknown

Dataset Information

0

DoubletD: detecting doublets in single-cell DNA sequencing data.


ABSTRACT:

Motivation

While single-cell DNA sequencing (scDNA-seq) has enabled the study of intratumor heterogeneity at an unprecedented resolution, current technologies are error-prone and often result in doublets where two or more cells are mistaken for a single cell. Not only do doublets confound downstream analyses, but the increase in doublet rate is also a major bottleneck preventing higher throughput with current single-cell technologies. Although doublet detection and removal are standard practice in scRNA-seq data analysis, options for scDNA-seq data are limited. Current methods attempt to detect doublets while also performing complex downstream analyses tasks, leading to decreased efficiency and/or performance.

Results

We present doubletD, the first standalone method for detecting doublets in scDNA-seq data. Underlying our method is a simple maximum likelihood approach with a closed-form solution. We demonstrate the performance of doubletD on simulated data as well as real datasets, outperforming current methods for downstream analysis of scDNA-seq data that jointly infer doublets as well as standalone approaches for doublet detection in scRNA-seq data. Incorporating doubletD in scDNA-seq analysis pipelines will reduce complexity and lead to more accurate results.

Availability and implementation

https://github.com/elkebir-group/doubletD.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Weber LL 

PROVIDER: S-EPMC8275324 | biostudies-literature | 2021 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

doubletD: detecting doublets in single-cell DNA sequencing data.

Weber Leah L LL   Sashittal Palash P   El-Kebir Mohammed M  

Bioinformatics (Oxford, England) 20210701 Suppl_1


<h4>Motivation</h4>While single-cell DNA sequencing (scDNA-seq) has enabled the study of intratumor heterogeneity at an unprecedented resolution, current technologies are error-prone and often result in doublets where two or more cells are mistaken for a single cell. Not only do doublets confound downstream analyses, but the increase in doublet rate is also a major bottleneck preventing higher throughput with current single-cell technologies. Although doublet detection and removal are standard p  ...[more]

Similar Datasets

| S-EPMC6983270 | biostudies-literature
| S-EPMC9805559 | biostudies-literature
| S-EPMC7703774 | biostudies-literature
| S-EPMC9151659 | biostudies-literature
| S-EPMC6625319 | biostudies-literature
| S-EPMC11840951 | biostudies-literature
| S-EPMC6022575 | biostudies-literature
| S-EPMC8198695 | biostudies-literature
| S-EPMC6900933 | biostudies-literature
| S-EPMC9092674 | biostudies-literature