Unknown

Dataset Information

0

A benchmark of batch-effect correction methods for single-cell RNA sequencing data.


ABSTRACT: BACKGROUND:Large-scale single-cell transcriptomic datasets generated using different technologies contain batch-specific systematic variations that present a challenge to batch-effect removal and data integration. With continued growth expected in scRNA-seq data, achieving effective batch integration with available computational resources is crucial. Here, we perform an in-depth benchmark study on available batch correction methods to determine the most suitable method for batch-effect removal. RESULTS:We compare 14 methods in terms of computational runtime, the ability to handle large datasets, and batch-effect correction efficacy while preserving cell type purity. Five scenarios are designed for the study: identical cell types with different technologies, non-identical cell types, multiple batches, big data, and simulated data. Performance is evaluated using four benchmarking metrics including kBET, LISI, ASW, and ARI. We also investigate the use of batch-corrected data to study differential gene expression. CONCLUSION:Based on our results, Harmony, LIGER, and Seurat 3 are the recommended methods for batch integration. Due to its significantly shorter runtime, Harmony is recommended as the first method to try, with the other methods as viable alternatives.

SUBMITTER: Tran HTN 

PROVIDER: S-EPMC6964114 | biostudies-literature | 2020 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

A benchmark of batch-effect correction methods for single-cell RNA sequencing data.

Tran Hoa Thi Nhu HTN   Ang Kok Siong KS   Chevrier Marion M   Zhang Xiaomeng X   Lee Nicole Yee Shin NYS   Goh Michelle M   Chen Jinmiao J  

Genome biology 20200116 1


<h4>Background</h4>Large-scale single-cell transcriptomic datasets generated using different technologies contain batch-specific systematic variations that present a challenge to batch-effect removal and data integration. With continued growth expected in scRNA-seq data, achieving effective batch integration with available computational resources is crucial. Here, we perform an in-depth benchmark study on available batch correction methods to determine the most suitable method for batch-effect r  ...[more]

Similar Datasets

| S-EPMC8617278 | biostudies-literature
| S-EPMC10781938 | biostudies-literature
| S-EPMC8324985 | biostudies-literature
| S-EPMC8053088 | biostudies-literature
| S-EPMC8579191 | biostudies-literature
| S-EPMC6918801 | biostudies-literature
| S-EPMC7330047 | biostudies-literature
| S-EPMC9848058 | biostudies-literature
| S-EPMC8720898 | biostudies-literature
| S-EPMC7444317 | biostudies-literature