Unknown

Dataset Information

0

Benchmarking principal component analysis for large-scale single-cell RNA-sequencing.


ABSTRACT: BACKGROUND:Principal component analysis (PCA) is an essential method for analyzing single-cell RNA-seq (scRNA-seq) datasets, but for large-scale scRNA-seq datasets, computation time is long and consumes large amounts of memory. RESULTS:In this work, we review the existing fast and memory-efficient PCA algorithms and implementations and evaluate their practical application to large-scale scRNA-seq datasets. Our benchmark shows that some PCA algorithms based on Krylov subspace and randomized singular value decomposition are fast, memory-efficient, and more accurate than the other algorithms. CONCLUSION:We develop a guideline to select an appropriate PCA implementation based on the differences in the computational environment of users and developers.

SUBMITTER: Tsuyuzaki K 

PROVIDER: S-EPMC6970290 | biostudies-literature | 2020 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Benchmarking principal component analysis for large-scale single-cell RNA-sequencing.

Tsuyuzaki Koki K   Sato Hiroyuki H   Sato Kenta K   Nikaido Itoshi I  

Genome biology 20200120 1


<h4>Background</h4>Principal component analysis (PCA) is an essential method for analyzing single-cell RNA-seq (scRNA-seq) datasets, but for large-scale scRNA-seq datasets, computation time is long and consumes large amounts of memory.<h4>Results</h4>In this work, we review the existing fast and memory-efficient PCA algorithms and implementations and evaluate their practical application to large-scale scRNA-seq datasets. Our benchmark shows that some PCA algorithms based on Krylov subspace and r  ...[more]

Similar Datasets

| S-EPMC7460854 | biostudies-literature
| S-EPMC3981753 | biostudies-literature
| S-EPMC7798298 | biostudies-literature
| S-EPMC7897250 | biostudies-literature
| S-EPMC11245320 | biostudies-literature
| S-EPMC8339294 | biostudies-literature
| S-EPMC8672480 | biostudies-literature
| S-EPMC7437817 | biostudies-literature
| S-EPMC5802054 | biostudies-other
| S-EPMC9746874 | biostudies-literature