Unknown

Dataset Information

0

A Random Matrix Theory Approach to Denoise Single-Cell Data.


ABSTRACT: Single-cell technologies provide the opportunity to identify new cellular states. However, a major obstacle to the identification of biological signals is noise in single-cell data. In addition, single-cell data are very sparse. We propose a new method based on random matrix theory to analyze and denoise single-cell sequencing data. The method uses the universal distributions predicted by random matrix theory for the eigenvalues and eigenvectors of random covariance/Wishart matrices to distinguish noise from signal. In addition, we explain how sparsity can cause spurious eigenvector localization, falsely identifying meaningful directions in the data. We show that roughly 95% of the information in single-cell data is compatible with the predictions of random matrix theory, about 3% is spurious signal induced by sparsity, and only the last 2% reflects true biological signal. We demonstrate the effectiveness of our approach by comparing with alternative techniques in a variety of examples with marked cell populations.

SUBMITTER: Aparicio L 

PROVIDER: S-EPMC7660363 | biostudies-literature | 2020 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

A Random Matrix Theory Approach to Denoise Single-Cell Data.

Aparicio Luis L   Bordyuh Mykola M   Blumberg Andrew J AJ   Rabadan Raul R  

Patterns (New York, N.Y.) 20200504 3


Single-cell technologies provide the opportunity to identify new cellular states. However, a major obstacle to the identification of biological signals is noise in single-cell data. In addition, single-cell data are very sparse. We propose a new method based on random matrix theory to analyze and denoise single-cell sequencing data. The method uses the universal distributions predicted by random matrix theory for the eigenvalues and eigenvectors of random covariance/Wishart matrices to distingui  ...[more]

Similar Datasets

| S-EPMC9579216 | biostudies-literature
| S-EPMC7980374 | biostudies-literature
| S-EPMC11304254 | biostudies-literature
| S-EPMC4595991 | biostudies-literature
| S-EPMC5784053 | biostudies-literature
| S-EPMC11274112 | biostudies-literature
| S-EPMC4031172 | biostudies-literature
| S-EPMC5132142 | biostudies-literature
| S-EPMC6022547 | biostudies-literature
| S-EPMC7334504 | biostudies-literature