Unknown

Dataset Information

0

Optimal marker gene selection for cell type discrimination in single cell analyses.


ABSTRACT: Single-cell technologies characterize complex cell populations across multiple data modalities at unprecedented scale and resolution. Multi-omic data for single cell gene expression, in situ hybridization, or single cell chromatin states are increasingly available across diverse tissue types. When isolating specific cell types from a sample of disassociated cells or performing in situ sequencing in collections of heterogeneous cells, one challenging task is to select a small set of informative markers that robustly enable the identification and discrimination of specific cell types or cell states as precisely as possible. Given single cell RNA-seq data and a set of cellular labels to discriminate, scGeneFit?selects gene markers that jointly optimize cell label recovery using label-aware compressive classification methods. This results in a substantially more robust and less redundant set of markers than existing methods, most of which identify markers that separate each cell label from the rest. When applied to a data set given a hierarchy of cell types as labels, the markers found by our method improves the recovery of the cell type hierarchy with fewer markers than existing methods using a computationally efficient and principled optimization.

SUBMITTER: Dumitrascu B 

PROVIDER: S-EPMC7895823 | biostudies-literature | 2021 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Optimal marker gene selection for cell type discrimination in single cell analyses.

Dumitrascu Bianca B   Villar Soledad S   Mixon Dustin G DG   Engelhardt Barbara E BE  

Nature communications 20210219 1


Single-cell technologies characterize complex cell populations across multiple data modalities at unprecedented scale and resolution. Multi-omic data for single cell gene expression, in situ hybridization, or single cell chromatin states are increasingly available across diverse tissue types. When isolating specific cell types from a sample of disassociated cells or performing in situ sequencing in collections of heterogeneous cells, one challenging task is to select a small set of informative m  ...[more]

Similar Datasets

| S-EPMC11294843 | biostudies-literature
| S-EPMC7536825 | biostudies-literature
| S-EPMC10895860 | biostudies-literature
| S-EPMC10864304 | biostudies-literature
| S-EPMC6837541 | biostudies-literature
| S-EPMC9010440 | biostudies-literature
| S-EPMC10048047 | biostudies-literature
| S-EPMC6298060 | biostudies-literature
| S-EPMC8494219 | biostudies-literature
| S-EPMC4320824 | biostudies-literature