Dataset Information

IKAP-Identifying K mAjor cell Population groups in single-cell RNA-sequencing analysis.


ABSTRACT: BACKGROUND: In single-cell RNA-sequencing analysis, clustering cells into groups and differentiating cell groups by differentially expressed (DE) genes are 2 separate steps for investigating cell identity. However, the ability to differentiate between cell groups can be affected by the clustering. This interdependency often creates a bottleneck in the analysis pipeline, requiring researchers to repeat these 2 steps multiple times with different clustering parameters to identify a set of cell groups that are more differentiated and biologically relevant. FINDINGS: To accelerate this process, we have developed IKAP, an algorithm that identifies major cell groups and improves the differentiation of cell groups by systematically tuning the clustering parameters. We demonstrate that, with default parameters, IKAP successfully identifies major cell types such as T cells, B cells, natural killer cells, and monocytes in 2 peripheral blood mononuclear cell datasets and recovers major cell types in a previously published mouse cortex dataset. The major cell groups identified by IKAP present more distinguishing DE genes than cell groups generated by other combinations of clustering parameters. We further show that cell subtypes can be identified by recursively applying IKAP within identified major cell types, thereby delineating cell identities in a multi-layered ontology. CONCLUSIONS: By tuning the clustering parameters to identify major cell groups, IKAP greatly improves the automation of single-cell RNA-sequencing analysis, producing distinguishing DE genes and refining cell ontology from single-cell RNA-sequencing data.

SUBMITTER: Chen YC 

PROVIDER: S-EPMC6771546 | biostudies-literature | 2019 Oct

REPOSITORIES: biostudies-literature

Publications

IKAP-Identifying K mAjor cell Population groups in single-cell RNA-sequencing analysis.

Chen Yun-Ching, Suresh Abhilash, Underbayev Chingiz, Sun Clare, Singh Komudi, Seifuddin Fayaz, Wiestner Adrian, Pirooznia Mehdi

GigaScience 2019-10-01; 10



Similar Datasets

| S-EPMC5037372 | biostudies-literature
| S-EPMC8672480 | biostudies-literature
| S-EPMC8187165 | biostudies-literature
| S-EPMC10463720 | biostudies-literature
| S-EPMC5376499 | biostudies-literature
| S-EPMC7397487 | biostudies-literature
| S-EPMC9871437 | biostudies-literature
2022-07-14 | E-MTAB-11745 | biostudies-arrayexpress
| S-EPMC9910198 | biostudies-literature
| S-EPMC10658177 | biostudies-literature