Dataset Information

Domain adaptation for supervised integration of scRNA-seq data.


ABSTRACT: Large-scale scRNA-seq studies typically generate data in batches, which often induce nontrivial batch effects that need to be corrected. Given the global efforts for building cell atlases and the increasing number of annotated scRNA-seq datasets accumulated, we propose a supervised strategy for scRNA-seq data integration called SIDA (Supervised Integration using Domain Adaptation), which uses the cell type annotations to guide the integration of diverse batches. The supervised strategy is based on domain adaptation that was initially proposed in the computer vision field. We demonstrate that SIDA is able to generate comprehensive reference datasets that lead to improved accuracy in automated cell type mapping analyses.

SUBMITTER: Sun Y 

PROVIDER: S-EPMC10020569 | biostudies-literature | 2023 Mar

REPOSITORIES: biostudies-literature

Publications

Domain adaptation for supervised integration of scRNA-seq data.

Sun Y, Qiu P

Communications Biology, 2023-03-16, 1



Similar Datasets

| Accession | Repository |
| --- | --- |
| S-EPMC8157426 | biostudies-literature |
| S-EPMC10516353 | biostudies-literature |
| S-EPMC8344557 | biostudies-literature |
| S-EPMC6693154 | biostudies-literature |
| S-EPMC7141853 | biostudies-literature |
| S-EPMC9437856 | biostudies-literature |
| S-EPMC5888655 | biostudies-literature |
| S-EPMC6547431 | biostudies-literature |
| S-EPMC7647120 | biostudies-literature |
| S-BSST858 | biostudies-other |