Dataset Information

Integrating single-cell transcriptomic data across different conditions, technologies, and species

ABSTRACT: Computational single-cell RNA-seq (scRNA-seq) methods have been successfully applied to experiments representing a single condition, technology, or species to discover and define cellular phenotypes. However, identifying subpopulations of cells that are present across multiple datasets remains challenging. Here, we introduce an analytical strategy for integrating scRNA-seq datasets based on common sources of variation, enabling the identification of shared populations across datasets and downstream comparative analysis. We apply this approach, implemented in our R toolkit Seurat (http://satijalab.org/seurat/), to align scRNA-seq datasets of peripheral blood mononuclear cells (PBMCs) under resting and stimulated conditions, hematopoietic progenitors sequenced using two profiling technologies, and pancreatic cell ‘atlases’ generated from human and mouse islets. In each case, we learn distinct or transitional cell states jointly across datasets, while boosting statistical power through integrated analysis. Our approach facilitates general comparisons of scRNA-seq datasets, potentially deepening our understanding of how distinct cell states respond to perturbation, disease, and evolution.

ORGANISM(S): Homo sapiens

PROVIDER: GSE110513 | GEO | 2018/04/02

REPOSITORIES: GEO

Similar Datasets

Project description: Organ development is dictated by genes that are preferentially expressed in particular tissues or cell types. Clarifying gene expression profiles and identifying organ-specific genes provides a detailed understanding of organogenesis. Genome-wide analysis is an increasingly powerful tool for this purpose. The tooth is composed of enamel, dentin, and cementum, tissues that are distinct from those of other organs. Tooth-forming cell types are therefore unique and have a complex organization. The mechanisms of signal induction during tooth development are hard to resolve because the function of each cell type is still unclear. Previously, we performed CAGE (Cap Analysis of Gene Expression) on mouse tooth germ to identify genes preferentially expressed in the tooth. CAGE has an advantage in quantification because it counts short reads from the 5’ end without transcript-length bias. Single-cell RNA sequencing (scRNA-seq) is well suited to revealing gene expression in individual cells. The fundamental question of this study is how to identify tooth-specific and cell-type-specific genes. We address it by combining scRNA-seq and CAGE in a bioinformatic analysis. We obtained scRNA-seq datasets of 12,212 cells from postnatal day 1 (P1) mouse molars, and CAGE-seq datasets from P1 molars. scRNA-seq analysis revealed the spatio-temporal expression of tooth-related genes, and CAGE determined whether these genes are truly tooth-specific or ubiquitously expressed. Furthermore, we identified candidate genes as novel tissue- and cell-type-specific markers. Our results show that integrating scRNA-seq and CAGE-seq helps highlight the genes important for tooth development among numerous gene profiles.
These findings contribute to resolving the mechanism of tooth development and establish a basis for future tooth regeneration.

Project description: Endometriosis is a debilitating gynecological disorder affecting approximately 10% of the female population. Despite its prevalence, robust methods to classify and treat endometriosis remain elusive. Changes throughout the menstrual cycle in tissue size, architecture, cellular composition, and individual cell phenotypes make it extraordinarily challenging to identify markers or cell types associated with uterine pathologies, since disease-state alterations in gene and protein expression are confounded with cycle-phase variations. Here, we developed an integrated workflow to generate both proteomic and single-cell RNA-sequencing (scRNA-seq) data sets using tissues and cells isolated from the uteri of control and endometriotic donors. Using a linear mixed effect model (LMM), we identified proteins associated with cycle stage and disease, revealing a set of genes that drive separation across these two biological variables. Further, we analyzed our scRNA-seq data to identify cell types expressing the cycle- and disease-associated genes identified in our proteomic data. A module scoring approach was used to identify cell types driving the enrichment of certain biological pathways, revealing several pathways of interest across different cell subpopulations. Finally, we identified ligand-receptor pairs, including Axl/Tyro3 – Gas6, that may modulate interactions between endometrial macrophages and/or endometrial stromal/epithelial cells. Analysis of these signaling pathways in an independent cohort of endometrial biopsies revealed a significant decrease in Tyro3 expression in patients with endometriosis compared to controls, both transcriptionally and through histological staining. This measured decrease in Tyro3 in patients with disease could serve as a novel diagnostic biomarker or treatment avenue for patients with endometriosis.
Taken together, this integrated approach provides a framework for combining LMMs, proteomic, and RNA-seq data to deconvolve the complexities of uterine diseases and identify novel genes and pathways underlying endometriosis.

Project description: Pancreatic cancer is a complex disease with a desmoplastic stroma, extreme hypoxia, and inherent resistance to therapy. Understanding the signaling and adaptive response of such an aggressive cancer is key to making advances in therapeutic efficacy and understanding disease progression. Redox factor-1 (Ref-1), a redox signaling protein, regulates the DNA binding activity of several transcription factors, including HIF-1. The conversion of HIF-1 from an oxidized to a reduced state enhances its DNA binding. In our previously published work, knockdown of Ref-1 under normoxia resulted in altered gene expression patterns in pathways including EIF2, protein kinase A, and mTOR. In this study, single-cell RNA sequencing (scRNA-seq) and proteomics were used to explore the effects of Ref-1 on metabolic pathways under hypoxia. Results: We integrated the scRNA-seq data analysis with the proteomic analysis and found that the differentially expressed genes and pathways identified from the scRNA-seq data are highly consistent with the significant proteins observed in the proteomics data, especially for the upregulated cell cycle and transcription pathways and the downregulated metabolic, apoptosis, and signaling pathways under hypoxia. Conclusion: The scRNA-seq and proteomics data consistently demonstrated down-regulated central metabolism pathways in APE1/Ref-1 knockdown vs scrambled control under both normoxia and hypoxia conditions. Experimental Methods: scRNA-seq data comparing pancreatic cancer cells expressing less than 20% of the Ref-1 protein with scrambled control cells were analyzed using a left truncated mixture Gaussian model. Matched samples were also collected for bulk proteomic analysis of the four conditions. scRNA-seq data were validated using proteomics and qRT-PCR. Ref-1’s role in mitochondrial function was confirmed using mitochondrial function assays and qRT-PCR.
