Genomic

Dataset Information

0

NextGen/GENESiPS


ABSTRACT:

Variability in induced pluripotent stem cell (iPSC) lines remains a roadblock for disease modeling and regenerative medicine. Through linear mixed models we have described different sources of gene expression variability from RNA sequencing data in 317 human iPSC lines from 101 individuals. We found that ~50% of genome-wide expression variability is explained by variation across individuals and identified a set of expression quantitative trait loci that contribute to this variation. These analyses coupled with allele specific expression show that iPSCs retain a subject-specific gene expression pattern. Pathway enrichment and key driver analyses, based on predictive causal gene networks, found that Polycomb targets explain a significant part of the non-genetic variability present in iPSCs within and across individuals. These publically available iPSC lines and genetic datasets will be a resource to the scientific community and will open new avenues to reduce variability in iPSCs and improve their utility in disease modeling.

SNP array data from individuals included in RNA-seq transcriptome profiling study of human induced pluripotent stem cells to characterize gene expression variation across individuals and within multiple iPSC lines from the same individual. Genotyping was performed on patient blood.

Data availability:
  • SNP-genotyping: dbGaP - current study
  • RNA-seq counts: GEO - GSE79636
  • FASTQ files: SRA - SRP072417

PROVIDER: phs001139 | dbGaP |

SECONDARY ACCESSION(S): PRJNA324736PRJNA324735

REPOSITORIES: dbGaP

Dataset's files

Source:

Similar Datasets

2013-07-26 | E-GEOD-49231 | biostudies-arrayexpress
2013-07-26 | GSE49231 | GEO
2016-12-28 | GSE79636 | GEO
| phs001341 | dbGaP
2022-07-19 | GSE193571 | GEO
2014-12-17 | E-GEOD-64263 | biostudies-arrayexpress
2015-08-03 | GSE69868 | GEO
2023-10-03 | GSE182758 | GEO
2014-01-23 | E-GEOD-52431 | biostudies-arrayexpress
2015-11-19 | E-GEOD-71878 | biostudies-arrayexpress