Unknown,Transcriptomics,Genomics,Proteomics

Dataset Information

0

Identification of building principles of methylation states at CG rich regions by high-throughput editing of a mammalian genome


ABSTRACT: Methylation is a repressive modification of DNA prevalent throughout mammalian genomes yet mostly absent at CG rich stretches referred to as CGI. Here we identify their building principles by parallel genomic targeting of sequence libraries. Iterative insertions generated over 3,000 variants of genome-derived and artificial sequences at the same genomic site. Single molecule profiling of the methylation status of this collection allowed modeling the contribution of CG content and DNA binding factors towards the unmethylated state. It made the surprising prediction that the majority of CGs within endogenous islands are susceptible to methylation changes modulated by the presence of transcription factors, which is indeed confirmed by genome-wide methylation dynamics during multiple cellular differentiations. Our model further predicts blocks of constitutively unmethylated CGs independent from TF binding, which have a median size of ~300bp but are only present in half of all islands. Their constitutively unmethylated state is a hallmark of untransformed cells but their increased methylation is a specific and predictive feature of cancer. This study quantifies the two principal mechanisms governing methylation patterns in mammalian genomes. It provides a framework to interpret methylation data across normal and cancer samples and refines the concept of CpG islands. Methylation is a repressive modification of DNA prevalent throughout mammalian genomes yet mostly absent at CG rich stretches referred to as CGI. Here we identify their building principles by parallel genomic targeting of sequence libraries. Iterative insertions generated over 3,000 variants of genome-derived and artificial sequences at the same genomic site. Single molecule profiling of the methylation status of this collection allowed modeling the contribution of CG content and DNA binding factors towards the unmethylated state. It made the surprising prediction that the majority of CGs within endogenous islands are susceptible to methylation changes modulated by the presence of transcription factors, which is indeed confirmed by genome-wide methylation dynamics during multiple cellular differentiations. Our model further predicts blocks of constitutively unmethylated CGs independent from TF binding, which have a median size of ~300bp but are only present in half of all islands. Their constitutively unmethylated state is a hallmark of untransformed cells but their increased methylation is a specific and predictive feature of cancer. This study quantifies the two principal mechanisms governing methylation patterns in mammalian genomes. It provides a framework to interpret methylation data across normal and cancer samples and refines the concept of CpG islands. Libraries of DNA sequences were constructed either by mouse genome (129S6) or E.coli genome (NC_010473.1) subrepresentation or custom synthesis. DNA fragments were inserted into the genome of mouse embryonic stem cells by recombination mediated casette exchange (RMCE) at the B-globin locus. Methylation status of the inserted DNA sequences was profiled by bisulfite sequencing using a pair of universal primers flanking the fragments.

ORGANISM(S): Mus musculus

SUBMITTER: Dirk Schuebeler 

PROVIDER: E-GEOD-51170 | biostudies-arrayexpress |

REPOSITORIES: biostudies-arrayexpress

altmetric image

Publications

High-throughput engineering of a mammalian genome reveals building principles of methylation states at CG rich regions.

Krebs Arnaud R AR   Dessus-Babus Sophie S   Burger Lukas L   Schübeler Dirk D  

eLife 20140926


The majority of mammalian promoters are CpG islands; regions of high CG density that require protection from DNA methylation to be functional. Importantly, how sequence architecture mediates this unmethylated state remains unclear. To address this question in a comprehensive manner, we developed a method to interrogate methylation states of hundreds of sequence variants inserted at the same genomic site in mouse embryonic stem cells. Using this assay, we were able to quantify the contribution of  ...[more]

Similar Datasets

2014-09-30 | GSE51170 | GEO
2017-02-14 | GSE86323 | GEO
2017-02-14 | GSE76346 | GEO
2020-12-30 | GSE142053 | GEO
2020-12-30 | GSE142052 | GEO
2020-12-30 | GSE142051 | GEO
2012-02-15 | GSE30199 | GEO
2012-02-15 | GSE33468 | GEO
2012-02-15 | GSE33579 | GEO
2014-03-13 | E-GEOD-46692 | biostudies-arrayexpress