Genomics

Dataset Information

0

Stable unmethylated DNA demarcates expressed genes and their cis-regulatory space in plant genomes


ABSTRACT: The genomic sequences of diverse varieties of many crop species continue to be produced at a frenetic pace. However, it remains challenging to develop complete annotations of functional genes and regulatory elements in these genomes. Here, we explore the potential to use DNA methylation profiles to develop more complete and refined annotations. Using leaf tissue in maize, we define ~100,000 unmethylated regions (UMRs) that account for 5.8% of the genome; 33,375 UMRs (1.3% of the genome) are found greater than 2 kilobase pairs from genes. UMRs are highly stable in multiple vegetative tissues and they capture the vast majority of accessible chromatin regions from leaf tissue. However, many UMRs are not accessible in leaf (leaf-iUMRs) and these represent a set of genomic regions with potential to become accessible in specific cell types or developmental stages. Leaf-iUMRs often occur near genes that are expressed in other tissues and are enriched for transcription factor (TF) binding sites of TFs that are also not expressed in leaf tissue. The leaf-iUMRs exhibit unique chromatin modification patterns and are enriched for chromatin interactions with nearby genes. The total UMRs space in four additional monocots ranges from 80-120 megabases, which is remarkably similar considering the range in genome size of 271 megabases to 4.8 gigabases. In summary, based on the profile from a single tissue, DNA methylation signatures pinpoint both accessible regions and regions poised to become accessible or expressed in other tissues. Thus, UMRs can provide powerful filters to distill large genomes down to the small fraction putative functional elements and facilitate the discovery of tens of thousands of novel candidate regulatory regions.

ORGANISM(S): Zea mays

PROVIDER: GSE152046 | GEO | 2020/08/22

REPOSITORIES: GEO

Dataset's files

Source:
Action DRS
Other
Items per page:
1 - 1 of 1

Similar Datasets

2021-09-09 | GSE150929 | GEO
2013-08-06 | E-GEOD-48472 | biostudies-arrayexpress
2013-08-06 | GSE48472 | GEO
2021-01-04 | GSE164159 | GEO
2020-02-04 | GSE136348 | GEO
2021-02-22 | GSE159396 | GEO
2020-08-27 | GSE119099 | GEO
2023-02-24 | GSE212509 | GEO
2007-12-19 | E-GEOD-97 | biostudies-arrayexpress
2024-09-02 | MODEL2408160001 | BioModels