Stable unmethylated DNA demarcates expressed genes and their cis-regulatory space in plant genomes
Ontology highlight
ABSTRACT: The genomic sequences of diverse varieties of many crop species continue to be produced at a frenetic pace. However, it remains challenging to develop complete annotations of functional genes and regulatory elements in these genomes. Here, we explore the potential to use DNA methylation profiles to develop more complete and refined annotations. Using leaf tissue in maize, we define ~100,000 unmethylated regions (UMRs) that account for 5.8% of the genome; 33,375 UMRs (1.3% of the genome) are found greater than 2 kilobase pairs from genes. UMRs are highly stable in multiple vegetative tissues and they capture the vast majority of accessible chromatin regions from leaf tissue. However, many UMRs are not accessible in leaf (leaf-iUMRs) and these represent a set of genomic regions with potential to become accessible in specific cell types or developmental stages. Leaf-iUMRs often occur near genes that are expressed in other tissues and are enriched for transcription factor (TF) binding sites of TFs that are also not expressed in leaf tissue. The leaf-iUMRs exhibit unique chromatin modification patterns and are enriched for chromatin interactions with nearby genes. The total UMRs space in four additional monocots ranges from 80-120 megabases, which is remarkably similar considering the range in genome size of 271 megabases to 4.8 gigabases. In summary, based on the profile from a single tissue, DNA methylation signatures pinpoint both accessible regions and regions poised to become accessible or expressed in other tissues. Thus, UMRs can provide powerful filters to distill large genomes down to the small fraction putative functional elements and facilitate the discovery of tens of thousands of novel candidate regulatory regions.
ORGANISM(S): Zea mays
PROVIDER: GSE150929 | GEO | 2021/09/09
REPOSITORIES: GEO
ACCESS DATA