Unknown

Dataset Information

0

High-throughput functional testing of ENCODE segmentation predictions.


ABSTRACT: The histone modification state of genomic regions is hypothesized to reflect the regulatory activity of the underlying genomic DNA. Based on this hypothesis, the ENCODE Project Consortium measured the status of multiple histone modifications across the genome in several cell types and used these data to segment the genome into regions with different predicted regulatory activities. We measured the cis-regulatory activity of more than 2000 of these predictions in the K562 leukemia cell line. We tested genomic segments predicted to be Enhancers, Weak Enhancers, or Repressed elements in K562 cells, along with other sequences predicted to be Enhancers specific to the H1 human embryonic stem cell line (H1-hESC). Both Enhancer and Weak Enhancer sequences in K562 cells were more active than negative controls, although surprisingly, Weak Enhancer segmentations drove expression higher than did Enhancer segmentations. Lower levels of the covalent histone modifications H3K36me3 and H3K27ac, thought to mark active enhancers and transcribed gene bodies, associate with higher expression and partly explain the higher activity of Weak Enhancers over Enhancer predictions. While DNase I hypersensitivity (HS) is a good predictor of active sequences in our assay, transcription factor (TF) binding models need to be included in order to accurately identify highly expressed sequences. Overall, our results show that a significant fraction (-26%) of the ENCODE enhancer predictions have regulatory activity, suggesting that histone modification states can reflect the cis-regulatory activity of sequences in the genome, but that specific sequence preferences, such as TF-binding sites, are the causal determinants of cis-regulatory activity.

SUBMITTER: Kwasnieski JC 

PROVIDER: S-EPMC4199366 | biostudies-literature | 2014 Oct

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC4620046 | biostudies-literature
2012-05-24 | GSE38163 | GEO
| S-EPMC6324732 | biostudies-literature
2012-05-23 | E-GEOD-38163 | biostudies-arrayexpress
| S-EPMC6186149 | biostudies-literature
| S-EPMC1262724 | biostudies-literature
| S-EPMC8920439 | biostudies-literature
| S-EPMC8323024 | biostudies-literature
| S-EPMC3297080 | biostudies-literature
| S-EPMC4649152 | biostudies-literature