Epigenetic and genetic features that lead to discovery of enhancer function
Ontology highlight
ABSTRACT: The ability to measure epigenetic features, such as histone modifications and occupancy by transcription factors and co-activators, on a genome-wide scale is advancing the accuracy of CRM predictions. While integration of signals from multiple features is expected to improve predictions, the contribution of each feature to prediction accuracy is not known. We began with predictions of 4,915 erythroid enhancers based on genomic occupancy by TAL1, a key hematopoietic transcription factor that is strongly associated with gene induction in erythroid cells. Seventy of these DNA segments occupied by TAL1 (TAL1 OSs) were tested by transient transfections of cultured hematopoietic cells, and 56% of these were active as enhancers. Sixty-six TAL1 OSs were evaluated in transgenic mouse embryos, and 65% of these were active enhancers in various tissues. Inclusion of additional epigenetic features improved the prediction accuracy, with combinations of TAL1, GATA1, EP300, H3K4me1, and H3K27ac giving high accuracy of enhancer prediction (70%-75% success depending on method of clustering) while maintaining good sensitivity and specificity. Motifs that distinguish active from inactive TAL1 OSs implicate IRFs, STATs, and FOX protein families as candidate positive co-factors with TAL1, while REST (NRSF) and HOX family proteins are implicated in inactivity. While signals for evolutionary constraint were weak over the entire TAL1-bound DNA segments regardless of activity in either assay, phylogenetic preservation of a TF-binding site motif was associated with enhancer activity. The contribution of 8 epigenetic features including H3K27ac to identification of enhancers in 24h-induced G1E-ER4 cells.
ORGANISM(S): Mus musculus
SUBMITTER: Ross Hardison
PROVIDER: E-GEOD-61349 | biostudies-arrayexpress |
REPOSITORIES: biostudies-arrayexpress
ACCESS DATA