Unknown

Dataset Information

0

Flanking sequence context-dependent transcription factor binding in early Drosophila development.


ABSTRACT:

Background

Gene expression in the Drosophila embryo is controlled by functional interactions between a large network of protein transcription factors (TFs) and specific sequences in DNA cis-regulatory modules (CRMs). The binding site sequences for any TF can be experimentally determined and represented in a position weight matrix (PWM). PWMs can then be used to predict the location of TF binding sites in other regions of the genome, although there are limitations to this approach as currently implemented.

Results

In this proof-of-principle study, we analyze 127 CRMs and focus on four TFs that control transcription of target genes along the anterio-posterior axis of the embryo early in development. For all four of these TFs, there is some degree of conserved flanking sequence that extends beyond the predicted binding regions. A potential role for these conserved flanking sequences may be to enhance the specificity of TF binding, as the abundance of these sequences is greatly diminished when we examine only predicted high-affinity binding sites.

Conclusions

Expanding PWMs to include sequence context-dependence will increase the information content in PWMs and facilitate a more efficient functional identification and dissection of CRMs.

SUBMITTER: Stringham JL 

PROVIDER: S-EPMC3851692 | biostudies-literature | 2013 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

Flanking sequence context-dependent transcription factor binding in early Drosophila development.

Stringham Jessica L JL   Brown Adam S AS   Drewell Robert A RA   Dresch Jacqueline M JM  

BMC bioinformatics 20131004


<h4>Background</h4>Gene expression in the Drosophila embryo is controlled by functional interactions between a large network of protein transcription factors (TFs) and specific sequences in DNA cis-regulatory modules (CRMs). The binding site sequences for any TF can be experimentally determined and represented in a position weight matrix (PWM). PWMs can then be used to predict the location of TF binding sites in other regions of the genome, although there are limitations to this approach as curr  ...[more]

Similar Datasets

2023-12-15 | GSE249797 | GEO
| S-EPMC8009085 | biostudies-literature
| S-EPMC151383 | biostudies-literature
| S-EPMC3967049 | biostudies-literature
2023-03-05 | GSE225867 | GEO
2024-07-31 | GSE212052 | GEO