Dataset Information

Large-scale discovery of promoter motifs in Drosophila melanogaster.


ABSTRACT: A key step in understanding gene regulation is to identify the repertoire of transcription factor binding motifs (TFBMs) that form the building blocks of promoters and other regulatory elements. Identifying these experimentally is very laborious, and the number of TFBMs discovered remains relatively small, especially when compared with the hundreds of transcription factor genes predicted in metazoan genomes. We have used a recently developed statistical motif discovery approach, NestedMICA, to detect candidate TFBMs from a large set of Drosophila melanogaster promoter regions. Of the 120 motifs inferred in our initial analysis, 25 were statistically significant matches to previously reported motifs, while 87 appeared to be novel. Analysis of sequence conservation and motif positioning suggested that the great majority of these discovered motifs are predictive of functional elements in the genome. Many motifs showed associations with specific patterns of gene expression in the D. melanogaster embryo, and we were able to obtain confident annotation of expression patterns for 25 of our motifs, including eight of the novel motifs. The motifs are available through Tiffin, a new database of DNA sequence motifs. We have discovered many new motifs that are overrepresented in D. melanogaster promoter regions, and offer several independent lines of evidence that these are novel TFBMs. Our motif dictionary provides a solid foundation for further investigation of regulatory elements in Drosophila, and demonstrates techniques that should be applicable in other species. We suggest that further improvements in computational motif discovery should narrow the gap between the set of known motifs and the total number of transcription factors in metazoan genomes.

SUBMITTER: Down TA 

PROVIDER: S-EPMC1779301 | biostudies-literature

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC8258673 | biostudies-literature
| S-EPMC4199363 | biostudies-literature
| S-EPMC3012054 | biostudies-literature
| S-EPMC2668186 | biostudies-literature
| S-EPMC4562235 | biostudies-literature
| S-EPMC4286233 | biostudies-literature
| S-EPMC3032922 | biostudies-literature
| S-EPMC8605023 | biostudies-literature
| S-EPMC3597034 | biostudies-literature
| S-EPMC4383921 | biostudies-literature