Unknown,Transcriptomics,Genomics,Proteomics

Dataset Information

0

Determination and Inference of Eukaryotic Transcription Factor Sequence Specificity


ABSTRACT: The DNA sequence preferences of the vast majority of eukaryotic transcription factors (TFs) are unknown. Using an approach designed to broadly sample both DNA-binding domain types and eukaryotic clades, we have determined DNA-binding motifs for 1,033 TFs from 131 diverse eukaryotes, encompassing 54 domain types. Closely related orthologs and paralogs typically have very similar sequence preferences; this property allows inference of motifs for roughly one third of the 166,851 known or predicted eukaryotic TFs. While the origins of most motifs can be dated to hundreds of millions of years ago, we also characterize more recent TF expansions. Sequences matching the motifs are enriched upstream of TSS in most eukaryotic lineages, and at informative eQTL SNPs in Arabidopsis promoters, demonstrating their utility in mapping transcriptional networks. The motifs are housed at http://cisbp.ccbr.utoronto.ca Protein binding microarray (PBM) experiments were performed for a set of 1048 diverse eukaryotic transcription factors. Briefly, the PBMs involved binding GST-tagged DNA-binding proteins to two double-stranded 44K Agilent microarrays, each containing a different DeBruijn sequence design, in order to determine their sequence preferences. Details of the PBM protocol are described in Berger et al., Nature Biotechnology 2006.

ORGANISM(S): synthetic construct

SUBMITTER: Matthew Weirauch 

PROVIDER: E-GEOD-53348 | biostudies-arrayexpress |

REPOSITORIES: biostudies-arrayexpress

Similar Datasets

2012-12-13 | E-GEOD-42864 | biostudies-arrayexpress
2015-02-17 | E-GEOD-52520 | biostudies-arrayexpress
2010-12-02 | E-GEOD-25723 | biostudies-arrayexpress
2011-09-14 | E-GEOD-31007 | biostudies-arrayexpress
2015-04-23 | E-GEOD-65719 | biostudies-arrayexpress
2015-02-17 | E-GEOD-52523 | biostudies-arrayexpress
2011-09-14 | E-GEOD-30992 | biostudies-arrayexpress
2013-02-15 | E-GEOD-44338 | biostudies-arrayexpress
2013-03-29 | E-GEOD-44437 | biostudies-arrayexpress
2011-12-10 | E-GEOD-34306 | biostudies-arrayexpress