Unknown,Transcriptomics,Genomics,Proteomics

Dataset Information

0

Multiplexed identification of the genomic targets of DNA-binding proteins


ABSTRACT: Transcription factors direct gene expression, and so there is much interest in mapping their genome-wide binding locations. M-BM- Current methods do not allow for the multiplexed analysis of TF binding, and this limits their throughput. We describe a novel method for determining the genomic target genes of multiple transcription factors simultaneously. DNA-binding proteins are endowed with the ability to direct transposon insertions into the genome near to where they bind. The transposon becomes a M-bM-^@M-^\Calling CardM-bM-^@M-^] marking the visit of the DNA-binding protein to that location. A unique sequence M-bM-^@M-^\barcodeM-bM-^@M-^] in the transposon matches it to the DNA-binding protein that directed its insertion. The sequences of the DNA flanking the transposon (which reveal where in the genome the transposon landed) and the barcode within the transposon (which identifies the TF that put it there) are determined by massively-parallel DNA sequencing. To demonstrate the methodM-bM-^@M-^Ys feasibility, we determined the genomic targets of eight transcription factors in a single experiment. The Calling Card method promises to significantly reduce the cost and labor needed to determine the genomic targets of many transcription factors in different environmental conditions and genetic backgrounds. These data contain Ty5 insertion sites mapped by an Illumina GAII analyzer in the S. cerevisiae genome for the background strain without any Sir4 present (1 run), in strains expressing Sir4-tagged copies of three well-characterized TFs: Gal4, Leu3, and Gcn4 (1 run each), and a multiplex of eight Sir4-tagged TFs pooled in a single experiment (2 biological replicates), and insertions from the Thi2-Sir4 fusion expressed from its native locus in two conditions (1 run each). The format of each insertions file is [chromosome number] [position of genomic base] [direction of insertion] [number of reads at that position]. Raw sequencing data comes in two varieties. Paired-end data contains a 5 bp barcode at the beginning of read #2. Single-end data contains a 2 bp barcode on the beggining of read #1.

ORGANISM(S): Saccharomyces cerevisiae

SUBMITTER: David Mayhew 

PROVIDER: E-GEOD-27381 | biostudies-arrayexpress |

REPOSITORIES: biostudies-arrayexpress

altmetric image

Publications

Calling Cards enable multiplexed identification of the genomic targets of DNA-binding proteins.

Wang Haoyi H   Mayhew David D   Chen Xuhua X   Johnston Mark M   Mitra Robi David RD  

Genome research 20110406 5


Transcription factors direct gene expression, so there is much interest in mapping their genome-wide binding locations. Current methods do not allow for the multiplexed analysis of TF binding, and this limits their throughput. We describe a novel method for determining the genomic target genes of multiple transcription factors simultaneously. DNA-binding proteins are endowed with the ability to direct transposon insertions into the genome near to where they bind. The transposon becomes a "Callin  ...[more]

Similar Datasets

2011-12-31 | E-GEOD-34791 | biostudies-arrayexpress
2014-02-11 | E-GEOD-54831 | biostudies-arrayexpress
2013-08-28 | E-GEOD-46210 | biostudies-arrayexpress
2011-02-18 | GSE27381 | GEO
2013-08-28 | GSE46210 | GEO
2022-10-11 | GSE147760 | GEO
2022-05-30 | E-MTAB-11351 | biostudies-arrayexpress
2005-08-26 | E-GEOD-3197 | biostudies-arrayexpress
2005-08-26 | GSE3197 | GEO
2022-10-01 | GSE214379 | GEO