Unknown,Transcriptomics,Genomics,Proteomics

Dataset Information

Measuring expression from randomly-designed DNA in S. cerevisiae

ABSTRACT: Here, we integrated essentially random synthetic DNA sequences into the yeast genome (HO locus) and measured the expression via custom Agilent tiling array. The purpose of this was to compare the predictions of where transcription would occur output by our computational model to the expression observed in the cell. The four constructs (A1B1, A1B2, A2B1, A2B2) were integrated into S. cerevisiae, cells were grown and from these both gDNA and mRNA were isolated. G-coupled dyes were used to label samples with Cy3 for mRNA and Cy5 for gDNA. Samples were sheared/hydrolyzed to an average size of about 250bp, hybridized to the array, and quantified. The GFF files include features of the region tageted for tiling array analysis in gff format (https://genome.ucsc.edu/FAQ/FAQformat.html#format3). These include the locations of genomic features (ORFs, transcripts) as well as the locations of the inserts, and the KanMX selectable marker. microarray_normalized_log+.wig and microarray_normalized_log-.wig include the tiling array data across each of the four inserts in wig format (https://genome.ucsc.edu/goldenPath/help/wiggle.html) for the top and bottom DNA strands, respectively. This data are provided on a log scale and were normalized as follows: First, we normalized the data using spacial detrending of the microarray slide. Next, we normalized the tiling array data using a background model (HUBER et al. 2006) which relates, for each probe (k), the signal intensity (yk) to the background signal (Bk) plus the amount of nucleic acid in the sample (xk) times the proportionality factor (ak), or yk=Bk+xk*ak. Here, we want to determine the amount of nucleic acid in the sample (xk). We modified this normalization protocol to take advantage of the fact that the target sequences of many probes were absent from some samples (e.g. the A1B1probe target sequences are absent from the A2B2 strain). Here, we estimated separate mRNA and gDNA background signals (Bk) for each probe by taking the median signal intensity of the probe in strains lacking the target DNA sequences. We estimated the proportionality factor for each probe (ak), corresponding to how probe intensity changes with the amount of nucleic acid, by comparing the background-normalized gDNA signal at each probe to the actual amount of gDNA present in the sample, where the target DNA was present in the sample. We estimated the amount of mRNA at each probe using this model, and scaled the amounts to the median intensity across the kanMX gene to make mRNA levels comparable across constructs. All files reference the sequences contained within the accompanying ".seq.txt" files. Note: the coordinates in these files do not correspond to genomic coordinates as the construct was integrated into the genome (thereby altering the length of the chromosome). However, positions 1..5001 in these files corresponds to positions 42000..47000 of chromosome 4 in the yeast genome (May, 2008 genome version) and positions 12341..20888 correspond to positions 47800..56347 of chromosome 4.

ORGANISM(S): Saccharomyces cerevisiae

SUBMITTER: Carl de Boer

PROVIDER: E-GEOD-47004 | biostudies-arrayexpress |

REPOSITORIES: biostudies-arrayexpress

ACCESS DATA

Dataset Information

Measuring expression from randomly-designed DNA in S. cerevisiae

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

CTCF binding patterns in mouse trophoblast stem cells
2012-09-08 | GSE40667 | GEO

Indel array-based genotyping of pupariation site preference among Drosophila sechellia backcross progeny
2013-02-26 | GSE44443 | GEO

microRNA expression profiling in HPV-16/HIV associated Cervical Cancer
2018-08-01 | GSE81137 | GEO

Consensus Preprocessing of GEO and ArrayExpress Normal and Diseased Lung Tissue Microarray Data
2014-09-18 | GSE60486 | GEO

Site-specific silencing of regulatory elements as a mechanism of X-inactivation
2012-11-21 | GSE39406 | GEO

Genomic Analysis of wig-1 Pathways
2011-06-06 | GSE29751 | GEO

Formation of novel CENP-A domains on tandem repetitive DNA and across chromosome
2011-03-01 | E-TABM-1098 | biostudies-arrayexpress

Comprehensive transcriptional profiling analysis reveals candidate regulators in cold-stressed seedlings of Petunia hybrida
2015-01-15 | E-GEOD-64963 | biostudies-arrayexpress

Gene expression profiling after Wig-1 knockdown in HCT116 cancer cell line
2014-03-01 | GSE43046 | GEO

P. falciparum (lab strain 3d7) schizonts untreated control vs P. falciparum (lab strain 3d7) reference RNA pool
2012-07-14 | E-GEOD-33795 | biostudies-arrayexpress