Unknown,Transcriptomics,Genomics,Proteomics

Dataset Information

0

Histone Modifications by ChIP-seq from ENCODE/Broad Institute


ABSTRACT: This data was generated by ENCODE. If you have questions about the data, contact the submitting laboratory directly (mailto:nshoresh@broad.mit.edu). If you have questions about the Genome Browser track associated with this data, contact ENCODE (mailto:genome@soe.ucsc.edu). This track displays maps of chromatin state generated by the Broad/MGH ENCODE group using ChIP-seq. Chemical modifications (methylation, acetylation) to the histone proteins present in chromatin influence gene expression by changing how accessible the chromatin is to transcription. The ChIP-seq method involves first using formaldehyde to cross-link histones and other DNA-associated proteins to genomic DNA within cells. The cross-linked chromatin is subsequently extracted, mechanically sheared, and immunoprecipitated using specific antibodies. After reversal of cross-links, the immunoprecipitated DNA is sequenced and mapped to the human reference genome. The relative enrichment of each antibody-target (epitope) across the genome is inferred from the density of mapped fragments. For data usage terms and conditions, please refer to http://www.genome.gov/27528022 and http://www.genome.gov/Pages/Research/ENCODE/ENCODEDataReleasePolicyFinal2008.pdf ChIP-seq: Cells were grown according to the approved ENCODE cell culture protocols. Cells were fixed in 1% formaldehyde and resuspended in lysis buffer. Chromatin was sheared to 200-700 bp using a Diagenode Bioruptor. Solubilized chromatin was immunoprecipitated with antibodies against each of the histone antibodies listed above. Antibody-chromatin complexes were pulled-down using protein A-sepharose (or anti-IgM-conjugated agarose for RNA polymerase II), washed and then eluted. After cross-link reversal and proteinase K treatment, immunoprecipitated DNA was extracted with phenol-chloroform, ethanol precipitated, treated with RNAse and purified. One to ten nanograms of DNA were end-repaired, adapter-ligated and sequenced by Illumina Genome Analyzers as recommended by the manufacturer. Alignment: Sequence reads from each IP experiment were aligned to the human reference genome (GRCh37/hg19) using MAQ with default parameters, except '-C 11' and '-H output_file', which outputs up to 11 additional best matches for each read (if any are found) to a file. This information was used to filter out any read that had more than 10 best matches on the genome. Note: It is likely that instances where multiple reads align to the same position and with the same orientation are due to enhanced PCR amplification of a single DNA fragment. No attempt has been made, however, to remove such artifacts from the data, following ENCODE practices. Signal: Fragment densities were computed by counting the number of reads overlapping each 25 bp bin along the genome. Densities were computed using igvtools count with default parameters (in particular, '-w 25' to set window size of 25 bp and '-f mean' to report the mean value across the window), except for '-e' set to extend the reads to 200 bp, and the .wig output was converted to bigWig using wigToBigWig from the UCSC Kent software package. Peaks: Discrete intervals of ChIP-seq fragment enrichment were identified using Scripture, a scan statistics approach, under the assumption of uniform background signal. All data sets where processed with '-task chip', and with '-windows 100,200,500,1000,5000,10000,100000'. (No mask file nor the '-trim' option have been used.) The resulting called segments were then further filtered to remove intervals that are significantly enriched only because they contain smaller enriched intervals within them. This post-processing step has been implemented using Matlab. The use of the post-processing step allowed very large enriched intervals (of the order of Mbps for H3K27me3, for instance) to be detected, as well as much smaller intervals, without the need to tailor the parameters of Scripture based on prior expectations.

ORGANISM(S): Homo sapiens

SUBMITTER: ENCODE DCC 

PROVIDER: E-GEOD-29611 | biostudies-arrayexpress |

REPOSITORIES: biostudies-arrayexpress

Similar Datasets

2011-08-31 | E-GEOD-31755 | biostudies-arrayexpress
2012-09-07 | E-GEOD-40700 | biostudies-arrayexpress
2010-10-08 | E-GEOD-24565 | biostudies-arrayexpress
2012-09-07 | E-GEOD-40698 | biostudies-arrayexpress
2011-12-16 | E-GEOD-34448 | biostudies-arrayexpress
2010-12-28 | E-GEOD-26284 | biostudies-arrayexpress
2012-09-07 | E-GEOD-40699 | biostudies-arrayexpress
2014-11-19 | E-GEOD-51341 | biostudies-arrayexpress
2010-12-29 | E-GEOD-26328 | biostudies-arrayexpress
2011-08-12 | E-GEOD-31363 | biostudies-arrayexpress