Project description:<p>Non-coding elements in our genomes that play critical roles in complex disease are frequently marked by highly unstable RNA species. Sequencing nascent RNAs attached to an actively transcribing RNA polymerase complex can identify unstable RNAs, including those templated from gene-distal enhancers (eRNAs). However, nascent RNA sequencing techniques remain challenging to apply in some cell lines and especially to intact tissues, limiting broad applications in fields such as cancer genomics and personalized medicine. Here we report the development of chromatin run-on and sequencing (ChRO-seq), a novel run-on technology that maps the location of RNA polymerase using virtually any frozen tissue sample, including samples with degraded RNA that are intractable to conventional RNA-seq. We used ChRO-seq to develop the first maps of nascent transcription in 23 human glioblastoma (GBM) brain tumors and patient derived xenografts. Remarkably, >90,000 distal enhancers discovered using the signature of eRNA biogenesis within primary GBMs closely resemble those found in the normal human brain, and diverge substantially from GBM cell models. Despite extensive overall similarity, 12% of enhancers in each GBM distinguish normal and malignant brain tissue. These enhancers drive regulatory programs similar to the developing nervous system and are enriched for transcription factor binding sites that specify a stem-like cell fate. These results demonstrate that GBMs largely retain the enhancer landscape associated with their tissue of origin, but selectively adopt regulatory programs that are responsible for driving stem-like cell properties. We also identified enhancers and their associated transcription factors that regulate genes characteristic of each known GBM subtype, and discovered a core group of transcription factors that control the expression of genes associated with clinical outcomes. This study uncovers new insights into the molecular etiology of GBM and introduces ChRO-seq which can now be used to map regulatory programs contributing to a variety of complex diseases.</p>
Project description:Histones were isolated from brown adipose tissue and liver from mice housed at 28, 22, or 8 C. Quantitative top- or middle-down approaches were used to quantitate histone H4 and H3.2 proteoforms. See published article for complimentary RNA-seq and RRBS datasets.
Project description:Introgressed variants from other species can be an important source of genetic variation because they may arise rapidly, can include multiple mutations on a single haplotype, and have often been pretested by selection in the species of origin. Although introgressed alleles are generally deleterious, several studies have reported introgression as the source of adaptive alleles-including the rodenticide-resistant variant of Vkorc1 that introgressed from Mus spretus into European populations of Mus musculus domesticus. Here, we conducted bidirectional genome scans to characterize introgressed regions into one wild population of M. spretus from Spain and three wild populations of M. m. domesticus from France, Germany, and Iran. Despite the fact that these species show considerable intrinsic postzygotic reproductive isolation, introgression was observed in all individuals, including in the M. musculus reference genome (GRCm38). Mus spretus individuals had a greater proportion of introgression compared with M. m. domesticus, and within M. m. domesticus, the proportion of introgression decreased with geographic distance from the area of sympatry. Introgression was observed on all autosomes for both species, but not on the X-chromosome in M. m. domesticus, consistent with known X-linked hybrid sterility and inviability genes that have been mapped to the M. spretus X-chromosome. Tract lengths were generally short with a few outliers of up to 2.7 Mb. Interestingly, the longest introgressed tracts were in olfactory receptor regions, and introgressed tracts were significantly enriched for olfactory receptor genes in both species, suggesting that introgression may be a source of functional novelty even between species with high barriers to gene flow.
Project description:BACKGROUND: Long terminal repeat (LTR) retrotransposons make up a large fraction of the typical mammalian genome. They comprise about 8% of the human genome and approximately 10% of the mouse genome. On account of their abundance, LTR retrotransposons are believed to hold major significance for genome structure and function. Recent advances in genome sequencing of a variety of model organisms has provided an unprecedented opportunity to evaluate better the diversity of LTR retrotransposons resident in eukaryotic genomes. RESULTS: Using a new data-mining program, LTR_STRUC, in conjunction with conventional techniques, we have mined the GenBank mouse (Mus musculus) database and the more complete Ensembl mouse dataset for LTR retrotransposons. We report here that the M. musculus genome contains at least 21 separate families of LTR retrotransposons; 13 of these families are described here for the first time. CONCLUSIONS: All families of mouse LTR retrotransposons are members of the gypsy-like superfamily of retroviral-like elements. Several different families of unrelated non-autonomous elements were identified, suggesting that the evolution of non-autonomy may be a common event. High sequence similarity between several LTR retrotransposons identified in this study and those found in distantly-related species suggests that horizontal transfer has been a significant factor in the evolution of mouse LTR retrotransposons.