Project description:Oncogenic human papillomavirus (HPV) genomes are often integrated into host chromosomes in HPV-associated cancers. HPV genomes are integrated either as a single copy, or as tandem repeats of viral DNA interspersed with, or without, host DNA. Integration occurs frequently in common fragile sites susceptible to tandem repeat formation, and the flanking or interspersed host DNA often contains transcriptional enhancer elements. When co-amplified with the viral genome, these enhancers can form super-enhancer-like elements that drive high viral oncogene expression. Here, we compiled highly curated datasets of HPV integration sites in cervical (CESC) and head and neck squamous cell carcinoma (HNSCC) cancers and assessed the number of breakpoints, viral transcriptional activity, and host genome copy number at each insertion site. Tumors frequently contained multiple distinct HPV integration sites, but often only one “driver” site that expressed viral RNA. Since common fragile sites and active enhancer elements are cell-type specific, we mapped these regions in cervical cell lines using FANCD2 and Brd4/H3K27ac ChIP-seq, respectively. Large enhancer clusters, or super-enhancers, were also defined using the Brd4/H3K27ac ChIP-seq dataset. HPV integration breakpoints were enriched at both FANCD2-associated fragile sites, and enhancer-rich regions, and frequently showed adjacent focal DNA amplification in CESC samples. We identified recurrent integration “hotspots” that were enriched for super-enhancers, some of which function as regulatory hubs for cell-identity genes. We propose that during persistent infection, extrachromosomal HPV minichromosomes associate with these transcriptional epicenters, and accidental integration could promote viral oncogene expression and carcinogenesis.
Project description:In organisms ranging from vertebrates to plants, major components of centromeres are rapidly-evolving repeat sequences, such as tandem repeats (TRs) and transposable elements (TEs). These repeats harbor centromere-specific histone H3 (CENH3), which also evolves rapidly. Complete centromere structures recently determined in human and Arabidopsis suggest frequent integration and purging of retrotransposons within the TR regions of centromeres. Despite the high impact of “centrophilic” retrotransposons on the paradox of rapid centromere evolution, the mechanisms involved in centromere targeting remain poorly understood in any organism. Here we show that both Ty3 and Ty1/Copia LTR elements rapidly turnover within the centromeric TRs of Arabidopsis species. We demonstrate that the Ty1/Copia element Tal1 (Transposon of Arabidopsis lyrata 1) integrates de novo into regions occupied by CENH3 in A. thaliana, and that ectopic expansion of the CENH3 region results in spread of Tal1 integration regions. The integration spectra of chimeric TEs revealed the key structural variations responsible for the contrasting chromatin targeting specificities to centromeres versus gene-rich regions, which have recurrently converted during the evolution of these TEs. Our findings reveal the impact of centromeric chromatin on TE-mediated rapid centromere evolution, with relevance across eukaryotic genomes.
Project description:In organisms ranging from vertebrates to plants, major components of centromeres are rapidly-evolving repeat sequences, such as tandem repeats (TRs) and transposable elements (TEs). These repeats harbor centromere-specific histone H3 (CENH3), which also evolves rapidly. Complete centromere structures recently determined in human and Arabidopsis suggest frequent integration and purging of retrotransposons within the TR regions of centromeres. Despite the high impact of “centrophilic” retrotransposons on the paradox of rapid centromere evolution, the mechanisms involved in centromere targeting remain poorly understood in any organism. Here we show that both Ty3 and Ty1/Copia LTR elements rapidly turnover within the centromeric TRs of Arabidopsis species. We demonstrate that the Ty1/Copia element Tal1 (Transposon of Arabidopsis lyrata 1) integrates de novo into regions occupied by CENH3 in A. thaliana, and that ectopic expansion of the CENH3 region results in spread of Tal1 integration regions. The integration spectra of chimeric TEs revealed the key structural variations responsible for the contrasting chromatin targeting specificities to centromeres versus gene-rich regions, which have recurrently converted during the evolution of these TEs. Our findings reveal the impact of centromeric chromatin on TE-mediated rapid centromere evolution, with relevance across eukaryotic genomes.
2024-08-08 | GSE263224 | GEO
Project description:Capture Seuencing of Tandem Repeats
| PRJNA422490 | ENA
Project description:Genome Scanning for Short Tandem Repeats
Project description:Retroviruses integrate their genomes into the genomes of infected host cells and form a genetic platform for stable gene expression. Epigenetic silencing can, however, hamper the expression of integrated provirus. As gammaretroviruses (γRVs) preferentially integrate into sites of active promoters and enhancers, the high expression activity of γRVs can be attributed to the integration preference. Long terminal repeats (LTRs) of some γRVs were shown to act as potent promoters for gene expression. Here, we investigate the capacity of different γRV LTRs to drive stable expression inside a non-preferred epigenomic environment using MLV-derived BET-independent (Bin) vectors. We demonstrate that different γRV LTRs are either rapidly silenced or long-term active with active proviral population prevailing under normal and retargeted integration.
Project description:Centromeres are the chromosomal sites of assembly for kinetochores, the protein complexes that attach to spindle fibers and mediate separation of chromosomes to daughter cells in mitosis and meiosis. In most multicellular organisms, centromeres comprise a single specific family of tandem repeats, often 100-400 bp in length, found on every chromosome, typically in one location within heterochromatin. Drosophila melanogaster is unusual in that the heterochromatin contains many families of mostly short (5-12 bp) tandem repeats, none of which appears to be present at all centromeres, and none of which is found only at centromeres. Although centromere sequences from a minichromosome have been identified and candidate centromere sequences have been proposed, the DNA sequences at native Drosophila centromeres remain unknown. Here we use native chromatin immunoprecipitation to identify the centromeric sequences bound by the foundational kinetochore protein cenH3, known in vertebrates as CENP-A. In D. melanogaster, these sequences include a few families of 5-bp and 10-bp repeats, but in closely related D. simulans, a partially overlapping set of short repeats and more complex repeats comprise the centromeres. The results suggest that a recent expansion of short repeats is replacing more complex centromeric repeats in the melanogaster subgroup of Drosophila.
Project description:White areas of star-type bicolor petals of petunia (Petunia hybrida) are caused by post-transcriptional gene silencing (PTGS) of the key enzyme of anthocyanin biosynthesis. We observed blotched flowers and vein-clearing symptom in leaves in 3- to 4-month-old petunia plants. In order to determine the cause of blotched flowers, we examined an endogenous pararetrovirus, petunia vein clearing virus (PVCV), because this virus causes vein-clearing symptom and may have a suppressor of PTGS. Transcripts and episomal DNA derived from proviral PVCV loci highly accumulated in 3- to 4-month-old plants, indicating that PVCV was activated as the host plant ages. Furthermore, CG and CHG sites of the promoter region of PVCV were highly methylated in 1-month-old plants but these cytosines were not methylated in 3- to 4-month-old plants, suggesting that poor maintenance of DNA methylation activates propagation of PVCV. In parallel, de novo methylation at CHH sites and 24-nt small RNAs were detected on the promoter region of PVCV, indicating that RNA-directed DNA methylation was induced by PVCV activation. Detections of transcripts and episomal DNA of PVCV in blotched regions and suppressor activity of PTGS support the mechanism that suppression of PTGS by activated PVCV causes blotched flowers.