Project description:In plants, RNA polymerase II (Pol II) transcription of inverted DNA repeats produces hairpin RNAs that are processed by several DICER-LIKE enzymes into siRNAs that are 21-24-nt in length. When targeted to transcriptional regulatory regions, the 24-nt size class can induce RNA-directed DNA methylation (RdDM) and transcriptional gene silencing (TGS). In a forward genetic screen to identify mutants defective in RdDM of a target enhancer leading to TGS of a downstream GFP reporter gene in Arabidopsis thaliana, we recovered a structurally mutated silencer locus, named SM-NM-^T35S, in which the 35S promoter driving transcription of an inverted repeat of target enhancer sequences had been specifically deleted. Although Pol II-dependent, hairpin-derived 21-24-nt siRNAs were no longer generated at the newly created SM-NM-^T35S locus, the GFP reporter gene was nevertheless still partially silenced. Silencing was associated with methylation in a short tandem repeat in the upstream target enhancer and with low levels of 24-nt tandem repeat siRNAs. Introducing an nrpd1 mutation into the SM-NM-^T35S line fully released GFP silencing and eliminated both the tandem repeat methylation and associated 24-nt siRNAs, demonstrating their dependence on Pol IV. Deletion of the 35S promoter thus revealed a Pol IV-dependent pathway of 24-nt siRNA biogenesis that was previously inhibited or masked by the Pol II-dependent pathway in wild-type plants. Both Pol II- and Pol IV-dependent siRNAs accrued predominantly from cytosine (C)-containing segments of the tandem repeat monomer, suggesting that the local base composition influenced siRNA accumulation. Preferential accumulation of siRNAs at C-containing sequences was also observed at an endogenous tandem repeat comprising discrete C-rich and AT-rich sections. Our studies illuminate the potential complexity of siRNA generation at repeat-containing loci and show that Pol IV can act in siRNA biogenesis in the absence of a conventional Pol II promoter. Examination of whole-genome DNA methylation status in transgenic T+S Arabidopsis plant
Project description:Repetitive sequences are hotspots of evolution at multiple levels. However, due to technical difficulties involved in their assembly and analysis, the role of repeats in tumor evolution is poorly understood. We developed a rigorous motif-based methodology to quantify variations in the repeat content of proteomes and genomes, directly from proteomic and genomic raw sequence data, and applied it to analyze a wide range of tumors and normal tissues. We identify high similarity between the repeat-instability in tumors and their patient-matched normal tissues, but also tumor-specific signatures, both in protein expression and in the genome, that strongly correlate with cancer progression and robustly predict the tumorigenic state. In a patient, the hierarchy of genomic repeat instability signatures accurately reconstructs tumor evolution, with primary tumors differentiated from metastases. We find an inverse relationship between repeat-instability and point mutation load, within and across patients, and independently of other somatic aberrations. Thus, repeat-instability is a distinct, transient and compensatory adaptive mechanism in tumor evolution.