Project description:Oncogenic human papillomavirus (HPV) genomes are often integrated into host chromosomes in HPV-associated cancers. HPV genomes are integrated either as a single copy, or as tandem repeats of viral DNA interspersed with, or without, host DNA. Integration occurs frequently in common fragile sites susceptible to tandem repeat formation, and the flanking or interspersed host DNA often contains transcriptional enhancer elements. When co-amplified with the viral genome, these enhancers can form super-enhancer-like elements that drive high viral oncogene expression. Here, we compiled highly curated datasets of HPV integration sites in cervical (CESC) and head and neck squamous cell carcinoma (HNSCC) cancers and assessed the number of breakpoints, viral transcriptional activity, and host genome copy number at each insertion site. Tumors frequently contained multiple distinct HPV integration sites, but often only one “driver” site that expressed viral RNA. Since common fragile sites and active enhancer elements are cell-type specific, we mapped these regions in cervical cell lines using FANCD2 and Brd4/H3K27ac ChIP-seq, respectively. Large enhancer clusters, or super-enhancers, were also defined using the Brd4/H3K27ac ChIP-seq dataset. HPV integration breakpoints were enriched at both FANCD2-associated fragile sites, and enhancer-rich regions, and frequently showed adjacent focal DNA amplification in CESC samples. We identified recurrent integration “hotspots” that were enriched for super-enhancers, some of which function as regulatory hubs for cell-identity genes. We propose that during persistent infection, extrachromosomal HPV minichromosomes associate with these transcriptional epicenters, and accidental integration could promote viral oncogene expression and carcinogenesis.
Project description:PeCa is a rare carcinoma in developed countries, but it presents higher incidence rates in South America, Asia, and Africa, where limited economic and social conditions play a large impact leading to delay in diagnosis, and treatment initiation. The infection by HPV is a the risk factors and can occur through the canonical HPV/p53/RB1 pathway mediated by the E2/E6/E7 viral oncoproteins. During the transformation process, HPV inserts its genetic material into host Integration Sites (IS), affecting coding genes and miRNAs. In penile cancer (PeCa) there is limited data on the miRNAs. Considering the high frequency of HPV infection in PeCa patients in Northeast Brazil, global miRNA expression profiling was performed in high-risk HPV-associated PeCa. A panel of differentially expressed miRNAs (miRDE) was successfully constructed using 22 PeCa tissues and five non-tumor penile tissues.
Project description:We sequenced DNA from a bulk of Col x Ler F2 hybrid plants (WT and recq4) using Nanopore long-read sequencing and identified crossover sites with COmapper. For nanopore sequencing of gDNA from 1,000 pooled seedlings, 10-day-old seedlings were ground in liquid nitrogen using a mortar and pestle. The ground tissue was resuspended in four volumes of CTAB buffer (1% [w/v] CTAB, 50 mM Tris-HCl pH 8.0, 0.7 M NaCl, 10 mM EDTA) and incubated at 65°C for 30 min. Following chloroform extraction, isopropanol precipitation and removal of RNAs as above, the gDNA pellet was resuspended in 150 μl TE (10 mM Tris-HCl pH 8.0, 0.1 mM EDTA) buffer and gDNA was quantified using a Qubit dsDNA Broad Range assay kit (Thermo Fisher, Q32853). Nine micrograms of gDNA from pollen or seedlings was used to construct a nanopore long-read sequencing library using a Ligation Sequencing Kit V14 (Nanopore, SQK-LSK114). The libraries were sequenced using a PromethION platform (BGI, Hong Kong).
Project description:Human papillomavirus (HPV) integration is a critical step in cervical cancer development, while the oncogenic mechanism in genome-wide transcriptional level is still poorly understood. In this study, we employed integrative analysis on multi-omics data of cervical cancer cell lines. Through HPV integration detection, super enhancer (SE) identification, SE-associated gene expression and extrachromosomal DNA (ecDNA) investigation, we aimed to explore the genome-wide transcriptional influence of HPV integration. We identified 5 high-ranking cellular super enhancers generated by HPV integration (the HPV breakpoint induced cellular super enhancers, BP-cSE), leading to intra-chromosomal and inter-chromosomal regulations of chromosomal genes. The pathway analysis showed the dysregulated chromosomal genes were correlated to cervical cancer associated pathways. Importantly, we demonstrated that BP-cSE existed in the HPV-host ecDNA, explaining above transcription alterations. Our results suggest that HPV integration generates cellular super enhancers and functions as ecDNA to regulate unconstraint transcription, expanding the tumorigenic mechanism of HPV integration and providing insights of developing new diagnostic and therapeutic strategies.
Project description:We sequenced DNA from the leaves of ten Col x Ler F1 hybrid plants (WT and recq4) using Nanopore long-read sequencing and identified crossover sites with COmapper. These data were used as a negative control for COmapper, as no crossover sites were expected to be detected. For nanopore sequencing of gDNA from leaves, leaves from 10 5-week-old plants were ground in liquid nitrogen using a mortar and pestle. The ground tissue was resuspended in four volumes of CTAB buffer (1% [w/v] CTAB, 50 mM Tris-HCl pH 8.0, 0.7 M NaCl, 10 mM EDTA) and incubated at 65°C for 30 min. Following chloroform extraction, isopropanol precipitation and removal of RNAs as above, the gDNA pellet was resuspended in 150 μl TE (10 mM Tris-HCl pH 8.0, 0.1 mM EDTA) buffer and gDNA was quantified using a Qubit dsDNA Broad Range assay kit (Thermo Fisher, Q32853). Nine micrograms of gDNA from pollen or seedlings was used to construct a nanopore long-read sequencing library using a Ligation Sequencing Kit V14 (Nanopore, SQK-LSK114). The libraries were sequenced using a PromethION platform (BGI, Hong Kong).
Project description:Genomic DNA from 55 wild type Col x Ler F2 individuals was extracted using the CTAB method. Equal amounts of DNA from these 55 plants were pooled into two groups (pool 1 = 4 plants; pool 2 = 51 plants), and nine micrograms of gDNA from each pool was used to generate Nanopore sequencing libraries with the Ligation Sequencing Kit V14 (Nanopore, SQK-LSK114). The libraries were sequenced independently using PromethION (BGI, Hong Kong).
Project description:Transposon insertion site sequencing (TIS) is a powerful method for associating genotype to phenotype. However, all TIS methods described to date use short nucleotide sequence reads which cannot uniquely determine the locations of transposon insertions within repeating genomic sequences where the repeat units are longer than the sequence read length. To overcome this limitation, we have developed a TIS method using Oxford Nanopore sequencing technology that generates and uses long nucleotide sequence reads; we have called this method LoRTIS (Long Read Transposon Insertion-site Sequencing). This experiment data contains sequence files generated using Nanopore and Illumina platforms. Biotin1308.fastq.gz and Biotin2508.fastq.gz are fastq files generated from nanopore technology. Rep1-Tn.fastq.gz and Rep1-Tn.fastq.gz are fastq files generated using Illumina platform. In this study, we have compared the efficiency of two methods in identification of transposon insertion sites.
Project description:E18 mouse brain single cell profiling using the 10x Genomics Chromium instrument workflow with either Illumina short read sequencing for the standard gene profiling and Nanopore PromethION long read sequencing for isoform profiling.
Project description:We used the nanopore Cas9 targeted sequencing (nCATS) strategy to specifically sequence 125 L1HS-containing loci in parallel and measure their DNA methylation levels using nanopore long-read sequencing. Each targeted locus is sequenced at high coverage (~45X) with unambiguously mapped reads spanning the entire L1 element, as well as its flanking sequences over several kilobases. The genome-wide profile of L1 methylation was also assessed by bs-ATLAS-seq in the same cell lines (E-MTAB-10895).