Project description:Integrons are genetic platforms that acquire new genes encoded in integron cassettes (ICs), building arrays of adaptive functions for bacteria. ICs generally encode promoterless genes, whose expression relies on the PC promoter within the integron platform. Cassette arrays are assumed to be operon-like structures in which expression is dependent on the distance to the Pc. This is especially relevant in large sedentary chromosomal integrons (SCIs,) like the ones in Vibrio species. We have identified 29 gene-less cassettes in 4 Vibrio SCIs, and explored whether their function could be related to regulating the transcription of adjacent ICs. We show that most gene-less cassettes have promoter activity on the sense strand, enhancing the expression of downstream cassettes. Accordingly, we found that most of the superintegron in Vibrio cholerae is not silent. These promoter cassettes can trigger the expression of a silent dfrB9 resistance cassette downstream, increasing trimethoprim resistance >512-fold in V. cholerae and Escherichia coli. Additionally, one cassette had an antisense promoter capable of reducing trimethoprim resistance through transcriptional interference. Our findings highlight the regulatory role of gene-less cassettes in the expression of adjacent cassettes, emphasizing their significance in large SCIs and their clinical importance if captured by mobile integrons.
Project description:Integrons are genetic elements that enable bacterial adaptation by collecting new genes encoded in integron cassettes (ICs) to create a reservoir of adaptive functions. These cassettes typically lack their own promoters and rely on the integron platform for their expression. Integrons, well-known for spreading antibiotic resistance genes in clinically relevant Gram-negative species, include Mobile Integrons (MIs), that transport over 170 resistance genes. In contrast, Sedentary Chromosomal Integrons (SCIs), ubiquitous in Vibrio species, are primarily found within bacterial chromosomes. However, their functions are not related to antimicrobial resistance and are largely unexplored. SCIs, typified by the Superintegron (SI) in Vibrio cholerae, represent ancient and highly variable regions in bacterial genomes. The SI is extensive, housing 179 integron cassettes, mostly with unknown functions. Although 19 cassettes encode toxin-antitoxin (TA) systems, which stabilize the array, the intricacies of the SI are challenging to study due to its size and unique integrase. To investigate the SI's impact on V. cholerae, we developed the SeqDelTA approach, enabling the gradual deletion of the SI. This deletion facilitates the use of standard genetic tools without SI interference. Our in-depth analysis of the resulting ∆SI strain, covering various aspects, demonstrated no significant alterations in V. cholerae's physiology. Despite their extended coevolution, SCIs appear to be genetically isolated from the host genome.
Project description:Adenovirus is a common human pathogen that relies on host cell processes for transcription and processing of viral RNA and protein production. Although adenoviral promoters, splice junctions, and cleavage and polyadenylation sites have been characterized using low-throughput biochemical techniques or short read cDNA-based sequencing, these technologies do not fully capture the complexity of the adenoviral transcriptome. By combining Illumina short-read and nanopore long-read direct RNA sequencing approaches, we mapped transcription start sites and cleavage and polyadenylation sites across the adenovirus genome. In addition to confirming the known canonical viral early and late RNA cassettes, our analysis of splice junctions within long RNA reads revealed an additional 35 novel viral transcripts. These RNAs include fourteen new splice junctions which lead to expression of canonical open reading frames (ORF), six novel ORF-containing transcripts, and fifteen transcripts encoding for messages that potentially alter protein functions through truncations or fusion of canonical ORFs. In addition, we also detect RNAs that bypass canonical cleavage sites and generate potential chimeric proteins by linking separate gene transcription units. Of these, an evolutionary conserved protein was detected containing the N-terminus of E4orf6 fused to the downstream DBP/E2A ORF. Loss of this novel protein, E4orf6/DBP, was associated with aberrant viral replication center morphology and poor viral spread. Our work highlights how long-read sequencing technologies can reveal further complexity within viral transcriptomes.
Project description:We sequenced DNA from a bulk of Col x Ler F2 hybrid plants (WT and recq4) using Nanopore long-read sequencing and identified crossover sites with COmapper. For nanopore sequencing of gDNA from 1,000 pooled seedlings, 10-day-old seedlings were ground in liquid nitrogen using a mortar and pestle. The ground tissue was resuspended in four volumes of CTAB buffer (1% [w/v] CTAB, 50 mM Tris-HCl pH 8.0, 0.7 M NaCl, 10 mM EDTA) and incubated at 65°C for 30 min. Following chloroform extraction, isopropanol precipitation and removal of RNAs as above, the gDNA pellet was resuspended in 150 μl TE (10 mM Tris-HCl pH 8.0, 0.1 mM EDTA) buffer and gDNA was quantified using a Qubit dsDNA Broad Range assay kit (Thermo Fisher, Q32853). Nine micrograms of gDNA from pollen or seedlings was used to construct a nanopore long-read sequencing library using a Ligation Sequencing Kit V14 (Nanopore, SQK-LSK114). The libraries were sequenced using a PromethION platform (BGI, Hong Kong).
Project description:We sequenced DNA from the leaves of ten Col x Ler F1 hybrid plants (WT and recq4) using Nanopore long-read sequencing and identified crossover sites with COmapper. These data were used as a negative control for COmapper, as no crossover sites were expected to be detected. For nanopore sequencing of gDNA from leaves, leaves from 10 5-week-old plants were ground in liquid nitrogen using a mortar and pestle. The ground tissue was resuspended in four volumes of CTAB buffer (1% [w/v] CTAB, 50 mM Tris-HCl pH 8.0, 0.7 M NaCl, 10 mM EDTA) and incubated at 65°C for 30 min. Following chloroform extraction, isopropanol precipitation and removal of RNAs as above, the gDNA pellet was resuspended in 150 μl TE (10 mM Tris-HCl pH 8.0, 0.1 mM EDTA) buffer and gDNA was quantified using a Qubit dsDNA Broad Range assay kit (Thermo Fisher, Q32853). Nine micrograms of gDNA from pollen or seedlings was used to construct a nanopore long-read sequencing library using a Ligation Sequencing Kit V14 (Nanopore, SQK-LSK114). The libraries were sequenced using a PromethION platform (BGI, Hong Kong).
Project description:Genomic DNA from 55 wild type Col x Ler F2 individuals was extracted using the CTAB method. Equal amounts of DNA from these 55 plants were pooled into two groups (pool 1 = 4 plants; pool 2 = 51 plants), and nine micrograms of gDNA from each pool was used to generate Nanopore sequencing libraries with the Ligation Sequencing Kit V14 (Nanopore, SQK-LSK114). The libraries were sequenced independently using PromethION (BGI, Hong Kong).
Project description:Transposon insertion site sequencing (TIS) is a powerful method for associating genotype to phenotype. However, all TIS methods described to date use short nucleotide sequence reads which cannot uniquely determine the locations of transposon insertions within repeating genomic sequences where the repeat units are longer than the sequence read length. To overcome this limitation, we have developed a TIS method using Oxford Nanopore sequencing technology that generates and uses long nucleotide sequence reads; we have called this method LoRTIS (Long Read Transposon Insertion-site Sequencing). This experiment data contains sequence files generated using Nanopore and Illumina platforms. Biotin1308.fastq.gz and Biotin2508.fastq.gz are fastq files generated from nanopore technology. Rep1-Tn.fastq.gz and Rep1-Tn.fastq.gz are fastq files generated using Illumina platform. In this study, we have compared the efficiency of two methods in identification of transposon insertion sites.
Project description:We used the nanopore Cas9 targeted sequencing (nCATS) strategy to specifically sequence 125 L1HS-containing loci in parallel and measure their DNA methylation levels using nanopore long-read sequencing. Each targeted locus is sequenced at high coverage (~45X) with unambiguously mapped reads spanning the entire L1 element, as well as its flanking sequences over several kilobases. The genome-wide profile of L1 methylation was also assessed by bs-ATLAS-seq in the same cell lines (E-MTAB-10895).