Project description:Transposon insertion site sequencing (TIS) is a powerful method for associating genotype to phenotype. However, all TIS methods described to date use short nucleotide sequence reads which cannot uniquely determine the locations of transposon insertions within repeating genomic sequences where the repeat units are longer than the sequence read length. To overcome this limitation, we have developed a TIS method using Oxford Nanopore sequencing technology that generates and uses long nucleotide sequence reads; we have called this method LoRTIS (Long Read Transposon Insertion-site Sequencing). This experiment data contains sequence files generated using Nanopore and Illumina platforms. Biotin1308.fastq.gz and Biotin2508.fastq.gz are fastq files generated from nanopore technology. Rep1-Tn.fastq.gz and Rep1-Tn.fastq.gz are fastq files generated using Illumina platform. In this study, we have compared the efficiency of two methods in identification of transposon insertion sites.
Project description:The yeast Saccharomyces cerevisiae is a highly powerful model for systems genetics. While the advent of ordered deletion libraries has considerably facilitated yeast screening, it has also narrowed the diversity of screenable variants and precluded the exploration of the non-coding genome. Here, we present a versatile, time- and work-efficient method to functionally explore the yeast genome at an unprecedented throughput and resolution, using saturated transposon mutagenesis coupled to high-throughput sequencing. SAturated Transposon Analysis in Yeast (SATAY) allows the one-step mapping of all genetic loci in which a transposon can be inserted without disrupting a function necessary for cell growth. SATAY is especially suited to discover loci important for growth in various conditions. Here, we demonstrate that SATAY can be used to (2) reveal genetic interactions in single and multiple mutant strains, (1) reveal drug-resistant and -sensitive mutants, (3) detect not only essential genes, but also essential functional protein domains, and (4) generate not only null alleles, but also other informative genetic variants. Thus SATAY allows to easily explore the yeast genome at an unprecedented resolution and throughput.
Project description:Transcription factors direct gene expression, and so there is much interest in mapping their genome-wide binding locations. M-BM- Current methods do not allow for the multiplexed analysis of TF binding, and this limits their throughput. We describe a novel method for determining the genomic target genes of multiple transcription factors simultaneously. DNA-binding proteins are endowed with the ability to direct transposon insertions into the genome near to where they bind. The transposon becomes a M-bM-^@M-^\Calling CardM-bM-^@M-^] marking the visit of the DNA-binding protein to that location. A unique sequence M-bM-^@M-^\barcodeM-bM-^@M-^] in the transposon matches it to the DNA-binding protein that directed its insertion. The sequences of the DNA flanking the transposon (which reveal where in the genome the transposon landed) and the barcode within the transposon (which identifies the TF that put it there) are determined by massively-parallel DNA sequencing. To demonstrate the methodM-bM-^@M-^Ys feasibility, we determined the genomic targets of eight transcription factors in a single experiment. The Calling Card method promises to significantly reduce the cost and labor needed to determine the genomic targets of many transcription factors in different environmental conditions and genetic backgrounds. These data contain Ty5 insertion sites mapped by an Illumina GAII analyzer in the S. cerevisiae genome for the background strain without any Sir4 present (1 run), in strains expressing Sir4-tagged copies of three well-characterized TFs: Gal4, Leu3, and Gcn4 (1 run each), and a multiplex of eight Sir4-tagged TFs pooled in a single experiment (2 biological replicates), and insertions from the Thi2-Sir4 fusion expressed from its native locus in two conditions (1 run each). The format of each insertions file is [chromosome number] [position of genomic base] [direction of insertion] [number of reads at that position]. Raw sequencing data comes in two varieties. Paired-end data contains a 5 bp barcode at the beginning of read #2. Single-end data contains a 2 bp barcode on the beggining of read #1.
Project description:Transposable elements (TEs) occupy large fraction of metazoan genomes and pose constant threat to genomic integrity. This threat is particularly critical in germ cells, as changes in the genome that are induced by TEs will be transmitted to the next generation. Small non-coding piwi interacting (pi)RNAs recognize and silence a diverse set of TEs in germ cells. In mice, piRNA-guided transposon repression correlates with establishment of CpG DNA methylation on their sequences, yet the mechanism and the spectrum of genomic targets of piRNA silencing are unknown. Here we show that in addition to DNA methylation, the piRNA pathway is required to maintain a high level of the repressive H3K9me3 histone modification on long interspersed nuclear elements (LINEs) in germ cells. piRNA-dependent chromatin repression targets exclusively full-length elements of actively transposing LINE families, demonstrating the remarkable ability of the piRNA pathway to recognize active elements among the large number of genomic transposon fragments. Total of 34 libraries were analyzed. In case of ChIP libraries, every 'input' sample was used for normalization of the respective ChIP (H3K9me3 or H3) sample; the input libraries preceed ChIP libraries in the list below. There are four replicates (input-ChIP pairs) for H3K9me3 ChIP on liver cells, two for ChIP on testicular somatic cells for each genotype (Miwi2 Het and KO); and four for ChIP on male germ cells for each genotype (one from FACS-sorted germ cells, and three from MACS-sorted germ cells). Each replicate has a Het/KO pair, and the respective libraries were cloned from the material isolated from littermates. One H3 ChIP for each genotype is included, with respective inputs.
Project description:To understand how SF3B1K700E impacts co-transcriptional splicing, we peformed long read sequencing of RNA (nSRS-seq) in K562 cells (wild type and SF3B1K700E mutant) at 4 days after mutant allele expression We then performed co-transcriptional splicing efficiency analysis using data obtained from Long read sequencing
Project description:In order to study essential genomic elements in bacteria we prepare pMT85 and pMTnTetM438 mini‐transposon mutant libraries of M. pneumoniae. The dataset contains controls and the minitransposon libraries, after DNAs isolation the libraries and the controls were prepared for sequencing by HITS using standard Illumina paired‐end protocol.
Project description:Long-read RNA sequencing (RNA-seq) is a powerful technology for transcriptome analysis, but the relatively low throughput of current long-read sequencing platforms limits transcript coverage. We present TEQUILA-seq, a versatile, easy-to-implement, and low-cost method for targeted long-read RNA-seq. TEQUILA-seq can be broadly used for targeted sequencing of full-length transcripts in diverse biomedical research settings.