Project description:Genotyping of RpoD mutants via amplicon sequencing from the following manuscript: \\"Systematic dissection of σ70 sequence diversity and function in bacteria\\" by Park and Wang (2020). Includes raw sequencing reads from samples from MAGE-seq single codon saturation mutagenesis and high-throughput fitness competition experiment as well as the RpoD ortholog mutants generated through recombineering and CRISPR selection.
Project description:We use nucleosome maps obtained by high-throughput sequencing to study sequence specificity of intrinsic histone-DNA interactions. In contrast with previous approaches, we employ an analogy between a classical one-dimensional fluid of finite-size particles in an arbitrary external potential and arrays of DNA-bound histone octamers. We derive an analytical solution to infer free energies of nucleosome formation directly from nucleosome occupancies measured in high-throughput experiments. The sequence-specific part of free energies is then captured by fitting them to a sum of energies assigned to individual nucleotide motifs. We have developed hierarchical models of increasing complexity and spatial resolution, establishing that nucleosome occupancies can be explained by systematic differences in mono- and dinucleotide content between nucleosomal and linker DNA sequences, with periodic dinucleotide distributions and longer sequence motifs playing a secondary role. Furthermore, similar sequence signatures are exhibited by control experiments in which genomic DNA is either sonicated or digested with micrococcal nuclease in the absence of nucleosomes, making it possible that current predictions based on highthroughput nucleosome positioning maps are biased by experimental artifacts. Included are raw (eland) and mapped (wig) reads. The mapped reads are provided in eland and wiggle formats, and the raw reads are included in the eland file. This series includes only Mnase control data. The sonicated control is part of this already published accession, as is a in vitro nucleosome map: http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE15188 We also studied data (in vitro and in vivo maps as well as a model) from http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE13622 and from: http://www.ncbi.nlm.nih.gov/sra/?term=SRA001023
Project description:To identify more targets in soybean, particularly specific targets of Cd-stress-responsive miRNAs, high-throughput degradome sequencing was used. In total, we obtained 8913111 raw reads from the library which was constructed from a mixture of four samples (HX3-CK, HX3-Cd-treatment, ZH24-CK and ZH24-Cd-treatment). After removing the reads without the CAGAG adaptor, 5430126 unique raw-reads were obtained. The unique sequences were aligned to the G. max genome database, and 6516276 reads were mapped to the genome. The mapped reads from the libraries represented 51481 annotated G. max genes.
Project description:To define the sequence preference of SALL4 C2H2 zinc finger domains, we performed SELEX coupled with high-throughput sequencing (HT-SELEX) using the purified SALL4 ZFC1, ZFC2 and ZFC4 domains combined with no protein control experiment. We re-sequenced the libraries from E-MTAB-9236 with very high coverage to estimate the minimum number of reads required per sample for accurate results.
Project description:To identify more targets in soybean, particularly specific targets of Cd-stress-responsive miRNAs, high-throughput degradome sequencing was used. In total, we obtained 8913111 raw reads from the library which was constructed from a mixture of four samples (HX3-CK, HX3-Cd-treatment, ZH24-CK and ZH24-Cd-treatment). After removing the reads without the CAGAG adaptor, 5430126 unique raw-reads were obtained. The unique sequences were aligned to the G. max genome database, and 6516276 reads were mapped to the genome. The mapped reads from the libraries represented 51481 annotated G. max genes. Identification of miRNA targerts in soybean roots
Project description:Purpose:Our data significantly advance understanding of heat stress regulatory mechanism of miRNA in the head kidney of rainbow trout Methods:miRNAs of rainbow trout were involved in heat stress were identified by high-throughput sequencing of six small RNA libraries of the kidney tissues under control (18℃) and heat-treated (24℃) conditions Results:high-throughput sequencing was performed to identify miRNAs responsive to heat stress. We obtained 41,991,119 and 43,882,123 raw reads and 39,756,736 and 42,538,331 clean reads from under control (18℃) and heat-treated (24℃) .A total of 392 conserved miRNAs and 989 novel miRNAs were identified, of which 78 miRNAs were expressed in different response to heat stress. In addition to, including 393 negative correlation miRNA-target gene pairs Conclusions:through high-throughput sequencing of the six libraries from head kidney tissue of rainbow trout, the expression level of miRNA has significant changes after heat stress.
Project description:Purpose: To ensure that ABX464 acted specifically on HIV splicing and did not significantly or globally affect the splicing events of human genes, we used an assembly approach of HIV (YU2 strain) putative transcripts and human long non-coding sequences from paired-reads (2x75bp) captured on a NimbleGen SeqCap® EZ Developer Library (Roche/NimbleGen). Methods: Cells were infected with 80 ng of p24/106 cells of the YU-2 strain for 4 to 6 hours and then rinsed with PBS before medium renewal, followed by high-throughput RNAseq from custom SeqCap EZ capture libraries. Each raw dataset of the samples contained between 5 and 30 million paired-end reads (75 bp), with an average of approximately 12 million raw reads per sample. Results: The raw reads were then cleaned and assembled per library to generate contigs, giving an average of 930 contigs per sample for further analyses. Conclusions: Our results show that high-throughput analyses coupled with bioinformatics-specific tools offers a comprehensive and more accurate view of mRNA splicing within a cell.
Project description:We report the application of single-molecule-based sequencing technology for high-throughput profiling of histone modifications in endpoint tumours from Ezh2+/+ or Ezh2-/- Tet ON PyVmT mammary gland tumours. In order to understand the H3K27me3 targets in the context of mouse mamamry gland tumorigenesis, data were generated by deep sequencing a pool of 5 tumours per genoype, in duplicate using Illumina HiSeq2500. Using an optimized data analysis workflow, we mapped about 30 million sequence reads per sample to the mouse genome (build mm10). Raw reads were trimmed for length (n>=32), quality (phred score >= 30) and adaptor sequence using fastx v0.0.13.1.Trimmed reads were (pools of 5 different tumors per genotype) then aligned to the mouse reference genome mm10 using BWA v0.5.9 Broad peaks were called using MACS v1.4.1 software (mfold=10,30; bandwith=300; pvalue cutoff=1E-5) using sequenced libraries of input DNA as control. Peak list intersections were done using BEDTools v2.12.0. Binding peaks were considered overlapping if at least 1 base of the peaks overlapped. Our study demonstrated the first mapping of H3K27me3 in the endpint tumours of tetracycline ducible PyVmT mouse mamary tumours.