Project description:Genotyping of RpoD mutants via amplicon sequencing from the following manuscript: \\"Systematic dissection of σ70 sequence diversity and function in bacteria\\" by Park and Wang (2020). Includes raw sequencing reads from samples from MAGE-seq single codon saturation mutagenesis and high-throughput fitness competition experiment as well as the RpoD ortholog mutants generated through recombineering and CRISPR selection.
Project description:We use nucleosome maps obtained by high-throughput sequencing to study sequence specificity of intrinsic histone-DNA interactions. In contrast with previous approaches, we employ an analogy between a classical one-dimensional fluid of finite-size particles in an arbitrary external potential and arrays of DNA-bound histone octamers. We derive an analytical solution to infer free energies of nucleosome formation directly from nucleosome occupancies measured in high-throughput experiments. The sequence-specific part of free energies is then captured by fitting them to a sum of energies assigned to individual nucleotide motifs. We have developed hierarchical models of increasing complexity and spatial resolution, establishing that nucleosome occupancies can be explained by systematic differences in mono- and dinucleotide content between nucleosomal and linker DNA sequences, with periodic dinucleotide distributions and longer sequence motifs playing a secondary role. Furthermore, similar sequence signatures are exhibited by control experiments in which genomic DNA is either sonicated or digested with micrococcal nuclease in the absence of nucleosomes, making it possible that current predictions based on highthroughput nucleosome positioning maps are biased by experimental artifacts. Included are raw (eland) and mapped (wig) reads. The mapped reads are provided in eland and wiggle formats, and the raw reads are included in the eland file. This series includes only Mnase control data. The sonicated control is part of this already published accession, as is a in vitro nucleosome map: http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE15188 We also studied data (in vitro and in vivo maps as well as a model) from http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE13622 and from: http://www.ncbi.nlm.nih.gov/sra/?term=SRA001023
Project description:To identify more targets in soybean, particularly specific targets of Cd-stress-responsive miRNAs, high-throughput degradome sequencing was used. In total, we obtained 8913111 raw reads from the library which was constructed from a mixture of four samples (HX3-CK, HX3-Cd-treatment, ZH24-CK and ZH24-Cd-treatment). After removing the reads without the CAGAG adaptor, 5430126 unique raw-reads were obtained. The unique sequences were aligned to the G. max genome database, and 6516276 reads were mapped to the genome. The mapped reads from the libraries represented 51481 annotated G. max genes.
Project description:To define the sequence preference of SALL4 C2H2 zinc finger domains, we performed SELEX coupled with high-throughput sequencing (HT-SELEX) using the purified SALL4 ZFC1, ZFC2 and ZFC4 domains combined with no protein control experiment. We re-sequenced the libraries from E-MTAB-9236 with very high coverage to estimate the minimum number of reads required per sample for accurate results.
Project description:Purpose:Our data significantly advance understanding of heat stress regulatory mechanism of miRNA in the head kidney of rainbow trout Methods:miRNAs of rainbow trout were involved in heat stress were identified by high-throughput sequencing of six small RNA libraries of the kidney tissues under control (18℃) and heat-treated (24℃) conditions Results:high-throughput sequencing was performed to identify miRNAs responsive to heat stress. We obtained 41,991,119 and 43,882,123 raw reads and 39,756,736 and 42,538,331 clean reads from under control (18℃) and heat-treated (24℃) .A total of 392 conserved miRNAs and 989 novel miRNAs were identified, of which 78 miRNAs were expressed in different response to heat stress. In addition to, including 393 negative correlation miRNA-target gene pairs Conclusions:through high-throughput sequencing of the six libraries from head kidney tissue of rainbow trout, the expression level of miRNA has significant changes after heat stress.
Project description:To identify more targets in soybean, particularly specific targets of Cd-stress-responsive miRNAs, high-throughput degradome sequencing was used. In total, we obtained 8913111 raw reads from the library which was constructed from a mixture of four samples (HX3-CK, HX3-Cd-treatment, ZH24-CK and ZH24-Cd-treatment). After removing the reads without the CAGAG adaptor, 5430126 unique raw-reads were obtained. The unique sequences were aligned to the G. max genome database, and 6516276 reads were mapped to the genome. The mapped reads from the libraries represented 51481 annotated G. max genes. Identification of miRNA targerts in soybean roots
Project description:Purpose: To ensure that ABX464 acted specifically on HIV splicing and did not significantly or globally affect the splicing events of human genes, we used an assembly approach of HIV (YU2 strain) putative transcripts and human long non-coding sequences from paired-reads (2x75bp) captured on a NimbleGen SeqCap® EZ Developer Library (Roche/NimbleGen). Methods: Cells were infected with 80 ng of p24/106 cells of the YU-2 strain for 4 to 6 hours and then rinsed with PBS before medium renewal, followed by high-throughput RNAseq from custom SeqCap EZ capture libraries. Each raw dataset of the samples contained between 5 and 30 million paired-end reads (75 bp), with an average of approximately 12 million raw reads per sample. Results: The raw reads were then cleaned and assembled per library to generate contigs, giving an average of 930 contigs per sample for further analyses. Conclusions: Our results show that high-throughput analyses coupled with bioinformatics-specific tools offers a comprehensive and more accurate view of mRNA splicing within a cell.
Project description:We report the application of single-molecule-based sequencing technology for high-throughput profiling of histone modifications in endpoint tumours from Ezh2+/+ or Ezh2-/- Tet ON PyVmT mammary gland tumours. In order to understand the H3K27me3 targets in the context of mouse mamamry gland tumorigenesis, data were generated by deep sequencing a pool of 5 tumours per genoype, in duplicate using Illumina HiSeq2500. Using an optimized data analysis workflow, we mapped about 30 million sequence reads per sample to the mouse genome (build mm10). Raw reads were trimmed for length (n>=32), quality (phred score >= 30) and adaptor sequence using fastx v0.0.13.1.Trimmed reads were (pools of 5 different tumors per genotype) then aligned to the mouse reference genome mm10 using BWA v0.5.9 Broad peaks were called using MACS v1.4.1 software (mfold=10,30; bandwith=300; pvalue cutoff=1E-5) using sequenced libraries of input DNA as control. Peak list intersections were done using BEDTools v2.12.0. Binding peaks were considered overlapping if at least 1 base of the peaks overlapped. Our study demonstrated the first mapping of H3K27me3 in the endpint tumours of tetracycline ducible PyVmT mouse mamary tumours.
Project description:The rapid development of high-throughput sequencing is conducive to the discovery of many new theories. The purpose of this study is to explore the differentially expressed of tRF & tiRNA in cholangiocarcinoma by high-throughput sequencing technology. We collected cholangiocarcinoma and adjacent normal tissues from three patients. After RNA extraction and RNA library preparation, we determined the raw data of tRF & tiRNA in cholangiocarcinoma and adjacent normal tissues by high-throughput RNA sequencing. Raw data were generated after sequencing,image analysis,basecalling and quality filtering on lllumina sequencer.Firstly,Q30 was used to perform quality control.The adaptor sequences were trimmed and the adaptor-trimmed-reads(>=16nt) were left by cut adapt software(v1.9.3).Then,the raw counts of each tRF&tiRNA(MINTbasev2.0) was calculated for all samples,defined as the raw expression level soft.The results showed that a total of 20102 tsRNA were detected in the two groups, 9616tsRNA were upregulated and 10486 were downregulated. There were 535 differentially expressed tsRNA in cholangiocarcinoma after edger standardization, of which 241 were upregulated and 294 were downregulated (| log2 (foldchange) | = 1andpvalue < 0.05). This study shows that high-throughput sequencing technology is helpful for us to determine the expression of tRF & tiRNA in cholangiocarcinoma, and to screen out differentially expressed tRF & tiRNA, and further to explore the factors that affect the progress of cholangiocarcinoma.