Project description:A cDNA library was constructed by Novogene (CA, USA) using a Small RNA Sample Pre Kit, and Illumina sequencing was conducted according to company workflow, using 20 million reads. Raw data were filtered for quality as determined by reads with a quality score > 5, reads containing N < 10%, no 5' primer contaminants, and reads with a 3' primer and insert tag. The 3' primer sequence was trimmed and reads with a poly A/T/G/C were removed
Project description:We present a draft genome assembly that includes 200 Gb of Illumina reads, 4 Gb of Moleculo synthetic long-reads and 108 Gb of Chicago libraries, with a final size matching the estimated genome size of 2.7 Gb, and a scaffold N50 of 4.8 Mb. We also present an alternative assembly including 27 Gb raw reads generated using the Pacific Biosciences platform. In addition, we sequenced the proteome of the same individual and RNA from three different tissue types from three other species of squid species (Onychoteuthis banksii, Dosidicus gigas, and Sthenoteuthis oualaniensis) to assist genome annotation. We annotated 33,406 protein coding genes supported by evidence and the genome completeness estimated by BUSCO reached 92%. Repetitive regions cover 49.17% of the genome.
Project description:Purpose: The goal of this study is to compare endothelial small RNA transcriptome to identify the target of OASL under basal or stimulated conditions by utilizing miRNA-seq. Methods: Endothelial miRNA profilies of siCTL or siOASL transfected HUVECs were generated by illumina sequencing method, in duplicate. After sequencing, the raw sequence reads are filtered based on quality. The adapter sequences are also trimmed off the raw sequence reads. rRNA removed reads are sequentially aligned to reference genome (GRCh38) and miRNA prediction is performed by miRDeep2. Results: We identified known miRNA in species (miRDeep2) in the HUVECs transfected with siCTL or siOASL. The expression profile of mature miRNA is used to analyze differentially expressed miRNA(DE miRNA). Conclusions: Our study represents the first analysis of endothelial miRNA profiles affected by OASL knockdown with biologic replicates.
Project description:Nitrate-reducing iron(II)-oxidizing bacteria are widespread in the environment contribute to nitrate removal and influence the fate of the greenhouse gases nitrous oxide and carbon dioxide. The autotrophic growth of nitrate-reducing iron(II)-oxidizing bacteria is rarely investigated and poorly understood. The most prominent model system for this type of studies is enrichment culture KS, which originates from a freshwater sediment in Bremen, Germany. To gain insights in the metabolism of nitrate reduction coupled to iron(II) oxidation under in the absence of organic carbon and oxygen limited conditions, we performed metagenomic, metatranscriptomic and metaproteomic analyses of culture KS. Raw sequencing data of 16S rRNA amplicon sequencing, shotgun metagenomics (short reads: Illumina; long reads: Oxford Nanopore Technologies), metagenome assembly, raw sequencing data of shotgun metatranscriptomes (2 conditions, triplicates) can be found at SRA in https://www.ncbi.nlm.nih.gov/bioproject/PRJNA682552. This dataset contains proteomics data for 2 conditions (heterotrophic and autotrophic growth conditions) in triplicates.
Project description:Whole exome sequencing of 5 HCLc tumor-germline pairs. Genomic DNA from HCLc tumor cells and T-cells for germline was used. Whole exome enrichment was performed with either Agilent SureSelect (50Mb, samples S3G/T, S5G/T, S9G/T) or Roche Nimblegen (44.1Mb, samples S4G/T and S6G/T). The resulting exome libraries were sequenced on the Illumina HiSeq platform with paired-end 100bp reads to an average depth of 120-134x. Bam files were generated using NovoalignMPI (v3.0) to align the raw fastq files to the reference genome sequence (hg19) and picard tools (v1.34) to flag duplicate reads (optical or pcr), unmapped reads, reads mapping to more than one location, and reads failing vendor QC.
Project description:HDMYZ cells were treated with 2ug/ml ActD for 0, 4 and 12 hours. Small RNAs of 15-40 bases were gel-purified from 10 ug total RNA, and subjected to multiplex Illumina small RNA library preparation. Small RNA libraries were sequenced on a HiSeq2000 (Illumina) with 3 samples per lane. To quantify miRNA and isoform abundance, sequence reads were processed by the miRDeep2 package, with the following modifications. First, to remove adaptor sequence, we removed both the main adaptor sequence present in the sequencing reads, as well as the second most abundant adaptor variant. In addition, we did not restrict the size of small RNAs during adaptor removal. Second, we used miRBase v18 for mapping the reads. Third, for quantifying miRNA and isoform frequency, we limited reads to more or equal to 15 bases in length with zero mis-match during mapping. The number of reads that were mapped to known miRNAs was used to normalize read frequencies for each miRNA or each miRNA isoform. For quantification purposes, we only considered miRNAs or isoforms that had frequency >= 1x10e-6 in samples without ActD treatment, which correspond to ~21-30 reads in raw count. These miRNAs or isoforms were referred to as reliably quantifiable.To analyze mapping to the genome, we removed reads that mapped to miRNA precursors. The rest of the reads were then mapped to the genome with Bowtie.
Project description:Skin mRNA profiles of wild-type (WT) and EGFR Inhibitors (Gefitinib and Afatinib) induced rash rats were generated by deep sequencing, in triplicate, using Illumina GAIIx. Raw sequences were mapped to the rat reference sequence by Hisat2 v2.0.5. FeatureCounts v1.5.0-p3 was used to count the reads numbers mapped to each gene. FPKM of each gene was calculated based on the length of the gene and reads count mapped to this gene. Differential expression analysis was performed using the DESeq2 R package (1.16.1).
Project description:Total RNAs were isolated from WT splenic NK cells, and subjected to standard m6A MeRIP, in two replicates, using Illumina Novaseq 6000 platform. The raw sequencing reads were mapped to the genome of Mus musculus (mm10) with default parameters. ExomePeak was used to identify m6A peaks, which were annotated by intersection with gene architecture using ChIPseeker. Sequence motifs enriched in peak regions were identified using Homer.