Project description:We present a draft genome assembly that includes 200 Gb of Illumina reads, 4 Gb of Moleculo synthetic long-reads and 108 Gb of Chicago libraries, with a final size matching the estimated genome size of 2.7 Gb, and a scaffold N50 of 4.8 Mb. We also present an alternative assembly including 27 Gb raw reads generated using the Pacific Biosciences platform. In addition, we sequenced the proteome of the same individual and RNA from three different tissue types from three other species of squid species (Onychoteuthis banksii, Dosidicus gigas, and Sthenoteuthis oualaniensis) to assist genome annotation. We annotated 33,406 protein coding genes supported by evidence and the genome completeness estimated by BUSCO reached 92%. Repetitive regions cover 49.17% of the genome.
Project description:This dataset includes whole transcriptome sequencing reads from 8 cell lines based on TruSeq stranded mRNA kit (Illumina). They are all 2*75bp reads sequenced using Illumina NextSeq500.
Project description:<div>Olive (<i>Olea europaea</i>) has a long history of medicinal and nutritional values own to it rich in polyphenol and fatty acids (FAs) in fruits. In order to better understand the biosynthesis important of these metabolites, we generated comprehensive Iso-Seq full-length and illumina RNA-seq transcriptome, and targeted metabolomics dataset of different olive fruits maturity. The targeted metabolomics by using both GC/MS and LC/MS were totally quantified 35 FAs and 13 polyphenols. Iso-Seq library was constructed and sequenced by PacBio Sequel System, and a total of 5,891,652 (10.55 G) with an average length of 1,791 subreads were obtained. 492,350 circular consensus sequences (CCSs) were formed after merging and error correction through subread comparison. Of the 492,350 CCSs, 399,263 were found to be full-length non chimera (FLNC) reads, and 187,517 consensus reads were finally obtained by using clustering algorithm of Iterative clustering for error (IEC). These multiomics data provide a foundation to elucidate the mechanisms regulating biosynthesis of polyphenol and FAs during the maturation of olive fruits.</div><div><b><br></b></div><div><b>Polyphenols UPLC-MS</b> protocols and data are reported in the current study <b>MTBLS814</b>.</div><div><br></div><div><b>GC-MS</b> protocols and data associated to this study are reported in <b><a href="https://www.ebi.ac.uk/metabolights/MTBLS855">MTBLS855</a></b>.</div><div><br></div><div><span _ngcontent-iov-c3="" class="ng-star-inserted"><b>Tyrosol only UPLC-MS</b> <span _ngcontent-iov-c3="" class="ng-star-inserted">protocols and data associated to this study are reported in <b><a href="https://www.ebi.ac.uk/metabolights/MTBLS1127">MTBLS1127</a>.</b></span></span></div><div><br></div><div><br></div>
Project description:<div>Olive (Olea europaea) has a long history of medicinal and nutritional values own to it rich in polyphenol and fatty acids (FAs) in fruits. In order to better understand the biosynthesis important of these metabolites, we generated comprehensive Iso-Seq full-length and illumina RNA-seq transcriptome, and targeted metabolomics dataset of different olive fruits maturity. The targeted metabolomics by using both GC/MS and LC/MS were totally quantified 35 FAs and 13 polyphenols. Iso-Seq library was constructed and sequenced by PacBio Sequel System, and a total of 5,891,652 (10.55 G) with an average length of 1,791 subreads were obtained. 492,350 circular consensus sequences (CCSs) were formed after merging and error correction through subread comparison. Of the 492,350 CCSs, 399,263 were found to be full-length non chimera (FLNC) reads, and 187,517 consensus reads were finally obtained by using clustering algorithm of Iterative clustering for error (IEC). These multiomics data provide a foundation to elucidate the mechanisms regulating biosynthesis of polyphenol and FAs during the maturation of olive fruits.</div><div><br></div><div><div><b>GC-MS</b> protocols and data are reported in the current study <b>MTBLS855</b>.</div><div><br></div><div><span _ngcontent-jcp-c3="" class="ng-star-inserted"><b>Polyphenols UPLC-MS</b></span> protocols and data associated to this study are reported in <b><a href="http://www.ebi.ac.uk/metabolights/editor/study/MTBLS814">MTBLS814</a></b>.</div><div><br></div><div><b>Tyrosol only UPLC-MS</b> <span _ngcontent-iov-c3="" class="ng-star-inserted">protocols and data associated to this study are reported in <b><a href="http://www.ebi.ac.uk/metabolights/editor/study/MTBLS814"><a href="https://www.ebi.ac.uk/metabolights/MTBLS1127">MTBLS1127</a>.</a></b></span></div></div>
Project description:For both PBMC and cells from the in vitro cultures, RNA purification and library generation was performed using the Chromium Single Cell Controller apparatus and associated protocols (10X Genomics). Libraries were sequenced by 75-bp single-end reading on a NextSeq500 sequencer (Illumina). Reads were aligned on the GRCh38 human genome assembly. Data analysis was performed using the R software package Seurat (https://github.com/satijalab/seurat)
Project description:Parental or NLUCAT1 CRISPR/Cas9-deleted A549 clones (WT A549 Hx, n = 1 clone; DEL A549 Hx, n = 1 clone) were exposed to hypoxia (Hx) at 1% O2 for 24 hours. Libraries were generated from 500 ng of total RNAs using TruSeq Stranded Total RNA kit with Ribo-Zero (Illumina) and sequenced on a NextSeq500 sequencer (Illumina) with 2x75bp paired-end chemistry. Reads were aligned to the human genome release hg19 using STAR v2.4.0a with default parameters.
Project description:Five kidney sections of 10 μm of thickness were sliced from each FFPE block of three amyloidosis affected and three healthy cats. The total miRNAome was isolated using the miRNeasy FFPE kit by Qiagen. A total amount of 200ng of extracted miRNA/sample was sequenced using the smallRNA-seq kit by Illumina and the Illumina NextSeq500 platform. The read length obtained was 1x75bp for a total of 30 million reads. The quality control of the reads was assessed with FastQC and the adapter sequences were removed using Cutadapt.
Project description:Since short reads from Illumina RNA-seq data are challenging to map to repetitive elements , we wanted to confirm the bulk RNA-seq findings using an orthogonal method, namely, using the long read technology of Pacific Biosciences (PacBio) full-length transcriptome sequencing. This dataset provided around 1.1 (WT) and 1.3 (RBM4 KO) million sequence reads of 2.6 kb average length mapping to the human genome.
Project description:Total RNA was isolated from mid-log phase Streptococcus agalactiae 874391 wild-type cells grown in Todd-Hewitt broth (THB) medium and sequenced using Illumina NextSeq500
Project description:This study is to decipher the effect of SUN overexpressed NIL on gene expression during fruit development in tomato. RNA libraries were sequenced through Illumina Nextseq500 at Georgia Genomics and Bioinformatics Core (GGBC) of University of Georgia for 75 bp single end sequencing. Clean reads were mapped to ITAG3.2. Processed data are normalized into TPM.