Project description:We present a draft genome assembly that includes 200 Gb of Illumina reads, 4 Gb of Moleculo synthetic long-reads and 108 Gb of Chicago libraries, with a final size matching the estimated genome size of 2.7 Gb, and a scaffold N50 of 4.8 Mb. We also present an alternative assembly including 27 Gb raw reads generated using the Pacific Biosciences platform. In addition, we sequenced the proteome of the same individual and RNA from three different tissue types from three other species of squid species (Onychoteuthis banksii, Dosidicus gigas, and Sthenoteuthis oualaniensis) to assist genome annotation. We annotated 33,406 protein coding genes supported by evidence and the genome completeness estimated by BUSCO reached 92%. Repetitive regions cover 49.17% of the genome.
Project description:Characterization of gene expression levels with RNA-seq was performed on self-renewing (SR) and senescent (SEN) human adult adipose derived mesenchymal stem cells (hADSCs) using the Roche 454 pyrosequencing platform.
Project description:Next-generation sequencing (NGS) technology applications like RNA-sequencing (RNA-seq) have dramatically expanded the potential for novel genomics discoveries, but the proliferation of various platforms and protocols for RNA-seq has created a need for reference data sets to help gauge the performance characteristics of these disparate methods. Here we describe the results of the ABRF-NGS Study on RNA-seq, which leverages replicate experiments across multiple sites using two reference RNA standards tested with four protocols (polyA selected, ribo-depleted, size selected, and degraded RNA), and examined across five NGS platforms (IlluminaM-bM-^@M-^Ys HiSeqs, Life TechnologiesM-bM-^@M-^Y Personal Genome Machine and Proton, Roche 454 GS FLX, and Pacific Biosciences RS). These results show high (R2 >0.9) intra-platform consistency across test sites, high inter-platform concordance (R2 >0.8) for transcriptome profiling, and a large set of novel splice junctions observed across all platforms. Also, we observe that protocols using ribosomal RNA depletion can both salvage degraded RNA samples and also be readily compared to polyA-enriched fractions. These data provide a broad foundation for standardization, evaluation and improvement of RNA-seq methods. Two reference RNA standards tested with four protocols (polyA selected, ribo-depleted, size selected, and degraded RNA), and examined across five NGS platforms (IlluminaM-bM-^@M-^Ys HiSeqs, Life TechnologiesM-bM-^@M-^Y Personal Genome Machine and Proton, Roche 454 GS FLX, and Pacific Biosciences RS). Please note that the samples were named following the ABRF-Platform-Site-Sample-Replicate# format. For example, ABRF-454-CNL-A-1 means Sample A was run on 454 platform at Cornell and this is the first replicate, and ABRF-454-CNL-A-2 means the same exact sample was ran with same machine at same location and is 2nd replicate.
Project description:Next-generation sequencing (NGS) technology applications like RNA-sequencing (RNA-seq) have dramatically expanded the potential for novel genomics discoveries, but the proliferation of various platforms and protocols for RNA-seq has created a need for reference data sets to help gauge the performance characteristics of these disparate methods. Here we describe the results of the ABRF-NGS Study on RNA-seq, which leverages replicate experiments across multiple sites using two reference RNA standards tested with four protocols (polyA selected, ribo-depleted, size selected, and degraded RNA), and examined across five NGS platforms (Illumina’s HiSeqs, Life Technologies’ Personal Genome Machine and Proton, Roche 454 GS FLX, and Pacific Biosciences RS). These results show high (R2 >0.9) intra-platform consistency across test sites, high inter-platform concordance (R2 >0.8) for transcriptome profiling, and a large set of novel splice junctions observed across all platforms. Also, we observe that protocols using ribosomal RNA depletion can both salvage degraded RNA samples and also be readily compared to polyA-enriched fractions. These data provide a broad foundation for standardization, evaluation and improvement of RNA-seq methods. Two reference RNA standards tested with four protocols (polyA selected, ribo-depleted, size selected, and degraded RNA), and examined across five NGS platforms (Illumina’s HiSeqs, Life Technologies’ Personal Genome Machine and Proton, Roche 454 GS FLX, and Pacific Biosciences RS). Please note that the samples were named following the ABRF-Platform-Site-Sample-Replicate# format. For example, ABRF-454-CNL-A-1 means Sample A was run on 454 platform at Cornell and this is the first replicate, and ABRF-454-CNL-A-2 means the same exact sample was ran with same machine at same location and is 2nd replicate.
Project description:Next-generation sequencing (NGS) technology applications like RNA-sequencing (RNA-seq) have dramatically expanded the potential for novel genomics discoveries, but the proliferation of various platforms and protocols for RNA-seq has created a need for reference data sets to help gauge the performance characteristics of these disparate methods. Here we describe the results of the ABRF-NGS Study on RNA-seq, which leverages replicate experiments across multiple sites using two reference RNA standards tested with four protocols (polyA selected, ribo-depleted, size selected, and degraded RNA), and examined across five NGS platforms (Illumina’s HiSeqs, Life Technologies’ Personal Genome Machine and Proton, Roche 454 GS FLX, and Pacific Biosciences RS). These results show high (R2 >0.9) intra-platform consistency across test sites, high inter-platform concordance (R2 >0.8) for transcriptome profiling, and a large set of novel splice junctions observed across all platforms. Also, we observe that protocols using ribosomal RNA depletion can both salvage degraded RNA samples and also be readily compared to polyA-enriched fractions. These data provide a broad foundation for standardization, evaluation and improvement of RNA-seq methods. Two reference RNA standards tested with four protocols (polyA selected, ribo-depleted, size selected, and degraded RNA), and examined across five NGS platforms (Illumina’s HiSeqs, Life Technologies’ Personal Genome Machine and Proton, Roche 454 GS FLX, and Pacific Biosciences RS). Please note that the samples were named following the ABRF-Platform-Site-Sample-Replicate# format. For example, ABRF-454-CNL-A-1 means Sample A was run on 454 platform at Cornell and this is the first replicate, and ABRF-454-CNL-A-2 means the same exact sample was ran with same machine at same location and is 2nd replicate.
Project description:Next-generation sequencing (NGS) technology applications like RNA-sequencing (RNA-seq) have dramatically expanded the potential for novel genomics discoveries, but the proliferation of various platforms and protocols for RNA-seq has created a need for reference data sets to help gauge the performance characteristics of these disparate methods. Here we describe the results of the ABRF-NGS Study on RNA-seq, which leverages replicate experiments across multiple sites using two reference RNA standards tested with four protocols (polyA selected, ribo-depleted, size selected, and degraded RNA), and examined across five NGS platforms (Illumina’s HiSeqs, Life Technologies’ Personal Genome Machine and Proton, Roche 454 GS FLX, and Pacific Biosciences RS). These results show high (R2 >0.9) intra-platform consistency across test sites, high inter-platform concordance (R2 >0.8) for transcriptome profiling, and a large set of novel splice junctions observed across all platforms. Also, we observe that protocols using ribosomal RNA depletion can both salvage degraded RNA samples and also be readily compared to polyA-enriched fractions. These data provide a broad foundation for standardization, evaluation and improvement of RNA-seq methods. Two reference RNA standards tested with four protocols (polyA selected, ribo-depleted, size selected, and degraded RNA), and examined across five NGS platforms (Illumina’s HiSeqs, Life Technologies’ Personal Genome Machine and Proton, Roche 454 GS FLX, and Pacific Biosciences RS). Please note that the samples were named following the ABRF-Platform-Site-Sample-Replicate# format. For example, ABRF-454-CNL-A-1 means Sample A was run on 454 platform at Cornell and this is the first replicate, and ABRF-454-CNL-A-2 means the same exact sample was ran with same machine at same location and is 2nd replicate.