Systematic assessment of next-generation sequencing for quantitative small RNA profiling: human plasma pool
Ontology highlight
ABSTRACT: Small RNA-seq is increasingly being used for profiling of small RNAs. Quantitative characteristics of long RNA-seq have been extensively described, but small RNA-seq involves fundamentally different methods for library preparation, with distinct protocols and technical variations that have not been fully and systematically studied. Using common sets of reference samples, we evaluated the accuracy, reproducibility and bias of small RNA-seq library preparation for five distinct protocols and across nine different laboratories. As part of this larger study, we assessed reproducibility and diversity of sequencing of mature miRNAs, generating libraries from RNA extracted from human plasma that was pooled from 11 healthy individuals. We find that protocol-specific biases are largely recapitulated in the biological samples, and that inter-protocol bias correction factors estimated from the more comprehensive equimolar synthetic pools can be applied to the biological samples to get more comparable, and in some cases, more accurate estimates of miRNA abundance.
Project description:Small RNA-seq is increasingly being used for profiling of small RNAs. Quantitative characteristics of long RNA-seq have been extensively described, but small RNA-seq involves fundamentally different methods for library preparation, with distinct protocols and technical variations that have not been fully and systematically studied. Using common sets of reference samples, we evaluated the accuracy, reproducibility and bias of small RNA-seq library preparation for five distinct protocols and across nine different laboratories. As part of this larger study, we assessed sequencing bias and reproducibility using an equimolar pool of 1,152 small RNA sequences ranging from 15-90 nt, and primarily comprised of annotated human microRNAs. We observed extensive protocol-specific and sequence-specific bias that was largely mitigated in protocols employing sequencing adapters with randomized end-nucleotides. We find that sequencing bias is highly reproducible across labs using the same library preparation technologies, and use the data to calculate inter-protocol bias correction factors. These results provide strong evidence for the feasibility of reproducible cross-laboratory small RNA-seq studies, even those involving analysis of data generated using different protocols.
Project description:Small RNA-seq is increasingly being used for profiling of small RNAs. Quantitative characteristics of long RNA-seq have been extensively described, but small RNA-seq involves fundamentally different methods for library preparation, with distinct protocols and technical variations that have not been fully and systematically studied. Using common sets of reference samples, we evaluated the accuracy, reproducibility and bias of small RNA-seq library preparation for five distinct protocols and across nine different laboratories. As part of this larger study, we assessed reproducibility and accuracy of relative expression measurements using two pools of synthetic small RNA sequences, where subsets of the sRNAs vary in relative amount between pools A and B. The pools each contain 334 small RNAs, varying by 15 different ratios between pools A and B (10:1, 8:1, 5:1, 4:1, 3:1, 2:1, 1.5:1, 1:1, 1:1.5, 1:2, 1:3, 1:4 1:5, 1:8, 1:10). We find that although the sequencing bias varies extensively between protocols, the relative abundance measured between samples A and B is largely reproducible and accurate across labs and protocols. These results suggest that measurements of differential expression should be comparable across institutions and library preparation technologies.
Project description:Small RNA-seq is increasingly being used for profiling of small RNAs. Quantitative characteristics of long RNA-seq have been extensively described, but small RNA-seq involves fundamentally different methods for library preparation, with distinct protocols and technical variations that have not been fully and systematically studied. Using common sets of reference samples, we evaluated the accuracy, reproducibility and bias of small RNA-seq library preparation for five distinct protocols and across nine different laboratories. As part of this larger study, we assessed reproducibility and accuracy of relative expression measurements using two pools of synthetic small RNA sequences, where subsets of the sRNAs vary in relative amount between pools A and B. The pools each contain 334 small RNAs, varying by 15 different ratios between pools A and B (10:1, 8:1, 5:1, 4:1, 3:1, 2:1, 1.5:1, 1:1, 1:1.5, 1:2, 1:3, 1:4 1:5, 1:8, 1:10). We find that although the sequencing bias varies extensively between protocols, the relative abundance measured between samples A and B is largely reproducible and accurate across labs and protocols. These results suggest that measurements of differential expression should be comparable across institutions and library preparation technologies.
Project description:Small RNA-seq is increasingly being used for profiling of small RNAs. Quantitative characteristics of long RNA-seq have been extensively described, but small RNA-seq involves fundamentally different methods for library preparation, with distinct protocols and technical variations that have not been fully and systematically studied. We report here the results of a study using common references (synthetic RNA pools of defined composition, as well as plasma-derived RNA) to evaluate the accuracy, reproducibility and bias of small RNA-seq library preparation for five distinct protocols and across nine different laboratories. We observed protocol-specific and sequence-specific bias, which was ameliorated using adapters for ligation with randomized end-nucleotides, and computational correction factors. Despite this technical bias, relative quantification using small RNA-seq was remarkably accurate and reproducible, even across multiple laboratories using different methods. These results provide strong evidence for the feasibility of reproducible cross-laboratory small RNA-seq studies, even those involving analysis of data generated using different protocols. This SuperSeries is composed of the SubSeries listed below.
Project description:Objectives: The sequencing by the PolyA selection is the most common approach for library preparation. With limited amount or degraded RNA, alternative protocols such as the NuGEN have been developed. However, it is not yet clear how the different library preparations affect the downstream analyses of the broad applications of RNA sequencing. Methods and Materials: Eight human mammary epithelial cell (HMEC) lines with high quality RNA were sequenced by Illumina’s mRNA-Seq PolyA selection and NuGEN ENCORE library preparation. The following analyses and comparisons were conducted: 1) the numbers of genes captured by each protocol; 2) the impact of protocols on differentially expressed gene detection between biological replicates; 3) expressed single nucleotide variant (SNV) detection; 4) non-coding RNAs, particularly lincRNA detection; and 5) intragenic gene expression. Results: Sequences from the NuGEN protocol had lower (75%) alignment rate than the PolyA (over 90%). The NuGEN protocol detected fewer genes (12-20% less) with a significant portion of reads mapped to non-coding regions. A large number of genes were differentially detected between the two protocols. About 17-20% of the differentially expressed genes between biological replicates were commonly detected between the two protocols. Significantly higher numbers of SNVs (5-6 times) were detected in the NuGEN samples, which were largely from intragenic and intergenic regions. The NuGEN captured fewer exons (25% less) and had higher base level coverage variance. While 6.3% of reads were mapped to intragenic regions in the PolyA samples, the percentages were much higher (20-25%) for the NuGEN samples. The NuGEN protocol did not detect more known non-coding RNAs such as lincRNAs, but targeted small and “novel” lincRNAs. Conclusion: Different library preparations can have significant impacts on downstream analysis and interpretation of RNA-seq data. The NuGEN provides an alternative for limited or degraded RNA but it has limitations for some RNA-seq applications.
Project description:Here we present a performance evaluation of three different plate-based scRNA-seq protocols. Two commercially available kits; NEBNext Single Cell/ Low Input RNA Library Prep Kit (NEB), SMART-seq HT kit (Takara), and non-commercially available protocol Genome & transcriptome sequencing (G&T). We evaluated each kit based upon sensitivity, reproducibility, ease of use and price point per cell. This evaluation is specifically relevant for implementation of scRNA-seq in e.g. diagnostics, requiring high sensitivity in regards of gene detection, high reproducibility between samples, minimum hands on time and smooth sample preparation for technicians, as well as keeping a reasonable price point per patient. G&T protocol delivered the highest detection of genes per single cell, at the absolute lowest price point. Takara's kit delivered likewise high gene detection per single cell, and high reproducibility between sample, however at the absolute highest price points. Here we present pros and cons of each protocol, sequencing breast cancer cell line T47D.
Project description:MiRNAs are non-coding RNAs that regulate gene expression. MiRNAs mostly localise within the cytosol but are also found in the mitochondria where they can regulate the expression of mitochondrial-encoded transcripts. Small RNA library preparation protocols are well described when using total cellular RNA as the template, however, these methods are not directly applicable to total RNA extracted from fractionated cells, such as isolated mitochondria. The aim of this study was to optimise the small RNA library preparation protocol for use with small (<60ng) amounts of total mitochondrial RNA.
Project description:In this study, we present a comprehensive evaluation of four RNA-Seq library preparation methods. We used three standard input protocols, the Illumina TruSeq Stranded Total RNA and TruSeq Stranded mRNA kits, and a modified NuGEN Ovation v2 kit; and an ultra-low-input RNA protocol, the TaKaRa SMARTer Ultra Low RNA Kit v3. Our evaluation of these kits included quality control measures such as overall reproducibility, 5’ and 3’ end-bias, and the identification of DEGs, lncRNAs, and alternatively spliced transcripts. Overall, we found that the two Illumina kits were most similar in terms of recovering DEGs, and the Illumina, modified NuGEN, and TaKaRa kits allowed identification of a similar set of DEGs. However, we also discovered that the Illumina, NuGEN and TaKaRa kits each enriched for different sets of genes.
Project description:In this study, we present a comprehensive evaluation of four RNA-Seq library preparation methods. We used three standard input protocols, the Illumina TruSeq Stranded Total RNA and TruSeq Stranded mRNA kits, and a modified NuGEN Ovation v2 kit; and an ultra-low-input RNA protocol, the TaKaRa SMARTer Ultra Low RNA Kit v3. Our evaluation of these kits included quality control measures such as overall reproducibility, 5’ and 3’ end-bias, and the identification of DEGs, lncRNAs, and alternatively spliced transcripts. Overall, we found that the two Illumina kits were most similar in terms of recovering DEGs, and the Illumina, modified NuGEN, and TaKaRa kits allowed identification of a similar set of DEGs. However, we also discovered that the Illumina, NuGEN and TaKaRa kits each enriched for different sets of genes.