Project description:A technical comparison of Agilent SureSelect Focussed Exome and TruSight One Mendeliome sequencing kits, by triplicate sequencing of the cell line DNA of NA12878, NA12891, NA12892 and NIST RM8395
Project description:Targeted enrichment-based next-generation sequencing or whole exome sequencing were taken for patients with hypomyelinating leukodystrophies to reveal genetic aetiologies. All genomic DNA used in the experiments was extracted from the peripheral leukocytes. A complete kit was synthetized using the Agilent SureSelect Target Enrichment technique, capturing the coding regions from 104 candidate genes, including their exons and exon-intron boundaries (11,473 probes, 383.065 kbp in total). The following NGS which included equipment and reagents was performed on an Illumina NEXTSEQ500 platform manufactured by Illumina (San Diego, California, USA) using paired-end sequencing of 110 bp. The clean paired-end reads were aligned to the human reference genome build hg19, which was previously annotated using ANNOVAR, in addition to insertion-deletion (indel) and single-nucleotide polymorphism (SNP) calling.
Project description:Targeted enrichment-based next-generation sequencing or whole exome sequencing were taken for patients with hypomyelinating leukodystrophies to reveal genetic aetiologies. All genomic DNA used in the experiments was extracted from the peripheral leukocytes. A complete kit was synthetized using the Agilent SureSelect Target Enrichment technique, capturing the coding regions from 104 candidate genes, including their exons and exon-intron boundaries (11,473 probes, 383.065 kbp in total). The following NGS which included equipment and reagents was performed on an Illumina NEXTSEQ500 platform manufactured by Illumina (San Diego, California, USA) using paired-end sequencing of 110 bp. The clean paired-end reads were aligned to the human reference genome build hg19, which was previously annotated using ANNOVAR, in addition to insertion-deletion (indel) and single-nucleotide polymorphism (SNP) calling.
Project description:Accurate and comprehensive genomic annotation, including the full list of protein-coding genes, is vital for understanding the molecular mechanisms of human biology. We have previously shown that the genome contains a multitude of yet hidden functional exons and transcripts, some of which might represent novel mRNAs. These results resonate with those from other groups and strongly argue that two decades after the completion of the first draft of the human genome sequence, the current annotation of human genes and transcripts remains far from being complete. Using a targeted RNA enrichment technique, we showed that one of the novel functional exons previously discovered by us and currently annotated as part of a long non-coding RNA, is actually a part of a novel protein-coding gene, InSETG-4, which encodes a novel human protein with no known homologs or motifs. We found that InSETG-4 is induced by various DNA-damaging agents across multiple cell types and therefore might represent a novel component of DNA damage response. Despite its low abundance in bulk cell populations, InSETG-4 exhibited expression restricted to a small fraction of cells, as demonstrated by the amplification-based single-molecule fluorescence in situ hybridization (asmFISH) analysis. This study argues that yet undiscovered human protein-coding genes exist and provides an example of how targeted RNA enrichment techniques can help to fill this major gap in our knowledge of the information encoded in the human genome.
Project description:Norovirus full-genome sequencing is challenging due to sequence heterogeneity among genomes. Previous methods have relied on PCR amplification, which is problematic due to primer design, and transcriptome sequencing (RNA-Seq), which nonspecifically sequences all RNA, including host and bacterial RNA, in stool specimens. Target enrichment uses a panel of custom-designed 120-mer RNA baits that are complementary to all publicly available norovirus sequences, with multiple baits targeting each position of the genome, which overcomes the challenge of primer design. Norovirus genomes are enriched from stool RNA extracts to minimize the sequencing of nontarget RNA. SureSelect target enrichment and Illumina sequencing were used to sequence full genomes from 507 norovirus-positive stool samples with reverse transcription-real-time PCR cycle threshold (CT) values of 10 to 43. Sequencing on an Illumina MiSeq system in batches of 48 generated, on average, 81% on-target reads per sample and 100% genome coverage with >12,000-fold read depth. Samples included genotypes GI.1, GI.2, GI.3, GI.6, GI.7, GII.1, GII.2, GII.3, GII.4, GII.5, GII.6, GII.7, GII.13, GII.14, and GII.17. When outliers were accounted for, we generated >80% genome coverage for all positive samples, regardless of CT values. A total of 164 samples were tested in parallel with conventional PCR genotyping of the capsid shell domain; 164/164 samples were successfully sequenced, compared to 158/164 samples that were amplified by PCR. Four of the samples that failed capsid PCR analysis had low titers, which suggests that target enrichment is more sensitive than gel-based PCR. Two samples failed PCR due to primer mismatches; target enrichment uses multiple baits targeting each position, thus accommodating sequence heterogeneity among norovirus genomes.
Project description:Obtaining reliable and high fidelity next-generation sequencing (NGS) data requires to choose a suitable sequencing platform and a library preparation approach, which both have their inherent assay-specific limitations. Here, we present the results of successful adaptation of SureSelect hybridisation-based target enrichment protocol for the sequencing on the Ion Torrent S5 platform, which is designed to work preferably with amplicon-based panels. In our study, we applied a custom NGS panel to screen a cohort of 16 unrelated patients affected by premature fusion of the cranial sutures, i.e. craniosynostosis (CS). CS occurs either as an isolated malformation or in a syndromic form, representing a genetically heterogeneous and clinically variable group of disorders. The approach presented here allowed us to achieve high quality NGS data and confirmed molecular diagnosis in 19% of cases, reaching the diagnostic yield similar to some of the published research reports. In conclusion, we demonstrated that an alternative enrichment strategy for library preparations can be successfully applied prior to sequencing on the Ion Torrent S5 platform. Also, we proved that the custom NGS panel designed by us represents a useful and effective tool in the molecular diagnostics of patients with CS.
Project description:Microarray-based enrichment of selected genomic loci is a powerful method for genome complexity reduction. Since the vast majority of exons in vertebrate genomes are smaller than 150 nt, we have explored the use of short fragment libraries (85-110bp) to achieve higher enrichment specificity by reducing carryover and adverse effects of flanking intronic sequences. These short fragment libraries were enriched for 1.69 Mb of exonic sequences, using custom 244K microarrays, and sequenced using AB/SOLiD. High enrichment specificity (60 M-bM-^@M-^S 75%) was obtained at 67-213x average coverage, with 77-92% and 90-98% of targeted regions covered with more than 25% and 10% of the average coverage, respectively. As a more appropriate measure of the evenness of coverage, which is relatively independent of sequencing depth, we introduce the evenness of coverage parameter E. E values up to 75% were achieved. To verify the accuracy of SNP/mutation detection we evaluated 384 known non-reference SNPs in the targeted regions. At ~ 200x average sequence coverage, we were able to survey 96.4% of 1.69 Mb of genomic sequence with only 4.2% false negative calls while 3.6% of targeted regions were marked as unsurveyed. A total of 1197 new variants were detected. Verification revealed only 8 false positive calls, resulting in an overall false positive rate of less than 1 per ~200,000 bp (0.0005%, equivalent to an overall phred score of 55). 4 samples + capture design file