Project description:Microarray-based enrichment of selected genomic loci is a powerful method for genome complexity reduction. Since the vast majority of exons in vertebrate genomes are smaller than 150 nt, we have explored the use of short fragment libraries (85-110bp) to achieve higher enrichment specificity by reducing carryover and adverse effects of flanking intronic sequences. These short fragment libraries were enriched for 1.69 Mb of exonic sequences, using custom 244K microarrays, and sequenced using AB/SOLiD. High enrichment specificity (60 M-bM-^@M-^S 75%) was obtained at 67-213x average coverage, with 77-92% and 90-98% of targeted regions covered with more than 25% and 10% of the average coverage, respectively. As a more appropriate measure of the evenness of coverage, which is relatively independent of sequencing depth, we introduce the evenness of coverage parameter E. E values up to 75% were achieved. To verify the accuracy of SNP/mutation detection we evaluated 384 known non-reference SNPs in the targeted regions. At ~ 200x average sequence coverage, we were able to survey 96.4% of 1.69 Mb of genomic sequence with only 4.2% false negative calls while 3.6% of targeted regions were marked as unsurveyed. A total of 1197 new variants were detected. Verification revealed only 8 false positive calls, resulting in an overall false positive rate of less than 1 per ~200,000 bp (0.0005%, equivalent to an overall phred score of 55). 4 samples + capture design file
Project description:Microarray-based enrichment of selected genomic loci is a powerful method for genome complexity reduction. Since the vast majority of exons in vertebrate genomes are smaller than 150 nt, we have explored the use of short fragment libraries (85-110bp) to achieve higher enrichment specificity by reducing carryover and adverse effects of flanking intronic sequences. These short fragment libraries were enriched for 1.69 Mb of exonic sequences, using custom 244K microarrays, and sequenced using AB/SOLiD. High enrichment specificity (60 – 75%) was obtained at 67-213x average coverage, with 77-92% and 90-98% of targeted regions covered with more than 25% and 10% of the average coverage, respectively. As a more appropriate measure of the evenness of coverage, which is relatively independent of sequencing depth, we introduce the evenness of coverage parameter E. E values up to 75% were achieved. To verify the accuracy of SNP/mutation detection we evaluated 384 known non-reference SNPs in the targeted regions. At ~ 200x average sequence coverage, we were able to survey 96.4% of 1.69 Mb of genomic sequence with only 4.2% false negative calls while 3.6% of targeted regions were marked as unsurveyed. A total of 1197 new variants were detected. Verification revealed only 8 false positive calls, resulting in an overall false positive rate of less than 1 per ~200,000 bp (0.0005%, equivalent to an overall phred score of 55).
Project description:High-grade serous ovarian cancer (HGSC), the principal cause of death from gynecological malignancies in the world, has not significantly benefited from recent progress in cancer immunotherapy. While HGSC infiltration by lymphocytes correlates with superior survival, the nature of antigens that can elicit anti-HGSC immune responses is unknown. The goal of this study was to establish the global landscape of HGSC tumor-specific antigens (TSAs) using a mass spectrometry pipeline that interrogates all reading frames of all genomic regions. In 23 HGSC tumors, we identified 113 TSAs. Classic TSA discovery approaches focusing only on mutated exonic sequences would have uncovered only seven of these TSAs. Other mutated TSAs resulted from out-of-frame exonic translation or from non-exonic sequences. The most interesting group of TSAs (n = 94) derived from aberrantly expressed unmutated genomic sequences which are not expressed in normal tissues. These aberrantly expressed TSAs (aeTSAs) derived primarily from non-exonic sequences, in particular intronic (31%) and intergenic (22%). Their expression was regulated at the transcriptional level by variations in gene copy number and DNA methylation. While mutated TSAs were unique to individual tumors, aeTSAs were shared by a large proportion of HGSCs. We conclude that, in view of their number and the fact that they are shared by many tumors, aeTSAs may be the most attractive targets for HGSC immunotherapy.
2020-02-18 | PXD014062 | Pride
Project description:Highly efficient degradation of phenanthrene strain sequences
Project description:Structural variations (SVs) contribute significantly to the variability of the human genome and extensive genomic rearrangements are a hallmark of cancer. Genomic DNA paired-end-tag (DNA-PET) sequencing is an attractive approach to identify genomic SVs. The current application of PET sequencing with short insert size DNA is insufficient for the comprehensive mapping of SVs in low complexity and repeat-rich genomic regions. We have developed a robust procedure to generate PET sequencing data using large DNA inserts of 10 - 20 kb for the identification of SVs. We compared the characteristics of the large insert libraries with short insert (1 kb) libraries with the same sequencing depths and costs. Although short insert libraries bear an advantage in identifying small deletions, they do not provide a significantly better breakpoint resolution. Large inserts are superior to short inserts in providing higher physical genome coverage and therefore achieve greater sensitivity for the identification of the different types of SVs, including copy number neutral and complex events. Further, large inserts allow the identification of SVs within repetitive sequences which cannot be spanned by short inserts.
Project description:Whole exome sequencing (WES) was performed on genomic DNA derived from two patients with Sotos Syndrome Features. Sequencing (100 base pair paired-end) was performed on an Illumina Hiseq 2000 sequencer after enrichment of 62Mb of exonic and adjacent intronic sequences with TruSeq Exome Enrichment Kit (Illumina, San Diego, CA, USA).
Project description:We performed whole exome sequencing (WES) on genomic DNA derived from two patients with Sotos Syndrome Features. Sequencing (100 base pair paired-end) was performed on an Illumina Hiseq 2000 sequencer after enrichment of 62Mb of exonic and adjacent intronic sequences with TruSeq Exome Enrichment Kit (Illumina, San Diego, CA, USA).
Project description:Structural variations (SVs) contribute significantly to the variability of the human genome and extensive genomic rearrangements are a hallmark of cancer. Genomic DNA paired-end-tag (DNA-PET) sequencing is an attractive approach to identify genomic SVs. The current application of PET sequencing with short insert size DNA is insufficient for the comprehensive mapping of SVs in low complexity and repeat-rich genomic regions. We have developed a robust procedure to generate PET sequencing data using large DNA inserts of 10 - 20 kb for the identification of SVs. We compared the characteristics of the large insert libraries with short insert (1 kb) libraries with the same sequencing depths and costs. Although short insert libraries bear an advantage in identifying small deletions, they do not provide a significantly better breakpoint resolution. Large inserts are superior to short inserts in providing higher physical genome coverage and therefore achieve greater sensitivity for the identification of the different types of SVs, including copy number neutral and complex events. Further, large inserts allow the identification of SVs within repetitive sequences which cannot be spanned by short inserts. Structural variations of three cancer cell lines using short (1 kb) and long (10 kb and 20 kb) insert size DNA fragments