Project description:DNA WGS Long Read Sequence (PromethION) for manuscript titled: "Performance of Somatic Structural Variant Calling in Lung Cancer using Oxford Nanopore Sequencing Technology"
Project description:We describe a new generalizable library generation chemistry with increased efficiency that is amendable to tagmentation-based split-pool barcoding strategies, such as single-cell combinatorial indexing (sci). Symmetrical strand sci (‘s3’) uses a novel uracil-based adapter switching approach that provides an improved rate of conversion of source DNA into viable sequencing library fragments following tagmentation. We apply this chemistry to assay chromatin accessibility (s3-ATAC) to profile human cortical and mouse whole brain tissues, with mouse datasets demonstrating a 6-to-13-fold improvement in usable reads obtained per cell when compared to other available methods performed on the same sample type. We also demonstrate the generalizability of s3 by applying it to single-cell whole genome sequencing (s3-WGS), and whole genome plus chromatin conformation (s3-GCC), for structural variant calling in a patient-derived cancer cell line model. Using the high-coverage profiles produced by the s3 technologies we characterized preserved clonal structure and identified a putative subclone-specific translocation.
Project description:DNA WGS Short Read Sequence (Illumina NovaSeq) for manuscript titled: "Performance of Somatic Structural Variant Calling in Lung Cancer using Oxford Nanopore Sequencing Technology"
Project description:We evaluated linked-read whole genome sequencing (WGS) for detection of structural chromosomal rearrangements in primary samples of varying DNA quality from 12 patients diagnosed with ALL. Linked-read WGS enabled precise, allele-specific, digital karyotyping at a base-pair resolution for a wide range of structural variants including complex rearrangements, aneuploidy assessment and gene deletions. Additional RNA-sequencing and copy number aberrations (CNA) data from Illumina Infinium arrays were also generated and assessed against the linked-read WGS data. RNA-sequencing data was used to support structural chromosomal rearrangements detected in the linked-read WGS data by detecting expressed fusion genes as a consequence of the rearrangements. Illumina Infinium arrays (450k array and/or SNP array) were used to assess CNA status to further support the findings in the linked-read WGS data. The processed CNA data from the primary ALL patient samples has been deposited to GEO. RNA-sequencing, linked-read WGS data, and raw SNP array data from the primary ALL patient samples will not be deposited because the patient/parent consent does not cover depositing data that may be used for large-scale determination of germline variants in a repository. The ALL samples were collected 10-20 years ago from pediatric patients aged 2-15 years, some whom have deceased. The linked-read WGS data and the RNA-sequencing data sets generated in the study are available upon reasonable request from the corresponding author Jessica.Nordlund@medsci.uu.se.
Project description:We evaluated linked-read whole genome sequencing (WGS) for detection of structural chromosomal rearrangements in primary samples of varying DNA quality from 12 patients diagnosed with ALL. Linked-read WGS enabled precise, allele-specific, digital karyotyping at a base-pair resolution for a wide range of structural variants including complex rearrangements, aneuploidy assessment and gene deletions. Additional RNA-sequencing and copy number aberrations (CNA) data from Illumina Infinium arrays were also generated and assessed against the linked-read WGS data. RNA-sequencing data was used to support structural chromosomal rearrangements detected in the linked-read WGS data by detecting expressed fusion genes as a consequence of the rearrangements. Illumina Infinium arrays (450k array and/or SNP array) were used to assess CNA status to further support the findings in the linked-read WGS data. The processed CNA data from the primary ALL patient samples has been deposited to GEO. RNA-sequencing, linked-read WGS data, and raw SNP array data from the primary ALL patient samples will not be deposited because the patient/parent consent does not cover depositing data that may be used for large-scale determination of germline variants in a repository. The ALL samples were collected 10-20 years ago from pediatric patients aged 2-15 years, some whom have deceased. The linked-read WGS data and the RNA-sequencing data sets generated in the study are available upon reasonable request from the corresponding author Jessica.Nordlund@medsci.uu.se.
Project description:We evaluated linked-read whole genome sequencing (WGS) for detection of structural chromosomal rearrangements in primary samples of varying DNA quality from 12 patients diagnosed with ALL. Linked-read WGS enabled precise, allele-specific, digital karyotyping at a base-pair resolution for a wide range of structural variants including complex rearrangements, aneuploidy assessment and gene deletions. Additional RNA-sequencing and copy number aberrations (CNA) data from Illumina Infinium arrays were also generated and assessed against the linked-read WGS data. RNA-sequencing data was used to support structural chromosomal rearrangements detected in the linked-read WGS data by detecting expressed fusion genes as a consequence of the rearrangements. Illumina Infinium arrays (450k array and/or SNP array) were used to assess CNA status to further support the findings in the linked-read WGS data. The processed CNA data from the primary ALL patient samples has been deposited to GEO. RNA-sequencing, linked-read WGS data, and raw SNP array data from the primary ALL patient samples will not be deposited because the patient/parent consent does not cover depositing data that may be used for large-scale determination of germline variants in a repository. The ALL samples were collected 10-20 years ago from pediatric patients aged 2-15 years, some whom have deceased. The linked-read WGS data and the RNA-sequencing data sets generated in the study are available upon reasonable request from the corresponding author Jessica.Nordlund@medsci.uu.se.
Project description:We performed long read whole genome sequencing of NA19240 on the Oxford Nanopore PromethION and identified structural variants using recently developed tools,
Project description:Recent advances in throughput and accuracy mean that the Oxford Nanopore Technologies (ONT) PromethION platform is a now a viable solution for WGS. New bioinformatic methods have been developed to take advantage of this long read data, however much of the validation of these tools has focussed on calling germline variants (both SNVs and structural variants). Somatic variants are outnumbered many-fold by germline variants and their detection is further complicated due to their frequency varying depending on tumour purity/subclonality. Here, we evaluate the extent to which Nanopore WGS enables genome-wide detection and analysis of somatic variation. We do this through sequencing tumour and germline genomes for a patient with diffuse B-cell lymphoma. We examine the capability of currently available tools for calling somatic variants in ONT data by comparing the data with results from 150bp short-read sequencing of the same samples. We then conduct a detailed analysis of the performance of multiple long-read mappers and structural variant callers for calling large, somatic structural variants (SVs) in ONT data. Our protocol achieved yields of up to 96 mapped Gb per PromethION flow cell with average read lengths of ~5kb. Calling germline SNVs from these data achieved good specificity and sensitivity. However, results of somatic SNV calling highlight the need for the development of specialized joint calling algorithms. Our analysis of structural variants shows that the comparative performance of different tools varies significantly between SV types, and suggest long reads are especially advantageous for calling large somatic deletions and duplications. Finally, we highlight the utility of long reads for phasing clinically relevant variants by using the ONT data to confirm that a somatic 1.6Mb deletion and a p.(Arg249Met) mutation involving TP53 are oriented in trans.