Metabolomics,Unknown,Transcriptomics,Genomics,Proteomics

Dataset Information

Enhanced whole exome sequencing by higherDNA insert lengths

ABSTRACT: Background: Whole exome sequencing (WES) has been proven to serve as a valuable basis for various applications such as variant calling and copy number variation (CNV) analyses. For those analyses the read coverage should be optimally balanced throughout protein coding regions at sufficient read depth. Unfortunately, WES is known for its uneven coverage within coding regions due to GC-rich regions or off-target enrichment. Results: In order to examine the irregularities of WES within genes, we applied Agilent SureSelectXT exome capture on human samples and sequenced these via Illumina in 2x101 paired-end mode. As we suspected the sequenced insert length to be crucial in the uneven coverage of exome captured samples, we sheared 12 genomic DNA samples to two different DNA insert size lengths, namely 130 and 170 bp. Interestingly, although mean coverages of target regions were clearly higher in samples of 130 bp insert length, the level of evenness was more pronounced in 170 bp samples. Moreover, merging overlapping paired-end reads revealed a positive effect on evenness indicating overlapping reads as another reason for the unevenness. In addition, mutation analysis on a subset of the samples was performed. In these isogenic subclones almost twofold mutations were failed in the 130 bp samples when compared to the 170 bp samples. Visual inspection of the discarded mutation sites exposed low coverages at the sites embedded in high amplitudes of coverage depth in the affected region. Conclusions: Producing longer insert reads could be a good strategy to achieve better uniform read coverage in coding regions and hereby enhancing the effective sequencing yield to provide an improved basis for further variant calling and CNV analyses.

INSTRUMENT(S): Illumina HiSeq 2500

ORGANISM(S): Homo sapiens

SUBMITTER:

PROVIDER: E-MTAB-4527 | biostudies-arrayexpress |

SECONDARY ACCESSION(S): ERP014466

REPOSITORIES: biostudies-arrayexpress

ACCESS DATA

Publications

Subclones in B-lymphoma cell lines: isogenic models for the study of gene regulation.

Quentmeier Hilmar H Pommerenke Claudia C Ammerpohl Ole O Geffers Robert R Hauer Vivien V MacLeod Roderick A F RA Nagel Stefan S Romani Julia J Rosati Emanuela E Rosén Anders A Uphoff Cord C CC Zaborski Margarete M Drexler Hans G HG

Oncotarget 20160901 39

Genetic heterogeneity though common in tumors has been rarely documented in cell lines. To examine how often B-lymphoma cell lines are comprised of subclones, we performed immunoglobulin (IG) heavy chain hypermutation analysis. Revealing that subclones are not rare in B-cell lymphoma cell lines, 6/49 IG hypermutated cell lines (12%) consisted of subclones with individual IG mutations. Subclones were also identified in 2/284 leukemia/lymphoma cell lines exhibiting bimodal CD marker expression. We ...[more]

PMID: 27566572

Publication: 1/2

Similar Datasets

Project description:Current methods for detection of copy number aberrations (CNA) from whole-exome sequencing (WES) data are based on the read counts of the captured exons only. However, accurate CNA determination is complicated by the non-uniform read depth and uneven distribution of exons. Therefore, we developed ENCODER (ENhanced COpy number Detection from Exome Reads), which eludes these problems. By exploiting the ‘off-target’ sequence reads, it allows for creation of robust copy number profiles from WES. The accuracy of ENCODER compares to approaches specifically designed for copy number detection, and outperforms current exon-based WES methods, particularly in samples of low quality. Current methods for detection of copy number aberrations (CNA) from whole-exome sequencing (WES) data are based on the read counts of the captured exons only. However, accurate CNA determination is complicated by the non-uniform read depth and uneven distribution of exons. Therefore, we developed ENCODER (ENhanced COpy number Detection from Exome Reads), which eludes these problems. By exploiting the ‘off-target’ sequence reads, it allows for creation of robust copy number profiles from WES. The accuracy of ENCODER compares to approaches specifically designed for copy number detection, and outperforms current exon-based WES methods, particularly in samples of low quality. Current methods for detection of copy number aberrations (CNA) from whole-exome sequencing (WES) data are based on the read counts of the captured exons only. However, accurate CNA determination is complicated by the non-uniform read depth and uneven distribution of exons. Therefore, we developed ENCODER (ENhanced COpy number Detection from Exome Reads), which eludes these problems. By exploiting the ‘off-target’ sequence reads, it allows for creation of robust copy number profiles from WES. The accuracy of ENCODER compares to approaches specifically designed for copy number detection, and outperforms current exon-based WES methods, particularly in samples of low quality. DNA copy number profiles generated with a new tool, ENCODER, were compared to DNA copy number profiles from SNP6, NimbleGen and low-coverage Whole Genome Sequencing. DNA copy number profiles of mouse squamous cell lung cancer (SCLC) were generated with ENCODER from whole exome sequencing data and compared to results from the NimbleGen array

Project description:Current methods for detection of copy number aberrations (CNA) from whole-exome sequencing (WES) data are based on the read counts of the captured exons only. However, accurate CNA determination is complicated by the non-uniform read depth and uneven distribution of exons. Therefore, we developed ENCODER (ENhanced COpy number Detection from Exome Reads), which eludes these problems. By exploiting the ‘off-target’ sequence reads, it allows for creation of robust copy number profiles from WES. The accuracy of ENCODER compares to approaches specifically designed for copy number detection, and outperforms current exon-based WES methods, particularly in samples of low quality. Current methods for detection of copy number aberrations (CNA) from whole-exome sequencing (WES) data are based on the read counts of the captured exons only. However, accurate CNA determination is complicated by the non-uniform read depth and uneven distribution of exons. Therefore, we developed ENCODER (ENhanced COpy number Detection from Exome Reads), which eludes these problems. By exploiting the ‘off-target’ sequence reads, it allows for creation of robust copy number profiles from WES. The accuracy of ENCODER compares to approaches specifically designed for copy number detection, and outperforms current exon-based WES methods, particularly in samples of low quality. Current methods for detection of copy number aberrations (CNA) from whole-exome sequencing (WES) data are based on the read counts of the captured exons only. However, accurate CNA determination is complicated by the non-uniform read depth and uneven distribution of exons. Therefore, we developed ENCODER (ENhanced COpy number Detection from Exome Reads), which eludes these problems. By exploiting the ‘off-target’ sequence reads, it allows for creation of robust copy number profiles from WES. The accuracy of ENCODER compares to approaches specifically designed for copy number detection, and outperforms current exon-based WES methods, particularly in samples of low quality. DNA copy number profiles generated with a new tool, ENCODER, were compared to DNA copy number profiles from SNP6, NimbleGen and low-coverage Whole Genome Sequencing. DNA copy number profiles of melanoma PDX sample were generated with ENCODER from whole exome sequencing data and compared to results from the SNP6 platform.

Dataset Information

Enhanced whole exome sequencing by higherDNA insert lengths

Publications

Subclones in B-lymphoma cell lines: isogenic models for the study of gene regulation.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets