Project description: Illumina Infinium whole genome genotyping (WGG) arrays are increasingly being applied in cancer genomics to study gene copy number alterations and allele-specific aberrations such as loss of heterozygosity (LOH). Methods developed for normalization of WGG arrays have mostly focused on diploid, normal samples. In cancer samples, however, genomic aberrations may confound normalization and data interpretation. We therefore examined the effects of the conventionally used normalization method for Illumina Infinium arrays when applied to cancer samples. We demonstrate an asymmetry in the detection of the two alleles for each SNP, which deleteriously influences both allelic proportions and copy number estimates. The asymmetry is caused by a bias between the two dyes used in the Infinium II assay that remains after applying the normalization method in Illumina’s proprietary software (BeadStudio). We propose a quantile normalization strategy for correction of this dye bias. We tested the normalization strategy using 535 individual hybridizations from 10 data sets from the analysis of cancer genomes and normal blood samples generated on Illumina Infinium II 300k versions 1 and 2, 370k and 550k BeadChips. We show that the proposed normalization strategy successfully removes asymmetry in estimates of both allelic proportions and copy numbers. Additionally, the normalization strategy reduces the technical variation for copy number estimates while retaining the response to copy number alterations. The proposed normalization strategy represents a valuable low-level analysis tool that improves the quality of data obtained from Illumina Infinium arrays, in particular when used for LOH and copy number variation studies.
To investigate the effects of a quantile normalization of Illumina Infinium data, compared to conventional normalization using BeadStudio (www.illumina.com), we renormalized 535 individual hybridizations conducted on Illumina 300K, 370K and 550K BeadChips. Sample types included breast cancer, colon cancer, urothelial carcinoma, leukemia as well as normal blood and HapMap samples. This series includes the 6 breast cancers hybridized on Illumina HumanHap 550K BeadChips.
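The idea of quantile normalization between the two dye channels can be illustrated with a minimal sketch. It is not the published implementation: it assumes the two channels of a single array are available as equal-length intensity vectors (one value per SNP) and simply forces both onto a shared empirical distribution. The function name and the toy values are hypothetical.

```python
import numpy as np


def quantile_normalize_channels(x, y):
    """Map two intensity vectors onto a common empirical distribution.

    x, y: 1-D sequences of equal length, one intensity per SNP for each
    dye channel (assumed layout, not taken from the original study).
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    if x.ndim != 1 or x.shape != y.shape:
        raise ValueError("x and y must be 1-D arrays of equal length")

    # Reference distribution: mean of the two channels at each rank.
    reference = (np.sort(x) + np.sort(y)) / 2.0

    # Rank of every value within its own channel (ties broken by position).
    rank_x = np.argsort(np.argsort(x, kind="stable"), kind="stable")
    rank_y = np.argsort(np.argsort(y, kind="stable"), kind="stable")

    # Replace each value with the reference value of the same rank.
    return reference[rank_x], reference[rank_y]


if __name__ == "__main__":
    # Toy intensities in which the second channel reads systematically higher.
    x_raw = [3.0, 1.0, 4.0, 2.0]
    y_raw = [8.0, 2.0, 4.0, 6.0]
    x_norm, y_norm = quantile_normalize_channels(x_raw, y_raw)
    print(x_norm, y_norm)
```

After this step both channels share the same distribution of values, while the rank order of SNPs within each channel is preserved, so a systematic intensity difference between the dyes no longer skews quantities computed from the pair.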
Project description: Amplicon-based targeted re-sequencing analysis was performed on patient-derived glioblastoma cell culture samples. For this purpose, genomic DNA (gDNA) was isolated and DNA libraries were prepared using the TruSeq Custom Amplicon Low Input (Illumina, Inc.) technology. In this way, a pool of 375 amplicons was generated for each sample in order to enrich for the target genes ATRX1, EGFR, IDH1, NF1, PDGFRA, PIK3CG, PIK3R1, PTEN, RB1 and TP53. Sequencing was performed on the Illumina MiSeq® next generation sequencing system (Illumina Inc.) using its 2 x 250 bp paired-end v2 read chemistry. The resulting reads were quality controlled and mapped against the human reference genome (hg19). For all samples, sequence variations in the amplified regions of interest relative to the human reference sequence were identified and filtered based on reliability.