Project description:BackgroundCopy number alteration is a main genetic structural variation that plays an important role in tumor initialization and progression. Accurate detection of copy number alterations is necessary for discovering cancer-causing genes. Whole-exome sequencing has become a widely used technology in the last decade for detecting various types of genomic aberrations in cancer genomes. However, there are several major issues encountered in these detection problems, including normal cell contamination, tumor aneuploidy, and intra-tumor heterogeneity. Especially, deciphering the intra-tumor heterogeneity is imperative for identifying clonal and subclonal copy number alterations.ResultsWe introduce CloneCNA, a novel bioinformatics tool for efficiently addressing these issues and automatically detecting clonal and subclonal somatic copy number alterations from heterogeneous tumor samples. CloneCNA fully explores the log ratio of read counts between paired tumor-normal samples and tumor B allele frequency of germline heterozygous SNP positions, further employs efficient statistical models to quantitatively represent copy number status of tumor sample containing multiple clones. We examine CloneCNA on simulated heterogeneous and real tumor samples, and the results demonstrate that CloneCNA has higher power to detect copy number alterations than existing methods.ConclusionsCloneCNA, a novel algorithm is developed to efficiently and accurately identify somatic copy number alterations from heterogeneous tumor samples. We demonstrate the statistical framework of CloneCNA represents a remarkable advance for tumor whole-exome sequencing data. We expect that CloneCNA will promote cancer-focused studies for investigating the role of clonal evolution and elucidating critical events benefiting tumor tumourigenesis and progression.
Project description:Multiple large-scale genomic profiling efforts have been undertaken in osteosarcoma to define the genomic drivers of tumorigenesis, therapeutic response, and disease recurrence. The spatial and temporal intratumor heterogeneity could also play a role in promoting tumor growth and treatment resistance. We conducted longitudinal whole-genome sequencing of 37 tumor samples from 8 patients with relapsed or refractory osteosarcoma. Each patient had at least one sample from a primary site and a metastatic or relapse site. Subclonal copy-number alterations were identified in all patients except one. In 5 patients, subclones from the primary tumor emerged and dominated at subsequent relapses. MYC gain/amplification was enriched in the treatment-resistant clones in 6 of 7 patients with multiple clones. Amplifications in other potential driver genes, such as CCNE1, RAD21, VEGFA, and IGF1R, were also observed in the resistant copy-number clones. A chromosomal duplication timing analysis revealed that complex genomic rearrangements typically occurred prior to diagnosis, supporting a macroevolutionary model of evolution, where a large number of genomic aberrations are acquired over a short period of time followed by clonal selection, as opposed to ongoing evolution. A mutational signature analysis of recurrent tumors revealed that homologous repair deficiency (HRD)-related SBS3 increases at each time point in patients with recurrent disease, suggesting that HRD continues to be an active mutagenic process after diagnosis. Overall, by examining the clonal relationships between temporally and spatially separated samples from patients with relapsed/refractory osteosarcoma, this study sheds light on the intratumor heterogeneity and potential drivers of treatment resistance in this disease.SignificanceThe chemoresistant population in recurrent osteosarcoma is subclonal at diagnosis, emerges at the time of primary resection due to selective pressure from neoadjuvant chemotherapy, and is characterized by unique oncogenic amplifications.
Project description:As next-generation sequencing technology advances and the cost decreases, whole genome sequencing (WGS) has become the preferred platform for the identification of somatic copy number alteration (CNA) events in cancer genomes. To more effectively decipher these massive sequencing data, we developed a software program named SEG, shortened from the word "segment". SEG utilizes mapped read or fragment density for CNA discovery. To reduce CNA artifacts arisen from sequencing and mapping biases, SEG first normalizes the data by taking the log2-ratio of each tumor density against its matching normal density. SEG then uses dynamic programming to find change-points among a contiguous log2-ratio data series along a chromosome, dividing the chromosome into different segments. SEG finally identifies those segments having CNA. Our analyses with both simulated and real sequencing data indicate that SEG finds more small CNAs than other published software tools.
Project description:Liquid biopsies, i.e. the analysis of circulating tumor cells (CTCs) or circulating tumor DNA (ctDNA), are evolving into promising clinical tools. Indeed, a plethora of liquid biopsy technologies to deduce non-invasively characteristics of the tumor genome from the peripheral blood have been developed over the last few years. For example, liquid biopsies have been used to assess the tumor burden, to monitor the evolution of tumor genomes, to unravel mechanisms of resistance, to establish the tumor heterogeneity, and for the identification of prognostic and predictive markers. In this review we focus on methods to establish genome-wide profiles of somatic copy number alterations (SCNAs) from plasma DNA and show how they provide novel insights into the biology of cancer and their impact on the management of patients.
Project description:Gastric cancer, a leading worldwide cause of cancer mortality, shows high geographic and ethnic variation in incidence rates, which are highest in East Asia. The anatomic locations and clinical behavior also differ by geography, leading to the controversial idea that Eastern and Western forms of the disease are distinct. In view of these differences, we investigated whether gastric cancers from Eastern and Western patients show distinct genomic profiles. We used high-density profiling of somatic copy-number aberrations to analyze the largest collection to date of gastric adenocarcinomas and utilized genotyping data to rigorously annotate ethnic status. The size of this collection allowed us to accurately identify regions of significant copy-number alteration and separately to evaluate tumors arising in Eastern and Western patients. Among molecular subtypes classified by The Cancer Genome Atlas, the frequency of gastric cancers showing chromosomal instability was modestly higher in Western patients. After accounting for this difference, however, gastric cancers arising in Easterners and Westerners have highly similar somatic copy-number patterns. Only one genomic event, focal deletion of the phosphatase gene PTPRD, was significantly enriched in Western cases, though also detected in Eastern cases. Thus, despite the different risk factors and clinical features, gastric cancer appears to be a fundamentally similar disease in both populations and the divergent clinical outcomes cannot be ascribed to different underlying structural somatic genetic aberrations.
Project description:Somatic copy number alterations (SCNAs) are a pervasive trait of human cancers that contributes to tumorigenesis by affecting the dosage of multiple genes at the same time. In the past decade, The Cancer Genome Atlas (TCGA) and the International Cancer Genome Consortium (ICGC) initiatives have generated and made publicly available SCNA genomic profiles from thousands of tumor samples across multiple cancer types. Here, we present a comprehensive analysis of 853,218 SCNAs across 10,729 tumor samples belonging to 32 cancer types using TCGA data. We then discuss current models for how SCNAs likely arise during carcinogenesis and how genomic SCNA profiles can inform clinical practice. Lastly, we highlight open questions in the field of cancer-associated SCNAs.
Project description:Exome sequencing constitutes an important technology for the study of human hereditary diseases and cancer. However, the ability of this approach to identify copy number alterations in primary tumor samples has not been fully addressed. Here we show that somatic copy number alterations can be reliably estimated using exome sequencing data through a strategy that we have termed exome2cnv. Using data from 86 paired normal and primary tumor samples, we identified losses and gains of complete chromosomes or large genomic regions, as well as smaller regions affecting a minimum of one gene. Comparison with high-resolution comparative genomic hybridization (CGH) arrays revealed a high sensitivity and a low number of false positives in the copy number estimation between both approaches. We explore the main factors affecting sensitivity and false positives with real data, and provide a side by side comparison with CGH arrays. Together, these results underscore the utility of exome sequencing to study cancer samples by allowing not only the identification of substitutions and indels, but also the accurate estimation of copy number alterations.
Project description:Somatic copy number alterations (SCNAs) are important biological characteristics that can identify genome-wide alterations in renal cell carcinoma (RCC). Recent studies have shown that SCNAs have potential value for determining the prognosis of RCC. We examined SCNAs using the Affymetrix platform to analyze samples from 59 patients with clear cell RCCs (ccRCCs) including first cohort (30 cases) and second cohort (validation cohort, 29 cases). We stratified SCNAs in the ccRCCs using a hierarchical cluster analysis based on SCNA types, including gain, loss of heterozygosity (LOH), copy neutral LOH, mosaic, and mixed types. In this way, the examined two cohorts were categorized into two subgroups (1 and 2). Although the frequency of mixed type was higher in subgroup 1 than in subgroup 2 in the two cohorts, the association did not reach statistical significance. There was a significant difference in the frequency of metachronous metastasis between subgroups 1 and 2 (subgroup 2?>?1). In addition, subgroup 2 was retained in multivariate analysis of both cohorts. We examined whether there were specific alleles differing between subgroups 1 and 2 in both cohorts. We found that there was indeed a statistically significant difference in the 3p mixed types. Among the 3p mixed type, we found that 3p24.3 mixed type was inversely correlated with the presence of metachronous metastasis in ccRCC. The association was also retained in multivariate analysis in second cohort. We suggest that the 3p24.3 mixed type may be a novel marker to predict a favorable prognosis in ccRCC.
Project description:MicroRNAs (miRNAs) are key molecules that regulate biological processes such as cell proliferation, differentiation, and apoptosis in cancer. Somatic copy number alterations (SCNAs) are common genetic mutations that play essential roles in cancer development. Here, we investigated the association between miRNAs and SCNAs in cancer. We collected 2538 tumor samples for seven cancer types from The Cancer Genome Atlas. We found that 32-84% of miRNAs are in SCNA regions, with the rate depending on the cancer type. In these regions, we identified 80 SCNA-miRNAs whose expression was mainly associated with SCNAs in at least one cancer type and showed that these SCNA-miRNAs are related to cancer by survival analysis and literature searching. We also identified 58 SCNA-miRNAs common in the seven cancer types (CC-SCNA-miRNAs) and showed that these CC-SCNA-miRNAs are more likely to be related with protein and gene expression than other miRNAs. Furthermore, we experimentally validated the oncogenic role of miR-589. In conclusion, our results suggest that SCNA-miRNAs significantly alter biological processes related to cancer development, confirming the importance of SCNAs in non-coding regions in cancer.