Beyond the coding genome: non-coding mutations and cancer.
Ontology highlight
ABSTRACT: Latest advancements in genomics involving individuals from different races and geographical locations has led to the identification of thousands of common as well as rare genetic variants and copy number variations (CNVs). These studies have surprisingly revealed that the majority of genetic variation is not present within the coding region but rather in the non-coding region of the genome, which is also termed as "Medical Genome". This short review describes how mutations/variations within; regulatory sequences, architectural proteins and transcriptional regulators give rise to the aberrant gene expression profiles that drives cellular transformations and malignancies.
Project description:We address here the issue of prioritizing non-coding mutations in the tumoral genome. To this aim, we created two independent computational models. The first (germline) model estimates purifying selection based on population SNP data. The second (somatic) model estimates tumor mutation density based on whole genome tumor sequencing. We show that each model reflects a different set of constraints acting either on the normal or tumor genome, and we identify the specific genome features that most contribute to these constraints. Importantly, we show that the somatic mutation model carries independent functional information that can be used to narrow down the non-coding regions that may be relevant to cancer progression. On this basis, we identify positions in non-coding RNAs and the non-coding parts of mRNAs that are both under purifying selection in the germline and protected from mutation in tumors, thus introducing a new strategy for future detection of cancer driver elements in the expressed non-coding genome.
Project description:Non-coding mutations can create splice sites, however the true extent of how such somatic non-coding mutations affect RNA splicing are largely unexplored. Here we use the MiSplice pipeline to analyze 783 cancer cases with WGS data and 9494 cases with WES data, discovering 562 non-coding mutations that lead to splicing alterations. Notably, most of these mutations create new exons. Introns associated with new exon creation are significantly larger than the genome-wide average intron size. We find that some mutation-induced splicing alterations are located in genes important in tumorigenesis (ATRX, BCOR, CDKN2B, MAP3K1, MAP3K4, MDM2, SMAD4, STK11, TP53 etc.), often leading to truncated proteins and affecting gene expression. The pattern emerging from these exon-creating mutations suggests that splice sites created by non-coding mutations interact with pre-existing potential splice sites that originally lacked a suitable splicing pair to induce new exon formation. Our study suggests the importance of investigating biological and clinical consequences of noncoding splice-inducing mutations that were previously neglected by conventional annotation pipelines. MiSplice will be useful for automatically annotating the splicing impact of coding and non-coding mutations in future large-scale analyses.
Project description:Pancreatic cancer is the fourth-leading cause of cancer-related deaths worldwide. The outcomes in patients with pancreatic cancer remain unsatisfactory. In the current review, we summarize the genetic and epigenetic architecture of metastatic pancreatic cancer beyond the BRCA mutations, focusing on the genetic alterations and the molecular pathology in pancreatic cancer. This review focuses on the molecular targets for the treatment of pancreatic cancer, with a correlation to future treatments. The potential approach addressed in this review may lead to the identification of a subset of patients with specific biological behaviors and treatment responses.
Project description:Cardiomyopathies (CMs) are a group of cardiac pathologies caused by an intrinsic defect within the myocardium. The relative contribution of genetic mutations in the pathogenesis of certain CMs, such as hypertrophic cardiomyopathy (HCM), arrythmogenic right/left ventricular cardiomyopathy (ARVC) and left ventricular non-compacted cardiomyopathy (LVNC) has been established in comparison to dilated cardiomyopathy (DCM) and restrictive cardiomyopathy (RCM). The aim of this article is to review mutations in the non-coding parts of the genome, namely, microRNA, promoter elements, enhancer/silencer elements, 3'/5'UTRs and introns, that are involved in the pathogenesis CMs. Additionally, we will explore the role of some long non-coding RNAs in the pathogenesis of CMs.
Project description:The functional consequences of somatic non-coding mutations in ovarian cancer (OC) are unknown. To identify regulatory elements (RE) and genes perturbed by acquired non-coding variants, here we establish epigenomic and transcriptomic landscapes of primary OCs using H3K27ac ChIP-seq and RNA-seq, and then integrate these with whole genome sequencing data from 232 OCs. We identify 25 frequently mutated regulatory elements, including an enhancer at 6p22.1 which associates with differential expression of ZSCAN16 (P?=?6.6?×?10-4) and ZSCAN12 (P?=?0.02). CRISPR/Cas9 knockout of this enhancer induces downregulation of both genes. Globally, there is an enrichment of single nucleotide variants in active binding sites for TEAD4 (P?=?6?×?10-11) and its binding partner PAX8 (P?=?2×10-10), a known lineage-specific transcription factor in OC. In addition, the collection of cis REs associated with PAX8 comprise the most frequently mutated set of enhancers in OC (P?=?0.003). These data indicate that non-coding somatic mutations disrupt the PAX8 transcriptional network during OC development.
Project description:Numerous pan-genomic studies identified alterations in protein-coding genes and signaling pathways involved in bladder carcinogenesis, while non-coding somatic alterations remain weakly explored. The goal of this study was to identify clinical biomarkers in non-coding regions for bladder cancer patients. We have previously identified in bladder tumors two non-coding mutational hotspots occurring at high frequencies (≥30%). These mutations are located close to the GPR126 and PLEKHS1 genes, at the guanine or the cytosine of a TGAACA core motif flanked, on both sides, by a stretch of palindromic sequences. Here, we hypothesize that such a pattern of recurrent non-coding mutations could be a signature of somatic genomic instability specifically involved in bladder cancer. We analyzed 26 additional mutable non-coding sites with the same core motif in a cohort of 103 bladder cancers composed of 44 NMIBC cases and 59 MIBC cases using high-resolution melting (HRM) and Sanger sequencing. Five bladder cancers were additionally analyzed for protein-coding gene mutations using a targeted NGS panel composed of 571 genes. Expression levels of three members of the APOBEC3 family genes were assessed using real-time quantitative RT-PCR. Non-coding somatic mutations were observed for at least one TGAACA core motif locus in 62.1% (64/103) of bladder tumor samples. These non-coding mutations co-occurred in the bladder tumors but were absent in prostate tumor, HPV-positive Head and Neck Squamous Cell Carcinoma, and high microsatellite instability (MSI-H) colorectal tumor series. This signature of palindromic non-coding somatic mutations, specific to bladder tumors, was not associated with patients' outcome and was more frequent in females. Interestingly, this signature was associated with high tumor mutational burden (TMB) and high expression levels of APOBEC3B and interferon inducible genes. We identified a new type of somatic genomic instability targeting the TGAACA core motif loci flanked by palindromic sequences in bladder cancer. This mutational signature is a promising candidate clinical biomarker for the early detection of relapse and a major low-cost alternative to the TMB to monitor the response to immunotherapy for bladder cancer patients.
Project description:Cancer-related mutations have been mainly identified in protein-coding regions. Recent studies have demonstrated that mutations in non-coding regions of the genome could also be a risk factor for cancer. However, the non-coding regions comprise 98% of the total length of the human genome and contain a huge number of mutations, making it difficult to interpret their impacts on pathogenesis of cancer. To comprehensively identify cancer-related non-coding mutations, we focused on recurrent mutations in non-coding regions using somatic mutation data from COSMIC and whole-genome sequencing data from The Cancer Genome Atlas (TCGA). We identified 21 574 recurrent mutations in non-coding regions that were shared by at least two different samples from both COSMIC and TCGA databases. Among them, 580 candidate cancer-related non-coding recurrent mutations were identified based on epigenomic and chromatin structure datasets. One of such mutation was located in RREB1 binding site that is thought to interact with TEAD1 promoter. Our results suggest that mutations may disrupt the binding of RREB1 to the candidate enhancer region and increase TEAD1 expression levels. Our findings demonstrate that non-coding recurrent mutations and coding mutations may contribute to the pathogenesis of cancer.
Project description:In the past decades, studies of the Janus kinases (JAKs) and signal transducers and activators of transcription (STATs) signaling have uncovered highly conserved programs linking cytokine signaling to the regulation of essential cellular mechanisms such as proliferation, invasion, survival, inflammation and immunity. Inhibitors of the JAK/STAT pathway are used for treatment of autoimmune diseases, such as rheumatoid arthritis or psoriasis. Aberrant JAK/STAT signaling has been identified to contribute to cancer progression and metastatic development. Targeting of JAK/STAT pathway is currently one of the most promising therapeutic strategies in prostate cancer (PCa), hematopoietic malignancies and sarcomas. Notably, newly identified regulators of JAK/STAT signaling, the non-coding RNAs transcripts and their role as important targets and potential clinical biomarkers are highlighted in this review. In addition to the established role of the JAK/STAT signaling pathway in traditional cytokine signaling the non-coding RNAs add yet another layer of hidden regulation and function. Understanding the crosstalk of non-coding RNA with JAK/STAT signaling in cancer is of critical importance and may result in better patient stratification not only in terms of prognosis but also in the context of therapy.
Project description:Cancer is a disease of the genome caused by oncogene activation and tumor suppressor gene inhibition. Deep sequencing studies including large consortia such as TCGA and ICGC identified numerous tumor-specific mutations not only in protein-coding sequences but also in non-coding sequences. Although 98% of the genome is not translated into proteins, most studies have neglected the information hidden in this "dark matter" of the genome. Malignancy-driving mutations can occur in all genetic elements outside the coding region, namely in enhancer, silencer, insulator, and promoter as well as in 5'-UTR and 3'-UTR Intron or splice site mutations can alter the splicing pattern. Moreover, cancer genomes contain mutations within non-coding RNA, such as microRNA, lncRNA, and lincRNA A synonymous mutation changes the coding region in the DNA and RNA but not the protein sequence. Importantly, oncogenes such as TERT or miR-21 as well as tumor suppressor genes such as TP53/p53, APC, BRCA1, or RB1 can be affected by these alterations. In summary, coding-independent mutations can affect gene regulation from transcription, splicing, mRNA stability to translation, and hence, this largely neglected area needs functional studies to elucidate the mechanisms underlying tumorigenesis. This review will focus on the important role and novel mechanisms of these non-coding or allegedly silent mutations in tumorigenesis.
Project description:Methodological advances that allow deeper characterization of non-coding elements in the genome have started to reveal the full spectrum of deregulation in cancer. We generated an inducible cell model to track transcriptional changes after induction of a well-known leukemia-inducing fusion gene, ETV6-RUNX1. Our data revealed widespread transcriptional alterations outside coding elements in the genome. This adds to the growing list of various alterations in the non-coding genome in cancer and pinpoints their role in diseased cellular state.