Unknown

Dataset Information

0

Proteogenomic analysis prioritises functional single nucleotide variants in cancer samples.


ABSTRACT: Massively parallel DNA sequencing enables the detection of thousands of germline and somatic single nucleotide variants (SNVs) in cancer samples. The functional analysis of these mutations is often carried out through in silico predictions, with further downstream experimental validation rarely performed. Here, we examine the potential of using mass spectrometry-based proteomics data to further annotate the function of SNVs in cancer samples. RNA-seq and whole genome sequencing (WGS) data from Jurkat cells were used to construct a custom database of single amino acid variant (SAAV) containing peptides and identified over 1,000 such peptides in two Jurkat proteomics datasets. The analysis enabled the detection of a truncated form of splicing regulator YTHDC1 at the protein level. To extend the functional annotation further, a Jurkat phosphoproteomics dataset was analysed, identifying 463 SAAV containing phosphopeptides. Of these phosphopeptides, 24 SAAVs were found to directly impact the phosphorylation event through the creation of either a phosphorylation site or a kinase recognition motif. We identified a novel phosphorylation site created by a SAAV in splicing factor SF3B1, a protein that is frequently mutated in leukaemia. To our knowledge, this is the first study to use phosphoproteomics data to directly identify novel phosphorylation events arising from the creation of phosphorylation sites by SAAVs. Our study reveals multiple functional mutations impacting the splicing pathway in Jurkat cells and demonstrates potential benefits of an integrative proteogenomics analysis for high-throughput functional annotation of SNVs in cancer.

SUBMITTER: Ma S 

PROVIDER: S-EPMC5707065 | biostudies-literature | 2017 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Proteogenomic analysis prioritises functional single nucleotide variants in cancer samples.

Ma Shiyong S   Menon Ranjeeta R   Poulos Rebecca C RC   Wong Jason W H JWH  

Oncotarget 20170927 56


Massively parallel DNA sequencing enables the detection of thousands of germline and somatic single nucleotide variants (SNVs) in cancer samples. The functional analysis of these mutations is often carried out through <i>in silico</i> predictions, with further downstream experimental validation rarely performed. Here, we examine the potential of using mass spectrometry-based proteomics data to further annotate the function of SNVs in cancer samples. RNA-seq and whole genome sequencing (WGS) data  ...[more]

Similar Datasets

| S-EPMC6072548 | biostudies-literature
| S-EPMC6678189 | biostudies-literature
| S-EPMC8574974 | biostudies-literature
| S-EPMC4046713 | biostudies-literature
| S-EPMC6382573 | biostudies-literature
| S-EPMC3918476 | biostudies-literature
| S-EPMC6192032 | biostudies-literature
| S-EPMC6460560 | biostudies-literature
| S-EPMC7693424 | biostudies-literature
| S-EPMC5702214 | biostudies-literature