Proteomics

Dataset Information

0

JUMPg: an Integrative Proteogenomics Pipeline Identifying Unannotated Proteins in Human Brain and Cancer Cells


ABSTRACT: Proteogenomics is an emerging approach to improve gene annotation and interpretation of proteomics data. Here we present JUMPg, an integrative proteogenomics pipeline including customized database construction, tag-based database search, peptide-spectrum match filtering, and data visualization. JUMPg creates multiple databases of DNA polymorphisms, mutations, splice junctions, partially trypticity, as well as protein fragments translated from the whole transcriptome in all six frames after RNA-seq de novo assembly. We use a multistage strategy to search these databases sequentially, in which the performance is optimized by re-searching only unmatched high quality spectra, and re-using amino acid tags generated by the JUMP search engine. The identified peptides/proteins are displayed with gene loci using the UCSC genome browser. The JUMPg is applied to process a label-free mass spectrometry dataset of Alzheimer’s disease postmortem brain, uncovering 496 new peptides of amino acid substitutions, alternative splicing, frame shift, and “non-coding gene” translation. The novel protein PNMA6BL specifically expressed in the brain is highlighted. We also tested JUMPg to analyze a stable-isotope labeled dataset of multiple myeloma cells, revealing 991 sample-specific peptides that include protein sequences in the immunoglobulin light chain variable region. Thus, the JUMPg program is an effective proteogenomics tool for multi-omics data integration.

INSTRUMENT(S): Q Exactive

ORGANISM(S): Homo Sapiens (human)

TISSUE(S): Brain

DISEASE(S): Alzheimer's Disease

SUBMITTER: xusheng wang  

LAB HEAD: Junmin Peng

PROVIDER: PXD004010 | Pride | 2017-02-20

REPOSITORIES: Pride

Dataset's files

Source:
Action DRS
ad_pl01.1.pepXML Pepxml
ad_pl01.raw Raw
ad_pl02.1.pepXML Pepxml
ad_pl02.raw Raw
ad_pl03.1.pepXML Pepxml
Items per page:
1 - 5 of 20
altmetric image

Publications

JUMPg: An Integrative Proteogenomics Pipeline Identifying Unannotated Proteins in Human Brain and Cancer Cells.

Li Yuxin Y   Wang Xusheng X   Cho Ji-Hoon JH   Shaw Timothy I TI   Wu Zhiping Z   Bai Bing B   Wang Hong H   Zhou Suiping S   Beach Thomas G TG   Wu Gang G   Zhang Jinghui J   Peng Junmin J  

Journal of proteome research 20160613 7


Proteogenomics is an emerging approach to improve gene annotation and interpretation of proteomics data. Here we present JUMPg, an integrative proteogenomics pipeline including customized database construction, tag-based database search, peptide-spectrum match filtering, and data visualization. JUMPg creates multiple databases of DNA polymorphisms, mutations, splice junctions, partially trypticity, as well as protein fragments translated from the whole transcriptome in all six frames upon RNA-se  ...[more]

Similar Datasets

2017-03-16 | MSV000080641 | MassIVE
2019-08-19 | PXD012744 | Pride
2017-12-20 | GSE92659 | GEO
2016-12-22 | GSE67174 | GEO
2016-11-25 | GSE88790 | GEO
2020-08-01 | GSE143263 | GEO
| PRJNA153537 | ENA
2017-04-24 | PXD004896 | Pride
2015-02-02 | PXD001677 | Pride
2024-01-28 | PXD041585 | Pride