
Dataset Information


Identification of gene fusions from human lung cancer mass spectrometry data.


ABSTRACT:

Background

Tandem mass spectrometry (MS/MS) has been applied to identify proteins, serving as a definitive way to confirm the original genome annotation. Identifying gene fusion proteins requires a dedicated database containing peptides that span gene fusion breakpoints.

Methods

It is impractical to construct a database that includes all possible fusion peptides originating from potential breakpoints. Focusing on 6259 reported and predicted gene fusion pairs from ChimerDB 2.0 and the Cancer Gene Census, we created, for the first time, a database, CanProFu, that comprehensively annotates the fusion peptides formed by exon-exon linkage between the paired genes.
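The idea of enumerating peptides formed by exon-exon linkage can be illustrated with a minimal sketch. This is not the CanProFu pipeline: the function names, the tryptic digestion rule (cleave after K or R unless followed by P, no missed cleavages), and the toy sequences are all assumptions made for illustration. It joins the coding sequence of an upstream gene, cut at an exon end, to a downstream gene's sequence starting at an exon boundary, translates the fusion in the upstream reading frame, and keeps only peptides that contain residues from both sides of the junction.

```python
from itertools import product

# Standard genetic code, built in the conventional TCAG codon order.
_BASES = "TCAG"
_AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
          "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def translate(nt):
    """Translate a nucleotide string in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(nt) - 2, 3):
        aa = CODON_TABLE[nt[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def tryptic_peptides(protein):
    """Yield (start, end, sequence) for fully tryptic peptides, no missed cleavages."""
    start = 0
    for i, aa in enumerate(protein):
        is_last = i == len(protein) - 1
        if is_last or (aa in "KR" and protein[i + 1] != "P"):
            yield start, i + 1, protein[start:i + 1]
            start = i + 1


def fusion_junction_peptides(upstream_cds, downstream_nt):
    """Return tryptic peptides that span the exon-exon junction of a hypothetical fusion.

    upstream_cds  : coding sequence of the 5' gene, from its start codon to an exon end
    downstream_nt : sequence of the 3' gene, starting at an exon boundary
    """
    breakpoint_nt = len(upstream_cds)
    protein = translate(upstream_cds + downstream_nt)
    spanning = []
    for start, end, seq in tryptic_peptides(protein):
        # The peptide's nucleotide span must include bases on both sides of the breakpoint.
        if start * 3 < breakpoint_nt < end * 3:
            spanning.append(seq)
    return spanning


# Toy sequences (invented for illustration, not real genes).
peptides = fusion_junction_peptides("ATGGCTAAAGGTGAA", "CTGTCTCGTGATTAA")
print(peptides)
```

Repeating such a step for every exon-end and exon-start combination within each gene pair would give one candidate junction peptide set per pair; restricting the enumeration to known or predicted pairs is what keeps the search space manageable.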

Results

Applying this database to mass spectrometry datasets from 40 human non-small cell lung cancer (NSCLC) samples and 39 normal lung samples with stringent search criteria, we identified 19 unique fusion peptides characterizing gene fusion events. Of these events, 11 were found only in NSCLC samples. In addition, 4 alternative splicing events were characterized in cancerous or normal lung samples.

Conclusions

The database and workflow presented here can be flexibly applied to other MS/MS-based human cancer experiments to detect gene fusions as potential disease biomarkers or drug targets.

SUBMITTER: Sun H 

PROVIDER: S-EPMC4042237 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature


Publications

Identification of gene fusions from human lung cancer mass spectrometry data.

Sun Han, Xing Xiaobin, Li Jing, Zhou Fengli, Chen Yunqin, He Ying, Li Wei, Wei Guangwu, Chang Xiao, Jia Jia, Li Yixue, Xie Lu

BMC Genomics, 2013-12-09



Similar Datasets

| S-EPMC2655099 | biostudies-literature
| PRJEB45336 | ENA
| S-EPMC3916180 | biostudies-literature
2020-10-20 | GSE157610 | GEO
| EGAC00001002236 | EGA
| S-EPMC8185368 | biostudies-literature
| S-EPMC5096969 | biostudies-literature
| S-EPMC6986310 | biostudies-literature
| S-EPMC5462478 | biostudies-literature
2005-09-20 | GSE2744 | GEO