Unknown

Dataset Information

0

Proteoform Identification by Combining RNA-Seq and Top-Down Mass Spectrometry.


ABSTRACT: In proteogenomic studies, genomic and transcriptomic variants are incorporated into customized protein databases for the identification of proteoforms, especially proteoforms with sample-specific variants. Most proteogenomic research has been focused on combining genomic or transcriptomic data with bottom-up mass spectrometry data. In the last decade, top-down mass spectrometry has attracted increasing attention because of its capacity to identify various proteoforms with alterations. However, top-down proteogenomics, in which genomic or transcriptomic data are combined with top-down mass spectrometry data, has not been widely adopted, and there is still a lack of software tools for top-down proteogenomic data analysis. In this paper, we introduce TopPG, a proteogenomic tool for generating proteoform sequence databases with genetic alterations and alternative splicing events. Experiments on top-down proteogenomic data of DLD-1 colorectal cancer cells showed that TopPG coupled with database search confidently identified proteoforms with sample-specific alterations.

SUBMITTER: Chen W 

PROVIDER: S-EPMC7775893 | biostudies-literature | 2021 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Proteoform Identification by Combining RNA-Seq and Top-Down Mass Spectrometry.

Chen Wenrong W   Liu Xiaowen X  

Journal of proteome research 20201112 1


In proteogenomic studies, genomic and transcriptomic variants are incorporated into customized protein databases for the identification of proteoforms, especially proteoforms with sample-specific variants. Most proteogenomic research has been focused on combining genomic or transcriptomic data with bottom-up mass spectrometry data. In the last decade, top-down mass spectrometry has attracted increasing attention because of its capacity to identify various proteoforms with alterations. However, t  ...[more]

Similar Datasets

| S-EPMC11839095 | biostudies-literature
| S-EPMC5181555 | biostudies-literature
| S-EPMC5825287 | biostudies-literature
| S-EPMC6698994 | biostudies-literature
| S-EPMC3532624 | biostudies-literature
| S-EPMC9250612 | biostudies-literature
| S-EPMC8543976 | biostudies-literature
| S-EPMC10563160 | biostudies-literature
| S-EPMC10375574 | biostudies-literature
| S-EPMC10557138 | biostudies-literature