Unknown

Dataset Information

0

TagGraph reveals vast protein modification landscapes from large tandem mass spectrometry datasets.


ABSTRACT: Although mass spectrometry is well suited to identifying thousands of potential protein post-translational modifications (PTMs), it has historically been biased towards just a few. To measure the entire set of PTMs across diverse proteomes, software must overcome the dual challenges of covering enormous search spaces and distinguishing correct from incorrect spectrum interpretations. Here, we describe TagGraph, a computational tool that overcomes both challenges with an unrestricted string-based search method that is as much as 350-fold faster than existing approaches, and a probabilistic validation model that we optimized for PTM assignments. We applied TagGraph to a published human proteomic dataset of 25?million mass spectra and tripled confident spectrum identifications compared to its original analysis. We identified thousands of modification types on almost 1?million sites in the proteome. We show alternative contexts for highly abundant yet understudied PTMs such as proline hydroxylation, and its unexpected association with cancer mutations. By enabling broad characterization of PTMs, TagGraph informs as to how their functions and regulation intersect.

SUBMITTER: Devabhaktuni A 

PROVIDER: S-EPMC6447449 | biostudies-literature | 2019 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

TagGraph reveals vast protein modification landscapes from large tandem mass spectrometry datasets.

Devabhaktuni Arun A   Lin Sarah S   Zhang Lichao L   Swaminathan Kavya K   Gonzalez Carlos G CG   Olsson Niclas N   Pearlman Samuel M SM   Rawson Keith K   Elias Joshua E JE  

Nature biotechnology 20190401 4


Although mass spectrometry is well suited to identifying thousands of potential protein post-translational modifications (PTMs), it has historically been biased towards just a few. To measure the entire set of PTMs across diverse proteomes, software must overcome the dual challenges of covering enormous search spaces and distinguishing correct from incorrect spectrum interpretations. Here, we describe TagGraph, a computational tool that overcomes both challenges with an unrestricted string-based  ...[more]

Similar Datasets

| S-EPMC3322561 | biostudies-literature
| S-EPMC2773710 | biostudies-literature
| S-EPMC8215474 | biostudies-literature
| S-EPMC4184452 | biostudies-literature
| S-EPMC8341206 | biostudies-literature
| S-EPMC11007677 | biostudies-literature
| S-EPMC10032035 | biostudies-literature
| S-EPMC4304122 | biostudies-literature
| S-EPMC8364181 | biostudies-literature
| S-EPMC3477243 | biostudies-literature