
Dataset Information


Spectral probabilities and generating functions of tandem mass spectra: a strike against decoy databases.


ABSTRACT: A key problem in computational proteomics is distinguishing between correct and false peptide identifications. We argue that evaluating the error rates of peptide identifications is not unlike computing generating functions in combinatorics. We show that the generating functions and their derivatives (spectral energy and spectral probability) represent new features of tandem mass spectra that, similarly to Delta-scores, significantly improve peptide identifications. Furthermore, the spectral probability provides a rigorous solution to the problem of computing statistical significance of spectral identifications. The spectral energy/probability approach improves the sensitivity-specificity tradeoff of existing MS/MS search tools, addresses the notoriously difficult problem of "one-hit-wonders" in mass spectrometry, and often eliminates the need for decoy database searches. We therefore argue that the generating function approach has the potential to increase the number of peptide identifications in MS/MS searches.

SUBMITTER: Kim S 

PROVIDER: S-EPMC2689316 | biostudies-literature | 2008 Aug

REPOSITORIES: biostudies-literature


Publications

Spectral probabilities and generating functions of tandem mass spectra: a strike against decoy databases.

Kim Sangtae, Gupta Nitin, Pevzner Pavel A

Journal of Proteome Research, 2008-07-03, issue 8



Similar Datasets

| S-EPMC4611636 | biostudies-other
| S-EPMC2621003 | biostudies-literature
| S-EPMC4256515 | biostudies-literature
| S-EPMC2533155 | biostudies-literature
| S-EPMC3101864 | biostudies-other
| S-EPMC4721644 | biostudies-literature
| S-EPMC2938093 | biostudies-literature
| S-EPMC2527591 | biostudies-literature
| S-EPMC5297990 | biostudies-literature
| S-EPMC3905687 | biostudies-literature