Dataset Information

Tandem Mass Spectrum Identification via Cascaded Search.

ABSTRACT: Accurate assignment of peptide sequences to observed fragmentation spectra is hindered by the large number of hypotheses that must be considered for each observed spectrum. A high score assigned to a particular peptide-spectrum match (PSM) may not end up being statistically significant after multiple testing correction. Researchers can mitigate this problem by controlling the hypothesis space in various ways: considering only peptides resulting from enzymatic cleavages, ignoring possible post-translational modifications or single nucleotide variants, etc. However, these strategies sacrifice identifications of spectra generated by rarer types of peptides. In this work, we introduce a statistical testing framework, cascade search, that directly addresses this problem. The method requires that the user specify a priori a statistical confidence threshold as well as a series of peptide databases. For instance, such a cascade of databases could include fully tryptic, semitryptic, and nonenzymatic peptides or peptides with increasing numbers of modifications. Cascaded search then gradually expands the list of candidate peptides from more likely peptides toward rare peptides, sequestering at each stage any spectrum that is identified with a specified statistical confidence. We compare cascade search to a standard procedure that lumps all of the peptides into a single database, as well as to a previously described group FDR procedure that computes the FDR separately within each database. We demonstrate, using simulated and real data, that cascade search identifies more spectra at a fixed FDR threshold than with either the ungrouped or grouped approach. Cascade search thus provides a general method for maximizing the number of identified spectra in a statistically rigorous fashion.

SUBMITTER: Kertesz-Farkas A

PROVIDER: S-EPMC4533645 | biostudies-literature | 2015 Aug

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Tandem Mass Spectrum Identification via Cascaded Search.

Kertesz-Farkas Attila A Keich Uri U Noble William Stafford WS

Journal of proteome research 20150630 8

Accurate assignment of peptide sequences to observed fragmentation spectra is hindered by the large number of hypotheses that must be considered for each observed spectrum. A high score assigned to a particular peptide-spectrum match (PSM) may not end up being statistically significant after multiple testing correction. Researchers can mitigate this problem by controlling the hypothesis space in various ways: considering only peptides resulting from enzymatic cleavages, ignoring possible post-tr ...[more]

PMID: 26084232

Dataset Information

Tandem Mass Spectrum Identification via Cascaded Search.

Publications

Tandem Mass Spectrum Identification via Cascaded Search.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

A Tandem Mass Spectrometry Sequence Database Search Method for Identification of O-Fucosylated Proteins by Mass Spectrometry.
| S-EPMC6445572 | biostudies-literature

Learning score function parameters for improved spectrum identification in tandem mass spectrometry experiments.
| S-EPMC3436966 | biostudies-literature

In Search of Disentanglement in Tandem Mass Spectrometry Datasets.
| S-EPMC10526774 | biostudies-literature

Fast multi-blind modification search through tandem mass spectrometry.
| S-EPMC3322561 | biostudies-literature

Effective Leveraging of Targeted Search Spaces for Improving Peptide Identification in Tandem Mass Spectrometry Based Proteomics.
| S-EPMC4748730 | biostudies-literature

Ariadne: a database search engine for identification and chemical analysis of RNA using tandem mass spectrometry data.
| S-EPMC2665244 | biostudies-literature

FDRAnalysis: a tool for the integrated analysis of tandem mass spectrometry identification results from multiple search engines.
| S-EPMC3707089 | biostudies-literature

Database search algorithm for identification of intact cross-links in proteins and peptides using tandem mass spectrometry.
| S-EPMC4141472 | biostudies-literature

Robust Accurate Identification and Biomass Estimates of Microorganisms via Tandem Mass Spectrometry.
| S-EPMC10501333 | biostudies-literature

Cycloquest: identification of cyclopeptides via database search of their mass spectra against genome databases.
| S-EPMC3242011 | biostudies-literature