Proteomics

Dataset Information

0

Fast open modification spectral library searching through approximate nearest neighbor indexing


ABSTRACT: Open modification searching (OMS) is a powerful search strategy that identifies peptides carrying any type of modification by allowing a modified spectrum to match against its unmodified variant by using a very wide precursor mass window. A drawback of this strategy, however, is that it leads to a large increase in search time. Although performing an open search can be done using existing spectral library search engines by simply setting a wide precursor mass window, none of these tools have been optimized for OMS, leading to excessive runtimes and suboptimal identification results. This data set contains the evaluation results of the ANN-SoLo tool for fast and accurate open spectral library searching. ANN-SoLo uses approximate nearest neighbor indexing to speed up OMS by selecting only a limited number of the most relevant library spectra to compare to an unknown query spectrum. This approach is combined with a cascade search strategy to maximize the number of identified unmodified and modified spectra while strictly controlling the false discovery rate, as well as a shifted dot product score to sensitively match modified spectra to their unmodified counterparts. ANN-SoLo achieves state-of-the-art performance in terms of speed and the number of identifications. On a previously published human cell line data set, ANN-SoLo confidently identifies more spectra than SpectraST or MSFragger and achieves a speedup of an order of magnitude compared to SpectraST.

INSTRUMENT(S): TripleTOF 5600, Q Exactive

ORGANISM(S): Homo Sapiens (human) Saccharomyces Cerevisiae (baker's Yeast)

TISSUE(S): Cell Culture

SUBMITTER: Wout Bittremieux  

LAB HEAD: Kris Laukens

PROVIDER: PXD009861 | Pride | 2021-05-25

REPOSITORIES: Pride

Dataset's files

Source:
altmetric image

Publications

Fast Open Modification Spectral Library Searching through Approximate Nearest Neighbor Indexing.

Bittremieux Wout W   Meysman Pieter P   Noble William Stafford WS   Laukens Kris K  

Journal of proteome research 20180913 10


Open modification searching (OMS) is a powerful search strategy that identifies peptides carrying any type of modification by allowing a modified spectrum to match against its unmodified variant by using a very wide precursor mass window. A drawback of this strategy, however, is that it leads to a large increase in search time. Although performing an open search can be done using existing spectral library search engines by simply setting a wide precursor mass window, none of these tools have bee  ...[more]

Similar Datasets

2019-12-06 | PXD013641 | Pride
2015-06-03 | PXD001468 | Pride
2016-07-22 | PXD002613 | Pride
2012-03-08 | E-GEOD-36305 | biostudies-arrayexpress
2018-07-11 | PXD008783 | Pride
2018-07-11 | PXD008782 | Pride
2018-10-11 | PXD009900 | Pride
2019-10-21 | PXD009597 | Pride
2022-02-24 | PXD025019 | Pride
2014-09-01 | PXD001003 | Pride