Proteomics

Dataset Information

0

Open-pFind enables precise, comprehensive and rapid peptide identification in shotgun proteomics, part 2


ABSTRACT: Shotgun proteomics has grown rapidly in recent decades, but a large fraction of tandem mass spectrometry (MS/MS) data in shotgun proteomics are not successfully identified. We have developed a novel database search algorithm, Open-pFind, to efficiently identify peptides even in an ultra-large search space which takes into account unexpected modifications, amino acid mutations, semi- or non-specific digestion and co-eluting peptides. Tested on two metabolically labeled MS/MS datasets, Open-pFind reported 50.5‒117.0% more peptide-spectrum matches (PSMs) than the seven other advanced algorithms. More importantly, the Open-pFind results were more credible judged by the verification experiments using stable isotopic labeling. Tested on four additional large-scale datasets, 70‒85% of the spectra were confidently identified, and high-quality spectra were nearly completely interpreted by Open-pFind. Further, Open-pFind was over 40 times faster than the other three open search algorithms and 2‒3 times faster than three restricted search algorithms. Re-analysis of an entire human proteome dataset consisting of ~25 million spectra using Open-pFind identified a total of 14,064 proteins encoded by 12,723 genes by requiring at least two uniquely identified peptides. In this search results, Open-pFind also excelled in an independent test for false positives based on the presence or absence of olfactory receptors. Thus, a practical use of the open search strategy has been realized by Open-pFind for the truly global-scale proteomics experiments of today and in the future.

INSTRUMENT(S): Q Exactive

ORGANISM(S): Saccharomyces Cerevisiae (baker's Yeast)

SUBMITTER: Hao Chi  

LAB HEAD: Si-Min He

PROVIDER: PXD008783 | Pride | 2018-07-11

REPOSITORIES: Pride

Dataset's files

Source:
Action DRS
10.raw Raw
14.raw Raw
21.raw Raw
22.raw Raw
23.raw Raw
Items per page:
1 - 5 of 25

Similar Datasets

2018-07-11 | PXD008782 | Pride
2019-11-04 | PXD015759 | Pride
2019-03-12 | PXD005912 | Pride
2022-02-24 | PXD025019 | Pride
2024-09-04 | PXD045561 | Pride
2022-03-04 | PXD031032 | Pride
2010-01-07 | E-TABM-638 | biostudies-arrayexpress
2016-10-12 | PXD003759 | Pride
2021-05-25 | PXD009861 | Pride
2022-02-04 | PXD030081 | Pride