NovoBoard: a comprehensive framework for evaluating the false discovery rate and accuracy of de novo peptide sequencing
ABSTRACT: De novo peptide sequencing is a fundamental research area in mass spectrometry (MS)-based proteomics. However, de novo sequencing methods have often been evaluated using only a few simple metrics that do not fully reflect their overall performance. Moreover, there has been no established method to estimate the false discovery rate (FDR) and the significance of de novo peptide-spectrum matches (PSMs). Here we propose NovoBoard, a comprehensive framework to evaluate the performance of de novo peptide sequencing methods. The framework consists of diverse benchmark datasets (including tryptic, nontryptic, immunopeptidomics, and different species) and a standard set of accuracy metrics to evaluate the fragment ions, amino acids, and peptides of the de novo results. More importantly, a new approach is designed to evaluate de novo peptide sequencing methods on target-decoy spectra and to estimate their FDRs. Our results reveal the strengths and weaknesses of different de novo peptide sequencing methods, and how their performance depends on the specific application and the type of data. Our FDR estimation also shows that some tools may perform better than others in distinguishing between de novo PSMs and random matches, and it can be used to assess the significance of de novo PSMs.
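To illustrate the target-decoy idea mentioned in the abstract, the following is a minimal sketch of the conventional target-decoy FDR estimate (decoy matches divided by target matches above a score threshold). It is not NovoBoard's actual procedure, which this record does not describe in detail. The function name `estimate_fdr`, the `(score, is_decoy)` tuple layout, and the convention that higher scores are better are all assumptions made for illustration.

```python
from typing import List, Tuple


def estimate_fdr(psms: List[Tuple[float, bool]], threshold: float) -> float:
    """Estimate FDR among PSMs scoring at or above `threshold`.

    Each PSM is a hypothetical (score, is_decoy) pair, where is_decoy
    marks a match to a decoy spectrum. Higher scores are assumed to be
    better. The estimate is (#decoy PSMs) / (#target PSMs), capped at 1.
    """
    targets = sum(1 for score, is_decoy in psms
                  if score >= threshold and not is_decoy)
    decoys = sum(1 for score, is_decoy in psms
                 if score >= threshold and is_decoy)
    if targets == 0:
        # No target PSMs pass the threshold, so there is nothing to report.
        return 0.0
    return min(1.0, decoys / targets)
```

In practice one would sweep `threshold` over the observed scores and keep the most permissive value whose estimated FDR stays below a chosen level such as 1%.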
INSTRUMENT(S): Q Exactive
ORGANISM(S): Homo sapiens (human)
SUBMITTER: Ngoc Hieu Tran
LAB HEAD: Baozhen Shan
PROVIDER: PXD055277 | PRIDE | 2024-08-28
REPOSITORIES: PRIDE