Unknown

Dataset Information

0

Enhancing peptide identification confidence by combining search methods.


ABSTRACT: Confident peptide identification is one of the most important components in mass-spectrometry-based proteomics. We propose a method to properly combine the results from different database search methods to enhance the accuracy of peptide identifications. The database search methods included in our analysis are SEQUEST (v27 rev12), ProbID (v1.0), InsPecT (v20060505), Mascot (v2.1), X! Tandem (v2007.07.01.2), OMSSA (v2.0) and RAId_DbS. Using two data sets, one collected in profile mode and one collected in centroid mode, we tested the search performance of all 21 combinations of two search methods as well as all 35 possible combinations of three search methods. The results obtained from our study suggest that properly combining search methods does improve retrieval accuracy. In addition to performance results, we also describe the theoretical framework which in principle allows one to combine many independent scoring methods including de novo sequencing and spectral library searches. The correlations among different methods are also investigated in terms of common true positives, common false positives, and a global analysis. We find that the average correlation strength, between any pairwise combination of the seven methods studied, is usually smaller than the associated standard error. This indicates only weak correlation may be present among different methods and validates our approach in combining the search results. The usefulness of our approach is further confirmed by showing that the average cumulative number of false positive peptides agrees reasonably well with the combined E-value. The data related to this study are freely available upon request.

SUBMITTER: Alves G 

PROVIDER: S-EPMC2658881 | biostudies-literature | 2008 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

Enhancing peptide identification confidence by combining search methods.

Alves Gelio G   Wu Wells W WW   Wang Guanghui G   Shen Rong-Fong RF   Yu Yi-Kuo YK  

Journal of proteome research 20080618 8


Confident peptide identification is one of the most important components in mass-spectrometry-based proteomics. We propose a method to properly combine the results from different database search methods to enhance the accuracy of peptide identifications. The database search methods included in our analysis are SEQUEST (v27 rev12), ProbID (v1.0), InsPecT (v20060505), Mascot (v2.1), X! Tandem (v2007.07.01.2), OMSSA (v2.0) and RAId_DbS. Using two data sets, one collected in profile mode and one col  ...[more]

Similar Datasets

| S-EPMC11636256 | biostudies-literature
2017-01-02 | PXD005118 | Pride
2018-10-04 | GSE112623 | GEO
| S-EPMC7786379 | biostudies-literature
| S-EPMC10656014 | biostudies-literature
| S-EPMC3676282 | biostudies-literature
| S-EPMC11507693 | biostudies-literature
| S-EPMC9903325 | biostudies-literature
| S-EPMC3212438 | biostudies-literature
| S-EPMC6011013 | biostudies-literature