Unknown

Dataset Information

0

TIDD: tool-independent and data-dependent machine learning for peptide identification.


ABSTRACT:

Background

In shotgun proteomics, database search engines have been developed to assign peptides to tandem mass (MS/MS) spectra and at the same time post-processing (or rescoring) approaches over the search results have been proposed to increase the number of confident peptide identifications. The most popular post-processing approaches such as Percolator and PeptideProphet have improved rates of peptide identifications by combining multiple scores from database search engines while applying machine learning techniques. Existing post-processing approaches, however, are limited when dealing with results from new search engines because their features for machine learning must be optimized specifically for each search engine.

Results

We propose a universal post-processing tool, called TIDD, which supports confident peptide identifications regardless of the search engine adopted. TIDD can work for any (including newly developed) search engines because it calculates universal features that assess peptide-spectrum match quality while it allows additional features provided by search engines (or users) as well. Even though it relies on universal features independent of search tools, TIDD showed similar or better performance than Percolator in terms of peptide identification. TIDD identified 10.23-38.95% more PSMs than target-decoy estimation for MSFragger, which is not supported by Percolator. TIDD offers an easy-to-use simple graphical user interface for user convenience.

Conclusions

TIDD successfully eliminated the requirement for an optimal feature engineering per database search tool, and thus, can be applied directly to any database search results including newly developed ones.

SUBMITTER: Li H 

PROVIDER: S-EPMC8969291 | biostudies-literature | 2022 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

TIDD: tool-independent and data-dependent machine learning for peptide identification.

Li Honglan H   Na Seungjin S   Hwang Kyu-Baek KB   Paek Eunok E  

BMC bioinformatics 20220330 1


<h4>Background</h4>In shotgun proteomics, database search engines have been developed to assign peptides to tandem mass (MS/MS) spectra and at the same time post-processing (or rescoring) approaches over the search results have been proposed to increase the number of confident peptide identifications. The most popular post-processing approaches such as Percolator and PeptideProphet have improved rates of peptide identifications by combining multiple scores from database search engines while appl  ...[more]

Similar Datasets

| S-EPMC6842143 | biostudies-literature
2021-07-26 | GSE175955 | GEO
| S-EPMC9421197 | biostudies-literature
| S-EPMC4857761 | biostudies-literature
| S-EPMC11681168 | biostudies-literature
| S-EPMC10165132 | biostudies-literature
| S-EPMC9206279 | biostudies-literature
| S-EPMC7131989 | biostudies-literature
2024-08-25 | PXD044852 | JPOST Repository
| S-EPMC9531543 | biostudies-literature