Unknown

Dataset Information

0

SPEQ: quality assessment of peptide tandem mass spectra with deep learning.


ABSTRACT:

Motivation

In proteomics, database search programs are routinely used for peptide identification from tandem mass spectrometry data. However, many low-quality spectra cannot be interpreted by any programs. Meanwhile, certain high-quality spectra may not be identified due to incompleteness of the database or failure of the software. Thus, spectrum quality (SPEQ) assessment tools are helpful programs that can eliminate poor-quality spectra before the database search and highlight the high-quality spectra that are not identified in the initial search. These spectra may be valuable candidates for further analyses.

Results

We propose SPEQ: a spectrum quality assessment tool that uses a deep neural network to classify spectra into high-quality, which are worthy candidates for interpretation, and low-quality, which lack sufficient information for identification. SPEQ was compared with a few other prediction models and demonstrated improved prediction accuracy.

Availability and implementation

Source code and scripts are freely available at github.com/sor8sh/SPEQ, implemented in Python.

SUBMITTER: Gholamizoj S 

PROVIDER: S-EPMC8896601 | biostudies-literature | 2022 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

SPEQ: quality assessment of peptide tandem mass spectra with deep learning.

Gholamizoj Soroosh S   Ma Bin B  

Bioinformatics (Oxford, England) 20220301 6


<h4>Motivation</h4>In proteomics, database search programs are routinely used for peptide identification from tandem mass spectrometry data. However, many low-quality spectra cannot be interpreted by any programs. Meanwhile, certain high-quality spectra may not be identified due to incompleteness of the database or failure of the software. Thus, spectrum quality (SPEQ) assessment tools are helpful programs that can eliminate poor-quality spectra before the database search and highlight the high-  ...[more]

Similar Datasets

| S-EPMC3128668 | biostudies-literature
2019-05-30 | PXD010871 | Pride
| S-EPMC8556919 | biostudies-literature
| S-EPMC2938093 | biostudies-literature
| S-EPMC11436754 | biostudies-literature
2019-05-21 | PXD010595 | Pride
| S-EPMC5297990 | biostudies-literature
| S-EPMC3166376 | biostudies-literature
| S-EPMC11890581 | biostudies-literature
2021-03-12 | MSV000087047 | MassIVE