Unknown

Dataset Information

0

A dynamic noise level algorithm for spectral screening of peptide MS/MS spectra.


ABSTRACT:

Background

High-throughput shotgun proteomics data contain a significant number of spectra from non-peptide ions or spectra of too poor quality to obtain highly confident peptide identifications. These spectra cannot be identified with any positive peptide matches in some database search programs or are identified with false positives in others. Removing these spectra can improve the database search results and lower computational expense.

Results

A new algorithm has been developed to filter tandem mass spectra of poor quality from shotgun proteomic experiments. The algorithm determines the noise level dynamically and independently for each spectrum in a tandem mass spectrometric data set. Spectra are filtered based on a minimum number of required signal peaks with a signal-to-noise ratio of 2. The algorithm was tested with 23 sample data sets containing 62,117 total spectra.

Conclusions

The spectral screening removed 89.0% of the tandem mass spectra that did not yield a peptide match when searched with the MassMatrix database search software. Only 6.0% of tandem mass spectra that yielded peptide matches considered to be true positive matches were lost after spectral screening. The algorithm was found to be very effective at removal of unidentified spectra in other database search programs including Mascot, OMSSA, and X!Tandem (75.93%-91.00%) with a small loss (3.59%-9.40%) of true positive matches.

SUBMITTER: Xu H 

PROVIDER: S-EPMC2939612 | biostudies-literature | 2010 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

A dynamic noise level algorithm for spectral screening of peptide MS/MS spectra.

Xu Hua H   Freitas Michael A MA  

BMC bioinformatics 20100823


<h4>Background</h4>High-throughput shotgun proteomics data contain a significant number of spectra from non-peptide ions or spectra of too poor quality to obtain highly confident peptide identifications. These spectra cannot be identified with any positive peptide matches in some database search programs or are identified with false positives in others. Removing these spectra can improve the database search results and lower computational expense.<h4>Results</h4>A new algorithm has been develope  ...[more]

Similar Datasets

| S-EPMC2974764 | biostudies-literature
| S-EPMC6731086 | biostudies-literature
| S-EPMC9400911 | biostudies-literature
| S-EPMC4256515 | biostudies-literature
| S-EPMC4119474 | biostudies-literature
| S-EPMC9903325 | biostudies-literature
| S-EPMC3221600 | biostudies-literature
| S-EPMC3712280 | biostudies-literature
| S-EPMC2621003 | biostudies-literature
| S-EPMC3489469 | biostudies-literature