Dataset Information

Dinosaur: A Refined Open-Source Peptide MS Feature Detector.


ABSTRACT: In bottom-up mass spectrometry (MS)-based proteomics, peptide isotopic and chromatographic traces (features) are frequently used for label-free quantification in data-dependent acquisition MS, but they can also be used to improve the identification of chimeric spectra or to characterize sample complexity. Feature detection is difficult because MS proteomics data from biological samples are highly complex, which frequently causes features to intermingle. In addition, existing feature detection algorithms commonly suffer from compatibility issues, long computation times, or poor performance on high-resolution data. Because of these limitations, we developed a new tool, Dinosaur, with increased speed and versatility. Dinosaur can sample algorithm computations through quality-control plots, which we call a plot trail. Based on the evaluation of this plot trail, we introduced several algorithmic improvements that further increase the robustness and performance of Dinosaur: it detected features for 98% of MS/MS identifications in a benchmark data set, whereas no other algorithm tested in this study exceeded 96%. Finally, we used Dinosaur to reimplement a published workflow for peptide identification in chimeric spectra, increasing chimeric identification from 26% to 32% over the standard workflow. Dinosaur is operating-system-independent and is freely available as open source at https://github.com/fickludd/dinosaur.

SUBMITTER: Teleman J 

PROVIDER: S-EPMC4933939 | biostudies-literature | 2016 Jul

REPOSITORIES: biostudies-literature

Publications

Dinosaur: A Refined Open-Source Peptide MS Feature Detector.

Johan Teleman, Aakash Chawade, Marianne Sandin, Fredrik Levander, Johan Malmström

Journal of Proteome Research, 2016-06-08, issue 7


Similar Datasets

2016-06-08 | PXD003405 | Pride
| S-EPMC6620392 | biostudies-literature
| S-EPMC5860051 | biostudies-literature
| S-EPMC8002403 | biostudies-literature
| S-EPMC6868186 | biostudies-literature
| S-EPMC4547611 | biostudies-literature
| S-EPMC8816939 | biostudies-literature
| S-EPMC7710140 | biostudies-literature
| S-EPMC10310047 | biostudies-literature
| S-EPMC9710787 | biostudies-literature