Dataset Information

Systematic Errors in Peptide and Protein Identification and Quantification by Modified Peptides.


ABSTRACT: The principle of shotgun proteomics is to use peptide mass spectra in order to identify corresponding sequences in a protein database. The quality of peptide and protein identification and quantification critically depends on the sensitivity and specificity of this assignment process. Many peptides in proteomic samples carry biochemical modifications, and a large fraction of unassigned spectra arise from modified peptides. Spectra derived from modified peptides can erroneously be assigned to wrong amino acid sequences. However, the impact of this problem on proteomic data has not yet been investigated systematically. Here we use combinations of different database searches to show that modified peptides can be responsible for 20-50% of false positive identifications in deep proteomic data sets. These false positive hits are particularly problematic as they have significantly higher scores and higher intensities than other false positive matches. Furthermore, these wrong peptide assignments lead to hundreds of false protein identifications and systematic biases in protein quantification. We devise a "cleaned search" strategy to address this problem and show that this considerably improves the sensitivity and specificity of proteomic data. In summary, we show that modified peptides cause systematic errors in peptide and protein identification and quantification and should therefore be considered to further improve the quality of proteomic data annotation.

SUBMITTER: Bogdanow B 

PROVIDER: S-EPMC4974352 | biostudies-literature | 2016 Aug

REPOSITORIES: biostudies-literature

Publications

Systematic Errors in Peptide and Protein Identification and Quantification by Modified Peptides.

Boris Bogdanow, Henrik Zauber, Matthias Selbach

Molecular & Cellular Proteomics: MCP, 2016-05-23, issue 8


Similar Datasets

| S-EPMC2868505 | biostudies-literature
| S-EPMC8075102 | biostudies-literature
2019-06-20 | GSE126405 | GEO
| S-EPMC1940031 | biostudies-literature
| S-EPMC7115945 | biostudies-literature
| S-EPMC3319072 | biostudies-other
| S-EPMC7426425 | biostudies-literature
2012-03-03 | E-GEOD-36217 | biostudies-arrayexpress
| S-EPMC3317402 | biostudies-literature
| S-EPMC3006185 | biostudies-literature