Dataset Information

Effective Leveraging of Targeted Search Spaces for Improving Peptide Identification in Tandem Mass Spectrometry Based Proteomics.


ABSTRACT: In shotgun proteomics, peptides are typically identified by database searching, which scores acquired tandem mass spectra against peptides derived from standard protein sequence databases such as UniProt, RefSeq, or Ensembl. In this strategy, the sensitivity of peptide identification is known to depend on the size of the search space. Creating a targeted sequence database that contains only peptides likely to be present in the analyzed sample can therefore improve the sensitivity of peptide identification. In this study, we describe how targeted peptide databases can be created based on the frequency of identification in the Global Proteome Machine Database (GPMDB), the largest publicly available repository of peptide and protein identification data. We demonstrate that targeted peptide databases can be easily integrated into existing proteome analysis workflows, and we describe a computational strategy for minimizing any loss of peptide identifications arising from incompleteness of the targeted search spaces. We demonstrate the performance of our workflow using several data sets of varying size and sample complexity.
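
As a rough illustration of the frequency-based selection the abstract describes, the sketch below keeps only peptides whose observation count meets a cutoff and writes them out as FASTA-style records. It is a minimal sketch, not the authors' implementation: the function names, the `min_observations` cutoff, the record naming, and the example counts are all assumptions made for illustration, and no real GPMDB interface or data is used.

```python
from typing import Dict, Iterable, List


def build_targeted_peptides(
    observation_counts: Dict[str, int], min_observations: int
) -> List[str]:
    """Return peptides observed at least `min_observations` times, sorted.

    `observation_counts` maps a peptide sequence to how often it was
    identified in some repository (hypothetical input format).
    """
    kept = [pep for pep, n in observation_counts.items() if n >= min_observations]
    return sorted(kept)


def to_fasta(peptides: Iterable[str], prefix: str = "targeted") -> str:
    """Render peptides as FASTA-style text with sequential, made-up headers."""
    lines = []
    for i, pep in enumerate(peptides, start=1):
        lines.append(f">{prefix}_{i}")
        lines.append(pep)
    return "\n".join(lines) + "\n"


# Made-up counts for illustration only; not taken from GPMDB.
example_counts = {
    "LVNELTEFAK": 120,
    "AEFVEVTK": 45,
    "YLYEIAR": 3,
    "QTALVELLK": 0,
}

targeted = build_targeted_peptides(example_counts, min_observations=10)
fasta_text = to_fasta(targeted)
print(fasta_text, end="")
```

Raising the cutoff shrinks the search space further but makes it more likely that a peptide truly present in the sample is excluded, which is the incompleteness trade-off the abstract refers to.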

SUBMITTER: Shanmugam AK 

PROVIDER: S-EPMC4748730 | biostudies-literature | 2015 Dec

REPOSITORIES: biostudies-literature

Publications

Effective Leveraging of Targeted Search Spaces for Improving Peptide Identification in Tandem Mass Spectrometry Based Proteomics.

Shanmugam AK, Nesvizhskii AI

Journal of Proteome Research, 2015 Nov 24; issue 12


Similar Datasets

S-EPMC5562220 | biostudies-literature
S-EPMC3322561 | biostudies-literature
S-EPMC7896416 | biostudies-literature
S-EPMC6800818 | biostudies-literature
S-EPMC2597439 | biostudies-literature
S-EPMC2830836 | biostudies-other
S-EPMC8341206 | biostudies-literature
S-EPMC3586292 | biostudies-literature
S-EPMC11294833 | biostudies-literature
S-EPMC3134881 | biostudies-literature