Dataset Information

Combining de novo peptide sequencing algorithms, a synergistic approach to boost both identifications and confidence in bottom-up proteomics

ABSTRACT: Complex MS-based proteomics datasets are usually analyzed by protein database searches. While this approach performs well for sequenced organisms, direct inference of peptide sequences from tandem mass spectra, i.e. de novo peptide sequencing, is often the only way to obtain information when protein databases are absent. However, available algorithms suffer from drawbacks such as a lack of validation and often high rates of false positive hits (FP). Here we present a simple method of combining results from commonly available de novo peptide sequencing algorithms which, in conjunction with minor tweaks in data acquisition, yields a lower empirical FDR compared to analysis using single algorithms. Results were validated using state-of-the-art database search algorithms as well as specifically synthesized reference peptides. With this approach, we increased the number of PSMs meeting a stringent FDR of 5% more than threefold compared to the single best de novo sequencing algorithm alone, with an average of 11,120 PSMs (combined) instead of 3,476 PSMs (alone) in triplicate 2 h LC-MS runs of a tryptic HeLa digest.
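The abstract does not say how the results of the individual algorithms are combined or how the empirical FDR is computed. The following is only a minimal sketch under two assumptions that are not taken from the source: a de novo sequence is accepted for a spectrum when at least two algorithms agree on it, and the empirical FDR is the fraction of accepted PSMs that contradict a database search result for the same spectrum. The algorithm names, scan numbers, and peptide sequences are hypothetical toy data. Isoleucine and leucine are treated as identical because they are isobaric. The last line simply checks the "more than threefold" figure from the reported PSM counts.

```python
from collections import Counter


def normalize(seq):
    # Leucine (L) and isoleucine (I) have the same mass, so they are
    # treated as one residue when comparing sequences.
    return seq.upper().replace("I", "L")


def consensus_psms(results, min_agree=2):
    """Keep one sequence per scan if at least `min_agree` algorithms agree.

    results: {algorithm_name: {scan_id: peptide_sequence}}
    returns: {scan_id: normalized_peptide_sequence}
    """
    votes = {}
    for calls in results.values():
        for scan, seq in calls.items():
            votes.setdefault(scan, Counter())[normalize(seq)] += 1

    accepted = {}
    for scan, counter in votes.items():
        seq, count = counter.most_common(1)[0]
        if count >= min_agree:
            accepted[scan] = seq
    return accepted


def empirical_fdr(accepted, reference):
    """Fraction of accepted PSMs that disagree with the reference result.

    Only scans that also have a reference (e.g. database search) result
    are counted. This definition is an assumption of this sketch.
    """
    checked = [scan for scan in accepted if scan in reference]
    if not checked:
        return 0.0
    wrong = sum(accepted[scan] != normalize(reference[scan]) for scan in checked)
    return wrong / len(checked)


# Hypothetical toy input: three algorithms, three spectra.
results = {
    "algo_a": {1: "PEPTIDEK", 2: "ELVISK", 3: "SAMPLER"},
    "algo_b": {1: "PEPTLDEK", 2: "ELVLSR", 3: "SAMPIER"},
    "algo_c": {1: "PEPTIDEK", 2: "LEVISK"},
}
reference = {1: "PEPTIDEK", 2: "ELVISK", 3: "SAMPLEK"}

accepted = consensus_psms(results)
fdr = empirical_fdr(accepted, reference)

# Fold increase implied by the PSM counts reported in the abstract.
fold = 11120 / 3476
print(accepted, fdr, round(fold, 2))
```

In the toy data, scan 2 is dropped because no two algorithms agree, and scan 3 is accepted but contradicts the reference, which is what drives the empirical FDR in this sketch. A real evaluation would need many more spectra and a properly validated reference set.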

INSTRUMENT(S):

ORGANISM(S): Homo sapiens (human), Saccharomyces cerevisiae (baker's yeast), Radix auricularia, Mus musculus (mouse)

TISSUE(S): Whole Body, Cell Culture, HeLa Cell

SUBMITTER: Bernhard Blank-Landeshammer

LAB HEAD: Prof. Dr. Albert Sickmann

PROVIDER: PXD005280 | Pride | 2019-09-25

REPOSITORIES: Pride

ACCESS DATA

Dataset's files

File	Type
C2C12_1_qExHF01_02588.msf	Msf
C2C12_1_qExHF01_02588.raw	Raw
C2C12_2_qExHF01_02596.msf	Msf
C2C12_2_qExHF01_02596.raw	Raw
C2C12_3_qExHF01_02601.msf	Msf

Showing files 1 - 5 of 24

Publications

Combining De Novo Peptide Sequencing Algorithms, A Synergistic Approach to Boost Both Identifications and Confidence in Bottom-up Proteomics.

Blank-Landeshammer B, Kollipara L, Biß K, Pfenninger M, Malchow S, Shuvaev K, Zahedi RP, Sickmann A

Journal of Proteome Research, 2017-08-22, issue 9

Complex mass spectrometry based proteomics data sets are mostly analyzed by protein database searches. While this approach performs considerably well for sequenced organisms, direct inference of peptide sequences from tandem mass spectra, i.e., de novo peptide sequencing, oftentimes is the only way to obtain information when protein databases are absent. However, available algorithms suffer from drawbacks such as lack of validation and often high rates of false positive hits (FP). [...]

PMID: 28741358

Similar Datasets

Application of de novo sequencing to large-scale complex proteomics datasets
2016-01-12 | PXD003317 | Pride

E. coli mirror LC-MS/MS - Precision de novo peptide sequencing using mirror proteases of Ac-LysargiNase and trypsin for large-scale proteomics
2019-01-11 | PXD008688 | Pride

Monoclonal antibody LC-MS/MS - Precision de novo peptide sequencing using mirror proteases of Ac-LysargiNase and trypsin for large-scale proteomics
2019-01-11 | PXD008690 | Pride

NovoBoard: a comprehensive framework for evaluating the false discovery rate and accuracy of de novo peptide sequencing
2024-08-28 | PXD055277 | Pride

Yeast mirror LC-MS/MS - Precision de novo peptide sequencing using mirror proteases of Ac-LysargiNase and trypsin for large-scale proteomics
2019-01-11 | PXD011562 | Pride

full length protein sequencing method based on less specific and unspecific hydrolysis strategies 2
2022-02-15 | PXD030203 | Pride

Identification of Unknown Biological Toxin Protein Using Mass Spectrometry: A Case Study on De Novo Sequencing of Ricin
2025-12-01 | PXD061213 | Pride

De novo assembly of siRNA immunity in wild plants
2012-06-01 | E-GEOD-22079 | biostudies-arrayexpress

Spatial Regulation Dominates Gene Function in the Ganglia Chain
2013-12-04 | E-GEOD-45569 | biostudies-arrayexpress

Mass spectrometry provides a highly sensitive non-invasive means of sequencing and tracking M-protein levels in blood of multiple myeloma patients
2021-07-16 | PXD022784 | Pride