Dataset Information

Matching cross-linked peptide spectra: only as good as the worse identification.

ABSTRACT: Chemical cross-linking mass spectrometry identifies interacting surfaces within a protein assembly through labeling with bifunctional reagents and identifying the covalently modified peptides. These yield distance constraints that provide a powerful means to model the three-dimensional structure of the assembly. Bioinformatic analysis of cross-linked data resulting from large protein assemblies is challenging because each cross-linked product contains two covalently linked peptides, each of which must be correctly identified from a complex matrix of potential confounders. Protein Prospector addresses these issues through a complementary mass modification strategy in which each peptide is searched and identified separately. We demonstrate this strategy with an analysis of RNA polymerase II. False discovery rates (FDRs) are assessed via comparison of cross-linking data to crystal structure, as well as by using a decoy database strategy. Parameters that are most useful for positive identification of cross-linked spectra are explored. We find that fragmentation spectra generally contain more product ions from one of the two peptides constituting the cross-link. Hence, metrics reflecting the quality of the spectral match to the less confident peptide provide the most discriminatory power between correct and incorrect matches. A support vector machine model was built to further improve classification of cross-linked peptide hits. Furthermore, the frequency with which peptides cross-linked via common acylating reagents fragment to produce diagnostic, cross-linker-specific ions is assessed. The threshold for successful identification of the cross-linked peptide product depends upon the complexity of the sample under investigation. Protein Prospector, by focusing the reliability assessment on the least confident peptide, is better able to control the FDR for results as larger complexes and databases are analyzed. In addition, when FDR thresholds are calculated separately for intraprotein and interprotein results, a further improvement in the number of unique cross-links confidently identified is achieved. These improvements are demonstrated on two previously published cross-linking datasets.
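
The idea of ranking a cross-linked match by its less confident peptide, then estimating the FDR from decoy hits, can be illustrated with a minimal sketch. This is not Protein Prospector code. The class `CrossLinkHit`, the functions `worse_score` and `estimated_fdr`, the example scores, and the estimator (TD - DD) / TT (a formula commonly used for cross-link target-decoy searches) are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class CrossLinkHit:
    """Hypothetical record for one cross-linked spectrum match."""
    score_a: float  # match score of the first peptide
    score_b: float  # match score of the second peptide
    decoys: int     # number of decoy peptides in the pair: 0, 1, or 2


def worse_score(hit: CrossLinkHit) -> float:
    # The pair is only as reliable as its less confident peptide.
    return min(hit.score_a, hit.score_b)


def estimated_fdr(hits: list[CrossLinkHit], threshold: float) -> float:
    """Estimate FDR among hits whose worse peptide score passes the threshold.

    Assumed estimator: (TD - DD) / TT, where TT, TD, and DD count
    target-target, target-decoy, and decoy-decoy pairs respectively.
    """
    passing = [h for h in hits if worse_score(h) >= threshold]
    tt = sum(h.decoys == 0 for h in passing)
    td = sum(h.decoys == 1 for h in passing)
    dd = sum(h.decoys == 2 for h in passing)
    if tt == 0:
        # No target-target hits: 0.0 if nothing passed, else treat as all false.
        return 0.0 if not passing else 1.0
    return max(td - dd, 0) / tt


# Made-up example data, not taken from the dataset.
hits = [
    CrossLinkHit(30.0, 25.0, 0),
    CrossLinkHit(40.0, 5.0, 0),   # one strong peptide, one weak peptide
    CrossLinkHit(20.0, 18.0, 1),
    CrossLinkHit(35.0, 30.0, 0),
    CrossLinkHit(12.0, 10.0, 2),
    CrossLinkHit(28.0, 22.0, 0),
]

for cutoff in (15.0, 20.0):
    print(cutoff, estimated_fdr(hits, cutoff))
```

Under the same assumptions, separate thresholds for intraprotein and interprotein results could be obtained by calling `estimated_fdr` on each subset of hits independently.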

SUBMITTER: Trnka MJ

PROVIDER: S-EPMC3916644 | biostudies-literature | 2014 Feb

REPOSITORIES: biostudies-literature

Publications

Matching cross-linked peptide spectra: only as good as the worse identification.

Trnka MJ, Baker PR, Robinson PJ, Burlingame AL, Chalkley RJ

Molecular & Cellular Proteomics : MCP, 2013-12-12, issue 2

PMID: 24335475

Similar Datasets

Matching Cross-linked Peptide Spectra: Only as Good as the Worse Identification
2014-06-02 | MSV000078693 | MassIVE

Peptide identification from mixture tandem mass spectra.
| S-EPMC2938093 | biostudies-literature

Mango: A General Tool for Collision Induced Dissociation-Cleavable Cross-Linked Peptide Identification.
| S-EPMC5959040 | biostudies-literature

SQID-XLink: implementation of an intensity-incorporated algorithm for cross-linked peptide identification.
| S-EPMC3463113 | biostudies-literature

Estimation of peptide elongation times from ribosome profiling spectra
2021-04-23 | GSE145571 | GEO

Faster SEQUEST searching for peptide identification from tandem mass spectra.
| S-EPMC3166376 | biostudies-literature

Submodular Generalized Matching for Peptide Identification in Tandem Mass Spectrometry.
| S-EPMC8641787 | biostudies-literature

In vivo protein interaction network identified with a novel real-time cross-linked peptide identification strategy.
| S-EPMC3925062 | biostudies-literature

Identification of Cross-linked Peptides Using Isotopomeric Cross-linkers.
| S-EPMC7069596 | biostudies-literature

CharmeRT: Boosting Peptide Identifications by Chimeric Spectra Identification and Retention Time Prediction.
| S-EPMC6079931 | biostudies-literature