An adaptive pipeline to maximize isobaric tagging data in large-scale MS-based proteomics
Ontology highlight
ABSTRACT: Isobaric tagging is a method of choice in Mass Spectrometry (MS)-based proteomics for comparing multiple conditions at a time. Despite its multiplexing capabilities, when multiple experiments are merged for comparison in large sample-size studies, some drawbacks appears, due to the presence of missing values, which result from the stochastic nature of the Data-Dependent Acquisition (DDA) mode. Another indirect cause of data incompleteness might derive from the proteomic-typical data processing workflow that first identifies proteins in individual experiments and then only quantifies those identified proteins, leaving a large number of unmatched spectra with quantitative information unexploited. Inspired by untargeted metabolomic and label-free proteomic workflows, we developed a quantification-driven bioinformatic pipeline (Quantify then Identify – QtI) that optimizes the processing of isobaric Tandem Mass Tag (TMT) data from large-scale studies. This pipeline includes innovative modules, such as the Peptide Match Rescue (PMR) and the Optimized Post-Translational Modification (OPTM) and outperforms a classical benchmark workflow in terms of quantification and identification rates, significantly reducing missing data while preserving unmatched features for quantitative comparison. The number of unexploited tandem mass spectra was reduced by 77% and 62% for two human cerebrospinal fluid (CSF) and plasma datasets, respectively.
INSTRUMENT(S): LTQ Orbitrap Elite
ORGANISM(S): Homo Sapiens (human)
TISSUE(S): Blood Plasma, Cerebrospinal Fluid
SUBMITTER: Charlotte Macron
LAB HEAD: Loïc Dayon
PROVIDER: PXD008029 | Pride | 2018-05-01
REPOSITORIES: Pride
ACCESS DATA