Analysis combined with fragment intensity predictions results in improved identification of classical bioactive peptides and sORF-encoded peptides
Ontology highlight
ABSTRACT: Bioactive peptides exhibit key roles in a wide variety of complex processes, such as regulation of body weight, learning, aging and innate immune response. Next to the classical bioactive peptides, emerging from larger precursor proteins by specific proteolytic processing, a new class of bio-active peptides originating from small open reading frames (sORFs) have been recognized as important biological regulators. But their intrinsic properties, specific expression pattern and location on presumed non-coding regions have hindered the full characterization of the repertoire of bioactive peptides, despite their predominant role in various pathways. Although the development of peptidomics has offered the opportunity to study these peptides in vivo, it remains challenging to identify the full peptidome as the lack of cleavage enzyme specification complicates conventional database search approaches. In this study, we introduce a proteogenomics methodology using a new type of mass spectrometry instrument and the implementation of machine learning tools towards improved identification of bioactive peptides in the mouse brain. The application of trapped ion mobility spectrometry (tims) coupled to a time-of-flight mass analyzer (TOF) offers improved sensitivity, an enhanced peptide coverage, reduction in chemical noise and the occurrence of chimeric spectra. Subsequent machine learning tools MS2PIP, predicting fragment ion intensities and DeepLC, predicting retention times, improve the database searching based on a large and comprehensive custom database containing both sORFs and alternative ORFs. Finally, the identification of peptides is further enhanced by applying the post-processing semi-supervised learning tool Percolator. Applying this workflow, we identified 48 sORF-encoded peptides originating from presumed non-coding locations, next to identifying 66 known neuropeptides from within 22 different families. Altogether, this robust pipeline fuses technological advancements from different fields ensuring an improved coverage of the neuropeptidome in the mouse brain.
INSTRUMENT(S): timsTOF Pro
ORGANISM(S): Mus Musculus (mouse)
TISSUE(S): Brain
SUBMITTER: Kurt Boonen
LAB HEAD: Kurt Boonen
PROVIDER: PXD026584 | Pride | 2021-10-11
REPOSITORIES: Pride
ACCESS DATA