Proteomics

Dataset Information

Multi-protease approach for the improved identification and molecular characterization of small proteins and short open reading frame-encoded peptides


ABSTRACT: The identification of proteins below 70 amino acids in bottom-up proteomics is still a challenging task due to the limited number of peptides generated by proteolytic digestion. This includes the short open reading frame-encoded peptides (SEP), which are a subset of the small proteins that were not previously annotated or that are alternatively encoded. Here, we systematically investigated the use of multiple proteases (trypsin, chymotrypsin, LysC, LysargiNase and GluC) in GeLC-MS/MS analysis to improve the sequence coverage and the number of identified peptides for small proteins (<70 amino acids), with a focus on SEP, in the archaeon Methanosarcina mazei. Combining the data of all proteases, we identified 63 small proteins and an additional 28 SEP with at least two unique peptides, while only 55 small proteins and 22 SEP could be identified using trypsin only. For 27 small proteins and 12 SEP, 100% sequence coverage was achieved. Moreover, for five SEP, incorrectly predicted translation start points were identified, confirming the data of a previous top-down proteomics study of this organism. The results clearly show that a multi-protease approach can improve the identification and molecular characterization of small proteins and SEP.
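To illustrate why combining proteases can raise sequence coverage for short proteins, here is a minimal in silico digestion sketch. It is not the authors' pipeline. The function names (`digest`, `coverage`), the simplified cleavage rules (no missed cleavages, no proline exceptions, GluC cutting after E only, chymotrypsin after F/W/Y only) and the peptide length window are all assumptions made for illustration.

```python
# Simplified, assumed cleavage rules: (target residues, side).
# "C" = cut after the residue, "N" = cut before it.
RULES = {
    "trypsin": ("KR", "C"),
    "LysC": ("K", "C"),
    "GluC": ("E", "C"),
    "chymotrypsin": ("FWY", "C"),
    "LysargiNase": ("KR", "N"),
}


def digest(seq, protease):
    """Return (start, end) index pairs of peptides, assuming complete cleavage."""
    residues, side = RULES[protease]
    cuts = [0]
    for i, aa in enumerate(seq):
        if aa in residues:
            pos = i + 1 if side == "C" else i
            # Ignore cut positions at the very ends of the sequence.
            if 0 < pos < len(seq):
                cuts.append(pos)
    cuts.append(len(seq))
    return list(zip(cuts, cuts[1:]))


def coverage(seq, proteases, min_len=6, max_len=30):
    """Fraction of residues covered by peptides inside an assumed length window."""
    covered = [False] * len(seq)
    for protease in proteases:
        for start, end in digest(seq, protease):
            if min_len <= end - start <= max_len:
                for i in range(start, end):
                    covered[i] = True
    return sum(covered) / len(seq)
```

Calling `coverage(seq, ["trypsin"])` and then `coverage(seq, list(RULES))` on the same sequence shows how peptides that are too short or too long with one protease can be complemented by another. Real detectability also depends on ionization, fragmentation and the search settings, which this sketch does not model.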

INSTRUMENT(S): Q Exactive

ORGANISM(S): Methanosarcina mazei Go1

TISSUE(S): Cell Culture

SUBMITTER: Andreas Tholey  

LAB HEAD: Andreas Tholey

PROVIDER: PXD023921 | Pride | 2021-04-01

REPOSITORIES: Pride

Publications

Multi-protease Approach for the Improved Identification and Molecular Characterization of Small Proteins and Short Open Reading Frame-Encoded Peptides.

Philipp T. Kaulich, Liam Cassidy, Jürgen Bartel, Ruth A. Schmitz, Andreas Tholey

Journal of Proteome Research, 2021-03-24, issue 5


The identification of proteins below approximately 70-100 amino acids in bottom-up proteomics is still a challenging task due to the limited number of peptides generated by proteolytic digestion. This includes the short open reading frame-encoded peptides (SEPs), which are a subset of the small proteins that were not previously annotated or that are alternatively encoded. Here, we systematically investigated the use of multiple proteases (trypsin, chymotrypsin, LysC, LysargiNase, and GluC) in Ge ...

Similar Datasets

2023-07-06 | PXD018269 | Pride
2023-09-19 | PXD041979 | Pride
2021-11-02 | PXD019792 | Pride
2009-08-05 | E-MEXP-2108 | biostudies-arrayexpress
2024-09-24 | PXD055748 | Pride
2024-09-24 | PXD055745 | Pride
2016-09-26 | PXD004325 | Pride
2011-10-01 | E-MTAB-787 | biostudies-arrayexpress
2009-04-15 | E-GEOD-11174 | biostudies-arrayexpress
2010-07-01 | E-GEOD-21175 | biostudies-arrayexpress