Proteomics

Dataset Information

0

Small Protein Enrichment-Based Proteogenomics Identifies Plentiful Missing Proteins and Three Novel sORFs in Saccharomyces cerevisiae


ABSTRACT: Small proteins (SPs) are defined as peptides of 100 amino acids or less encoded by short open reading frames (sORFs). Recent studies have shown that SPs are involved in many important biological processes, including cell signaling, metabolism, growth and so on. However, most of the annotated SPs in almost all species are currently lacking the evidence for protein existence and are regard as missing proteins (MPs). In addition, more and more mis-annotated sORFs have been discovered by proteogenomic methods in human, mice and even the well-characterised Saccharomyces cerevisiae. This reveals a blind spot in traditional gene annotation technology for those SPs-encoding genes. Because SPs are short and generally low abundant, SPs identification using proteomics faces challenges. A deeper coverage of SPs identification may help validate more MPs and discover more potential mis-annotated sORFs. Here, we applied a SPs enrichment-based proteogenomic strategy to Saccharomyces cerevisiae. By integrating four different SPs enrichment methods, we have successfully validated 31 MPs and discovered 3 novel sORFs (YKL104W-A, YHR052C-B and YHR054C-B) which were verified by novel peptide synthesis. In-depth analysis of our SPs enrichment datasets also reveals a series of special physicochemical and biological characteristics of SPs and particular rules of SPs identification. Based on these, we then systematically conclude the difficulties, causes and solutions in SPs identification.

INSTRUMENT(S): LTQ Orbitrap Velos

ORGANISM(S): Saccharomyces Cerevisiae (baker's Yeast)

SUBMITTER: cuitong he  

LAB HEAD: Ping Xu

PROVIDER: PXD008586 | Pride | 2018-06-15

REPOSITORIES: Pride

Dataset's files

Source:
Action DRS
HCl_MWCO.zip Other
HCl_Tricine.zip Other
Urea_MWCO.zip Other
Urea_Tricine.zip Other
Y_S_H.raw Raw
Items per page:
1 - 5 of 24

Similar Datasets

2023-02-14 | GSE206492 | GEO
2019-08-08 | PXD007973 | Pride
2019-06-20 | E-MTAB-8039 | biostudies-arrayexpress
2024-08-02 | PXD045039 | Pride
2019-08-08 | PXD007923 | Pride
2019-08-08 | PXD007922 | Pride
2018-10-04 | PXD010093 | Pride
2024-05-23 | PXD036790 | Pride
2015-07-28 | PXD002583 | Pride
2024-04-05 | E-MTAB-13698 | biostudies-arrayexpress