Unknown

Dataset Information

0

Systematic prediction of degrons and E3 ubiquitin ligase binding via deep learning


ABSTRACT:

Background

Degrons are short linear motifs, bound by E3 ubiquitin ligase to target protein substrates to be degraded by the ubiquitin-proteasome system. Mutations leading to deregulation of degron functionality disrupt control of protein abundance due to mistargeting of proteins destined for degradation and often result in pathologies. Targeting degrons by small molecules also emerges as an exciting drug design strategy to upregulate the expression of specific proteins. Despite their essential function and disease targetability, reliable identification of degrons remains a conundrum. Here, we developed a deep learning-based model named Degpred that predicts general degrons directly from protein sequences.

Results

We showed that the BERT-based model performed well in predicting degrons singly from protein sequences. Then, we used the deep learning model Degpred to predict degrons proteome-widely. Degpred successfully captured typical degron-related sequence properties and predicted degrons beyond those from motif-based methods which use a handful of E3 motifs to match possible degrons. Furthermore, we calculated E3 motifs using predicted degrons on the substrates in our collected E3-substrate interaction dataset and constructed a regulatory network of protein degradation by assigning predicted degrons to specific E3s with calculated motifs. Critically, we experimentally verified that a predicted SPOP binding degron on CBX6 prompts CBX6 degradation and mediates the interaction with SPOP. We also showed that the protein degradation regulatory system is important in tumorigenesis by surveying degron-related mutations in TCGA.

Conclusions

Degpred provides an efficient tool to proteome-wide prediction of degrons and binding E3s singly from protein sequences. Degpred successfully captures typical degron-related sequence properties and predicts degrons beyond those from previously used motif-based methods, thus greatly expanding the degron landscape, which should advance the understanding of protein degradation, and allow exploration of uncharacterized alterations of proteins in diseases. To make it easier for readers to access collected and predicted datasets, we integrated these data into the website http://degron.phasep.pro/.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12915-022-01364-6.

SUBMITTER: Hou C 

PROVIDER: S-EPMC9281121 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC8198723 | biostudies-literature
| S-EPMC5777250 | biostudies-literature
| S-EPMC2795488 | biostudies-literature
| S-EPMC4693525 | biostudies-literature
| S-EPMC3387815 | biostudies-literature
| S-EPMC8586995 | biostudies-literature
| S-EPMC6557245 | biostudies-literature
| S-EPMC4886359 | biostudies-literature
| S-EPMC3556937 | biostudies-literature
| S-EPMC6141542 | biostudies-literature