Dataset Information

DeepCleave: a deep learning predictor for caspase and matrix metalloprotease substrates and cleavage sites.


ABSTRACT:

Motivation

Proteases are enzymes that cleave target substrate proteins by catalyzing the hydrolysis of peptide bonds between specific amino acids. Although functional proteolysis regulated by proteases plays a central role in 'life and death' cellular processes, many of the corresponding substrates and their cleavage sites have not yet been identified. Accurate predictors of substrates and cleavage sites would facilitate understanding of proteases' functions and physiological roles. Deep learning is a promising approach for developing accurate predictors of substrate cleavage events.

Results

We propose DeepCleave, the first deep learning-based predictor of protease-specific substrates and cleavage sites. DeepCleave takes protein substrate sequence data as input and employs convolutional neural networks with transfer learning to train accurate predictive models. The high predictive performance of our models stems from high-quality cleavage site features extracted from the substrate sequences through the deep learning process, and from the use of transfer learning, multiple kernels and an attention layer in the design of the deep network. Empirical tests against several related state-of-the-art methods demonstrate that DeepCleave outperforms these methods in predicting caspase and matrix metalloprotease substrate-cleavage sites.
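
As a rough illustration of how sequence data might be prepared for a convolutional predictor of this kind, the sketch below extracts fixed-length windows around candidate cleavage sites and one-hot encodes them. It is not DeepCleave's actual preprocessing: the window size (`flank=4`), the padding symbol `X`, and the function names `candidate_windows` and `one_hot` are all assumptions made for the example. The default P1 residue is aspartate (D), reflecting the typical preference of caspases; matrix metalloproteases have no comparably strict P1 preference, so a real pipeline would likely scan every position for them.

```python
# Illustrative sketch only: window size, padding symbol, and encoding are
# assumptions for this example, not DeepCleave's published settings.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues
PAD = "X"  # hypothetical padding symbol for positions past the sequence ends


def candidate_windows(sequence, p1_residue="D", flank=4):
    """Yield (position, window) for every candidate P1 residue.

    position is the 1-based index of the P1 residue. The window holds
    `flank` residues on each side of the scissile bond, i.e. for flank=4
    the positions P4..P1 followed by P1'..P4'.
    """
    padded = PAD * flank + sequence + PAD * flank
    for i, residue in enumerate(sequence):
        if residue == p1_residue:
            # In padded coordinates P1 sits at i + flank, so the window
            # starts (flank - 1) positions earlier, which is i + 1.
            start = i + 1
            yield i + 1, padded[start:start + 2 * flank]


def one_hot(window):
    """Encode a window as a list of 20-dimensional indicator vectors.

    Padding (or any non-standard symbol) becomes an all-zero vector.
    """
    return [[1 if aa == residue else 0 for aa in AMINO_ACIDS]
            for residue in window]


if __name__ == "__main__":
    for position, window in candidate_windows("MDEVDGA"):
        print(position, window, len(one_hot(window)))
```

Each encoded window is a small matrix (window length by 20) that a network with several convolution kernel widths could scan in parallel; the network itself, the attention layer and the transfer learning step are outside the scope of this sketch.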

Availability and implementation

The DeepCleave webserver and source code are freely available at http://deepcleave.erc.monash.edu/.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Li F 

PROVIDER: S-EPMC8215920 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC2917068 | biostudies-literature
| S-EPMC4201543 | biostudies-literature
| S-EPMC10775615 | biostudies-literature
| S-EPMC1764470 | biostudies-literature
| S-EPMC8393747 | biostudies-literature
| S-EPMC2893604 | biostudies-literature
| S-EPMC8543953 | biostudies-literature
| S-EPMC3920740 | biostudies-literature
| S-EPMC9116730 | biostudies-literature
| S-EPMC3650350 | biostudies-literature