Unknown

Dataset Information

0

Predicting proteolytic sites in extracellular proteins: only halfway there.


ABSTRACT: MOTIVATION:Many secretory proteins are synthesized as inactive precursors that must undergo post-translational proteolysis in order to mature and become active. In the current study, we address the challenge of sequence-based discovery of proteolytic sites in secreted proteins using machine learning. RESULTS:The results revealed that only half of the extracellular proteolytic sites are currently annotated, leaving over 3600 unannotated ones. Furthermore, we have found that only 6% of the unannotated sites are similar to known proteolytic sites, whereas the remaining 94% do not share significant similarity with any annotated proteolytic site. The computational challenges in these two cases are very different. While the precision in detecting the former group is close to perfect, only a mere 22% of the latter group were detected with a precision of 80%. The applicability of the classifier is demonstrated through members of the FGF family, in which we verified the conservation of physiologically-relevant proteolytic sites in homologous proteins.

SUBMITTER: Kliger Y 

PROVIDER: S-EPMC7109841 | biostudies-literature | 2008 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

Predicting proteolytic sites in extracellular proteins: only halfway there.

Kliger Yossef Y   Gofer Eyal E   Wool Assaf A   Toporik Amir A   Apatoff Avihay A   Olshansky Moshe M  

Bioinformatics (Oxford, England) 20080304 8


<h4>Motivation</h4>Many secretory proteins are synthesized as inactive precursors that must undergo post-translational proteolysis in order to mature and become active. In the current study, we address the challenge of sequence-based discovery of proteolytic sites in secreted proteins using machine learning.<h4>Results</h4>The results revealed that only half of the extracellular proteolytic sites are currently annotated, leaving over 3600 unannotated ones. Furthermore, we have found that only 6%  ...[more]

Similar Datasets

| S-EPMC10342023 | biostudies-literature
| S-EPMC449705 | biostudies-literature
| S-EPMC2761413 | biostudies-literature
| S-EPMC1534068 | biostudies-literature
| S-EPMC2639300 | biostudies-literature
| S-EPMC4210226 | biostudies-literature
| S-EPMC2638146 | biostudies-literature
| S-EPMC7646512 | biostudies-literature
| S-EPMC2858581 | biostudies-literature
| S-EPMC5578169 | biostudies-literature