Unknown

Dataset Information

0

Probabilistic approach to predicting substrate specificity of methyltransferases.


ABSTRACT: We present a general probabilistic framework for predicting the substrate specificity of enzymes. We designed this approach to be easily applicable to different organisms and enzymes. Therefore, our predictive models do not rely on species-specific properties and use mostly sequence-derived data. Maximum Likelihood optimization is used to fine-tune model parameters and the Akaike Information Criterion is employed to overcome the issue of correlated variables. As a proof-of-principle, we apply our approach to predicting general substrate specificity of yeast methyltransferases (MTases). As input, we use several physico-chemical and biological properties of MTases: structural fold, isoelectric point, expression pattern and cellular localization. Our method accurately predicts whether a yeast MTase methylates a protein, RNA or another molecule. Among our experimentally tested predictions, 89% were confirmed, including the surprising prediction that YOR021C is the first known MTase with a SPOUT fold that methylates a substrate other than RNA (protein). Our approach not only allows for highly accurate prediction of functional specificity of MTases, but also provides insight into general rules governing MTase substrate specificity.

SUBMITTER: Szczepinska T 

PROVIDER: S-EPMC3961171 | biostudies-literature | 2014 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Probabilistic approach to predicting substrate specificity of methyltransferases.

Szczepińska Teresa T   Kutner Jan J   Kopczyński Michał M   Pawłowski Krzysztof K   Dziembowski Andrzej A   Kudlicki Andrzej A   Ginalski Krzysztof K   Rowicka Maga M  

PLoS computational biology 20140320 3


We present a general probabilistic framework for predicting the substrate specificity of enzymes. We designed this approach to be easily applicable to different organisms and enzymes. Therefore, our predictive models do not rely on species-specific properties and use mostly sequence-derived data. Maximum Likelihood optimization is used to fine-tune model parameters and the Akaike Information Criterion is employed to overcome the issue of correlated variables. As a proof-of-principle, we apply ou  ...[more]

Similar Datasets

2024-11-08 | GSE247354 | GEO
| S-EPMC4433623 | biostudies-literature
2024-11-08 | GSE247353 | GEO
2024-11-08 | GSE247352 | GEO
| S-EPMC3904319 | biostudies-literature
| S-EPMC6014603 | biostudies-literature
| S-EPMC3225744 | biostudies-literature
| S-EPMC2999663 | biostudies-literature
| S-EPMC4465971 | biostudies-literature
| S-EPMC3893713 | biostudies-literature