Unknown

Dataset Information

0

A Fast and Interpretable Deep Learning Approach for Accurate Electrostatics-Driven pKa Predictions in Proteins.


ABSTRACT: Existing computational methods for estimating pKa values in proteins rely on theoretical approximations and lengthy computations. In this work, we use a data set of 6 million theoretically determined pKa shifts to train deep learning models, which are shown to rival the physics-based predictors. These neural networks managed to infer the electrostatic contributions of different chemical groups and learned the importance of solvent exposure and close interactions, including hydrogen bonds. Although trained only using theoretical data, our pKAI+ model displayed the best accuracy in a test set of ∼750 experimental values. Inference times allow speedups of more than 1000× compared to physics-based methods. By combining speed, accuracy, and a reasonable understanding of the underlying physics, our models provide a game-changing solution for fast estimations of macroscopic pKa values from ensembles of microscopic values as well as for many downstream applications such as molecular docking and constant-pH molecular dynamics simulations.

SUBMITTER: Reis PBPS 

PROVIDER: S-EPMC9369009 | biostudies-literature | 2022 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

A Fast and Interpretable Deep Learning Approach for Accurate Electrostatics-Driven p<i>K</i><sub>a</sub> Predictions in Proteins.

Reis Pedro B P S PBPS   Bertolini Marco M   Montanari Floriane F   Rocchia Walter W   Machuqueiro Miguel M   Clevert Djork-Arné DA  

Journal of chemical theory and computation 20220715 8


Existing computational methods for estimating p<i>K</i><sub>a</sub> values in proteins rely on theoretical approximations and lengthy computations. In this work, we use a data set of 6 million theoretically determined p<i>K</i><sub>a</sub> shifts to train deep learning models, which are shown to rival the physics-based predictors. These neural networks managed to infer the electrostatic contributions of different chemical groups and learned the importance of solvent exposure and close interactio  ...[more]

Similar Datasets

| S-EPMC10591138 | biostudies-literature
| S-EPMC8912091 | biostudies-literature
| S-EPMC8122155 | biostudies-literature
| S-EPMC10829538 | biostudies-literature
| S-EPMC10850482 | biostudies-literature
| S-EPMC6143552 | biostudies-literature
| S-EPMC7402013 | biostudies-literature
| S-EPMC8076954 | biostudies-literature
| S-EPMC6395045 | biostudies-literature
| S-EPMC2578799 | biostudies-literature