Unknown

Dataset Information

0

Predicting compound-protein interaction using hierarchical graph convolutional networks.


ABSTRACT:

Motivation

Convolutional neural networks have enabled unprecedented breakthroughs in a variety of computer vision tasks. They have also drawn much attention from other domains, including drug discovery and drug development. In this study, we develop a computational method based on convolutional neural networks to tackle a fundamental question in drug discovery and development, i.e. the prediction of compound-protein interactions based on compound structure and protein sequence. We propose a hierarchical graph convolutional network (HGCN) to encode small molecules. The HGCN aggregates a molecule embedding from substructure embeddings, which are synthesized from atom embeddings. As small molecules usually share substructures, computing a molecule embedding from those common substructures allows us to learn better generic models. We then combined the HGCN with a one-dimensional convolutional network to construct a complete model for predicting compound-protein interactions. Furthermore we apply an explanation technique, Grad-CAM, to visualize the contribution of each amino acid into the prediction.

Results

Experiments using different datasets show the improvement of our model compared to other GCN-based methods and a sequence based method, DeepDTA, in predicting compound-protein interactions. Each prediction made by the model is also explainable and can be used to identify critical residues mediating the interaction.

SUBMITTER: Bui-Thi D 

PROVIDER: S-EPMC9302762 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC8182908 | biostudies-literature
| S-EPMC8155034 | biostudies-literature
| S-EPMC8058777 | biostudies-literature
| S-EPMC8268184 | biostudies-literature
| S-EPMC7739465 | biostudies-literature
| S-EPMC7799442 | biostudies-literature
| S-EPMC8742007 | biostudies-literature
| S-EPMC7726911 | biostudies-literature
| S-EPMC7514053 | biostudies-literature
| S-EPMC7374013 | biostudies-literature