Unknown

Dataset Information

0

End-to-end learning for compound activity prediction based on binding pocket information


ABSTRACT:

Background

Recently, machine learning-based ligand activity prediction methods have been greatly improved. However, if known active compounds of a target protein are unavailable, the machine learning-based method cannot be applied. In such cases, docking simulation is generally applied because it only requires a tertiary structure of the target protein. However, the conformation search and the evaluation of binding energy of docking simulation are computationally heavy and thus docking simulation needs huge computational resources. Thus, if we can apply a machine learning-based activity prediction method for a novel target protein, such methods would be highly useful. Recently, Tsubaki et al. proposed an end-to-end learning method to predict the activity of compounds for novel target proteins. However, the prediction accuracy of the method was still insufficient because it only used amino acid sequence information of a protein as the input.

Results

In this research, we proposed an end-to-end learning-based compound activity prediction using structure information of a binding pocket of a target protein. The proposed method learns the important features by end-to-end learning using a graph neural network both for a compound structure and a protein binding pocket structure. As a result of the evaluation experiments, the proposed method has shown higher accuracy than an existing method using amino acid sequence information.

Conclusions

The proposed method achieved equivalent accuracy to docking simulation using AutoDock Vina with much shorter computing time. This indicated that a machine learning-based approach would be promising even for novel target proteins in activity prediction.

SUBMITTER: Tanebe T 

PROVIDER: S-EPMC8555120 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC2766559 | biostudies-literature
| S-EPMC9278401 | biostudies-literature
| S-EPMC6045364 | biostudies-literature
| S-EPMC3155014 | biostudies-literature
2023-03-31 | GSE165175 | GEO
| S-EPMC3322832 | biostudies-literature
2023-03-31 | GSE165174 | GEO