Unknown

Dataset Information

0

GPCRLigNet: rapid screening for GPCR active ligands using machine learning.


ABSTRACT: Molecules with bioactivity towards G protein-coupled receptors represent a subset of the vast space of small drug-like molecules. Here, we compare machine learning models, including dilated graph convolutional networks, that conduct binary classification to quickly identify molecules with activity towards G protein-coupled receptors. The models are trained and validated using a large set of over 600,000 active, inactive, and decoy compounds. The best performing machine learning model, dubbed GPCRLigNet, was a surprisingly simple feedforward dense neural network mapping from Morgan fingerprints to activity. Incorporation of GPCRLigNet into a high-throughput virtual screening workflow is demonstrated with molecular docking towards a particular G protein-coupled receptor, the pituitary adenylate cyclase-activating polypeptide receptor type 1. Through rigorous comparison of docking scores for molecules selected with and without using GPCRLigNet, we demonstrate an enrichment of potentially potent molecules using GPCRLigNet. This work provides a proof of principle that GPCRLigNet can effectively hone the chemical search space towards ligands with G protein-coupled receptor activity.

SUBMITTER: Remington JM 

PROVIDER: S-EPMC10379640 | biostudies-literature | 2023 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

GPCRLigNet: rapid screening for GPCR active ligands using machine learning.

Remington Jacob M JM   McKay Kyle T KT   Beckage Noah B NB   Ferrell Jonathon B JB   Schneebeli Severin T ST   Li Jianing J  

Journal of computer-aided molecular design 20230225 3


Molecules with bioactivity towards G protein-coupled receptors represent a subset of the vast space of small drug-like molecules. Here, we compare machine learning models, including dilated graph convolutional networks, that conduct binary classification to quickly identify molecules with activity towards G protein-coupled receptors. The models are trained and validated using a large set of over 600,000 active, inactive, and decoy compounds. The best performing machine learning model, dubbed GPC  ...[more]

Similar Datasets

| S-EPMC3137231 | biostudies-literature
| S-EPMC9163700 | biostudies-literature
| S-EPMC5831789 | biostudies-literature
| S-EPMC10784458 | biostudies-literature
| S-EPMC4919634 | biostudies-literature
2013-01-01 | E-GEOD-29210 | biostudies-arrayexpress
| S-EPMC6170736 | biostudies-literature
| S-EPMC10409756 | biostudies-literature
| S-EPMC11740121 | biostudies-literature
2023-01-25 | GSE223385 | GEO