Unknown

Dataset Information

0

Machine learning prediction of 3CLpro SARS-CoV-2 docking scores.


ABSTRACT: Molecular docking results of two training sets containing 866 and 8,696 compounds were used to train three different machine learning (ML) approaches. Neural network approaches according to Keras and TensorFlow libraries and the gradient boosted decision trees approach of XGBoost were used with DScribe's Smooth Overlap of Atomic Positions molecular descriptors. In addition, neural networks using the SchNetPack library and descriptors were used. The ML performance was tested on three different sets, including compounds for future organic synthesis. The final evaluation of the ML predicted docking scores was based on the ZINC in vivo set, from which 1,200 compounds were randomly selected with respect to their size. The results obtained showed a consistent ML prediction capability of docking scores, and even though compounds with more than 60 atoms were found slightly overestimated they remain valid for a subsequent evaluation of their drug repurposing suitability.

SUBMITTER: Bucinsky L 

PROVIDER: S-EPMC8881816 | biostudies-literature | 2022 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Machine learning prediction of 3CL<sup>pro</sup> SARS-CoV-2 docking scores.

Bucinsky Lukas L   Bortňák Dušan D   Gall Marián M   Matúška Ján J   Milata Viktor V   Pitoňák Michal M   Štekláč Marek M   Végh Daniel D   Zajaček Dávid D  

Computational biology and chemistry 20220226


Molecular docking results of two training sets containing 866 and 8,696 compounds were used to train three different machine learning (ML) approaches. Neural network approaches according to Keras and TensorFlow libraries and the gradient boosted decision trees approach of XGBoost were used with DScribe's Smooth Overlap of Atomic Positions molecular descriptors. In addition, neural networks using the SchNetPack library and descriptors were used. The ML performance was tested on three different se  ...[more]

Similar Datasets

| S-EPMC10160526 | biostudies-literature
| S-EPMC7785853 | biostudies-literature
| S-EPMC7441806 | biostudies-literature
| S-EPMC8242809 | biostudies-literature
| S-EPMC9781983 | biostudies-literature
| S-EPMC9499987 | biostudies-literature
| S-EPMC8018918 | biostudies-literature
| S-EPMC8178537 | biostudies-literature
| S-EPMC7544940 | biostudies-literature
| S-EPMC10079496 | biostudies-literature