
Dataset Information


BCL::Mol2D-a robust atom environment descriptor for QSAR modeling and lead optimization.


ABSTRACT: Comparing fragment-based molecular fingerprints of drug-like molecules is one of the most robust and frequently used approaches in computer-assisted drug discovery. Molprint2D, a popular atom environment (AE) descriptor, yielded the best enrichment of active compounds across a diverse set of targets in a recent large-scale study. We present here BCL::Mol2D descriptors that outperformed Molprint2D on nine PubChem datasets spanning a wide range of protein classes. Because BCL::Mol2D records the number of AEs from a universal AE library, a novel aspect of BCL::Mol2D over Molprint2D is its reversibility. This property enables decomposition of predictions from machine learning models into contributions from particular molecular substructures. Artificial neural networks with dropout, when trained on BCL::Mol2D descriptors, outperform those trained on Molprint2D descriptors by up to 26% in the logAUC metric. When combined with the Reduced Short Range descriptor set, our previously published set of descriptors optimized for QSARs, BCL::Mol2D yields a modest improvement. Finally, we demonstrate how the reversibility of BCL::Mol2D enables visualization of a 'pharmacophore map' that could guide lead optimization for serine/threonine kinase 33 inhibitors.
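The reversibility described in the abstract follows from counting environments against a fixed, ordered library: each position in the descriptor vector maps back to exactly one environment. The Python sketch below is a toy illustration of that idea only, not the BCL implementation. The molecule encoding, the radius-1 environment definition, and the four-entry library are all assumptions made for this example; this record does not specify the actual BCL::Mol2D atom typing or environment radius.

```python
from collections import Counter

# Toy molecule (heavy atoms of ethanol) as element labels plus an adjacency list.
# This input format is hypothetical and chosen only for illustration.
ATOMS = ["C", "C", "O"]
BONDS = {0: [1], 1: [0, 2], 2: [1]}

# A fixed, ordered environment library shared by all molecules.
# Assumed and tiny here; a real library would be far larger.
LIBRARY = [
    ("C", ("C",)),
    ("C", ("C", "O")),
    ("O", ("C",)),
    ("N", ("C",)),
]


def atom_environment(center, atoms, bonds):
    """Return a radius-1 key: the center element and its sorted neighbor elements."""
    neighbors = sorted(atoms[n] for n in bonds[center])
    return (atoms[center], tuple(neighbors))


def count_vector(atoms, bonds, library):
    """Count how many times each library environment occurs in the molecule."""
    counts = Counter(atom_environment(i, atoms, bonds) for i in range(len(atoms)))
    return [counts.get(env, 0) for env in library]


def present_environments(vector, library):
    """Reverse lookup: map each nonzero vector position back to its environment."""
    return [library[i] for i, count in enumerate(vector) if count > 0]


vector = count_vector(ATOMS, BONDS, LIBRARY)
recovered = present_environments(vector, LIBRARY)
```

Because every vector index corresponds to one known environment, a per-feature weight or attribution from a trained model can be traced back to a concrete substructure, which is the property the abstract credits for the 'pharmacophore map'.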

SUBMITTER: Vu O 

PROVIDER: S-EPMC6824857 | biostudies-literature | 2019 May

REPOSITORIES: biostudies-literature


Publications

BCL::Mol2D-a robust atom environment descriptor for QSAR modeling and lead optimization.

Vu O, Mendenhall J, Altarawy D, Meiler J

Journal of Computer-Aided Molecular Design, 2019-04-06, issue 5



Similar Datasets

| S-EPMC3805266 | biostudies-literature
| S-EPMC8218488 | biostudies-literature
| S-EPMC5374651 | biostudies-literature
| S-EPMC4803518 | biostudies-literature
| S-EPMC6767540 | biostudies-literature
| S-EPMC4340309 | biostudies-literature
| S-EPMC7549127 | biostudies-literature
| S-EPMC2872997 | biostudies-literature
| S-EPMC1302821 | biostudies-literature
| S-EPMC7979990 | biostudies-literature