Unknown

Dataset Information

0

Generative Model for Proposing Drug Candidates Satisfying Anticancer Properties Using a Conditional Variational Autoencoder.


ABSTRACT: Deep learning-based molecular generative models have successfully identified drug candidates with desired properties against biological targets of interest. However, syntactically invalid molecules generated from a deep learning-generated model hinder the model from being applied to drug discovery. Herein, we propose a conditional variational autoencoder (CVAE) as a generative model to propose drug candidates with the desired property outside a data set range. We train the CVAE using molecular fingerprints and corresponding GI50 (inhibition of growth by 50%) results for breast cancer cell lines instead of training with various physical properties for each molecule together. We confirm that the generated fingerprints, not included in the training data set, represent the desired property using the CVAE model. In addition, our method can be used as a query expansion method for searching databases because fingerprints generated using our method can be regarded as expanded queries.

SUBMITTER: Joo S 

PROVIDER: S-EPMC7407547 | biostudies-literature | 2020 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

Generative Model for Proposing Drug Candidates Satisfying Anticancer Properties Using a Conditional Variational Autoencoder.

Joo Sunghoon S   Kim Min Soo MS   Yang Jaeho J   Park Jeahyun J  

ACS omega 20200724 30


Deep learning-based molecular generative models have successfully identified drug candidates with desired properties against biological targets of interest. However, syntactically invalid molecules generated from a deep learning-generated model hinder the model from being applied to drug discovery. Herein, we propose a conditional variational autoencoder (CVAE) as a generative model to propose drug candidates with the desired property outside a data set range. We train the CVAE using molecular f  ...[more]

Similar Datasets

| S-EPMC6041224 | biostudies-literature
| S-EPMC10787843 | biostudies-literature
| S-EPMC9733204 | biostudies-literature
| S-EPMC7216580 | biostudies-literature
| S-EPMC11230389 | biostudies-literature
2013-10-26 | E-GEOD-51739 | biostudies-arrayexpress
| S-EPMC9233801 | biostudies-literature
| S-EPMC8906577 | biostudies-literature
| S-EPMC7450509 | biostudies-literature
| S-EPMC8633506 | biostudies-literature