Dataset Information

Highly predictive and interpretable models for PAMPA permeability.


ABSTRACT: Cell membrane permeability is an important determinant of the oral absorption and bioavailability of a drug molecule. This work describes an in silico model for predicting drug permeability, built on a large dataset of 7488 compound entries (5435 structurally unique molecules) measured in a single laboratory using the parallel artificial membrane permeability assay (PAMPA). Using customized molecular descriptors, a support vector regression (SVR) model trained on the 4071 compounds with quantitative data predicted the remaining 1364 compounds, which have only qualitative data, with an area under the receiver operating characteristic curve (AUC-ROC) of 0.90. A support vector classification (SVC) model trained on half of the whole dataset, comprising both quantitative and qualitative data, predicted the remaining data with an AUC-ROC of 0.88. The results suggest that the SVR model is highly predictive and gives medicinal chemists a useful in silico tool for designing and synthesizing novel compounds with optimal drug-like properties, and thus for accelerating lead optimization in drug discovery.
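
The abstract scores a regression model against qualitative labels using AUC-ROC. The sketch below shows how continuous predictions can be ranked against binary labels to obtain that metric. It is not the authors' code: the function name `auc_roc`, the predicted values, and the labels are all made up for illustration, and the AUC is computed with the standard pairwise (Mann-Whitney) formulation.

```python
def auc_roc(scores, labels):
    """AUC-ROC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical continuous predictions from a regression model (arbitrary units).
predicted = [0.9, 0.8, 0.35, 0.6, 0.2, 0.1]
# Hypothetical qualitative labels: 1 = permeable, 0 = not permeable.
is_permeable = [1, 1, 1, 0, 0, 0]

auc = auc_roc(predicted, is_permeable)
print(f"AUC-ROC = {auc:.3f}")  # prints "AUC-ROC = 0.889"
```

Because AUC-ROC depends only on the ranking of the predictions, a regression model can be evaluated this way without first choosing a permeability cutoff for its output.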

SUBMITTER: Sun H 

PROVIDER: S-EPMC5291813 | biostudies-literature | 2017 Feb

REPOSITORIES: biostudies-literature

Publications

Highly predictive and interpretable models for PAMPA permeability.

Hongmao Sun, Kimloan Nguyen, Edward Kerns, Zhengyin Yan, Kyeong Ri Yu, Pranav Shah, Ajit Jadhav, Xin Xu

Bioorganic & Medicinal Chemistry, 2016-12-31, issue 3

Similar Datasets

2024-04-23 | MODEL2404220001 | BioModels
| S-EPMC9075804 | biostudies-literature
| S-EPMC6651837 | biostudies-literature
| S-EPMC2803117 | biostudies-other
| S-EPMC10682972 | biostudies-literature
| S-EPMC9901841 | biostudies-literature
| S-EPMC3573238 | biostudies-literature
| S-EPMC6175336 | biostudies-literature
| S-EPMC8353662 | biostudies-literature
| S-EPMC10689442 | biostudies-literature