Dataset Information

Machine learning models for classification tasks related to drug safety.


ABSTRACT: In this review, we outline current trends in machine learning-driven classification studies of ADME (absorption, distribution, metabolism and excretion) and toxicity endpoints published over the past six years (2015-2021). The study focuses only on classification models built on large datasets (i.e. more than a thousand compounds). A comprehensive literature search and meta-analysis were carried out for nine different targets: hERG-mediated cardiotoxicity, blood-brain barrier penetration, permeability glycoprotein (P-gp) substrate/inhibitor, cytochrome P450 enzyme family, acute oral toxicity, mutagenicity, carcinogenicity, respiratory toxicity and irritation/corrosion. The best classification models were compared to reveal differences between machine learning algorithms and modeling types, endpoint-specific performances, dataset sizes and validation protocols. Based on the evaluation of the data, tree-based algorithms (still) dominate the field, and consensus modeling is an increasing trend in drug safety predictions. Although classification models with strong performance already exist for hERG-mediated cardiotoxicity and the isoenzymes of the cytochrome P450 enzyme family, these targets remain central to ADMET-related research efforts.

SUBMITTER: Racz A 

PROVIDER: S-EPMC8342376 | biostudies-literature

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC6262234 | biostudies-literature
| S-EPMC8155075 | biostudies-literature
| S-EPMC6367450 | biostudies-literature
| S-EPMC9252837 | biostudies-literature
| S-EPMC6257637 | biostudies-literature
| S-EPMC8472680 | biostudies-literature
| S-EPMC8034680 | biostudies-literature
| S-EPMC6167198 | biostudies-literature
| S-EPMC5088188 | biostudies-literature
| S-EPMC7540939 | biostudies-literature