ABSTRACT:
Motivation: Fingerprints (FPs) are the most common small molecule representation in cheminformatics. There is a wide variety of FPs, and the Extended Connectivity Fingerprint (ECFP) is one of the best-suited for general applications. Despite the abundance of FPs, only a few represent the 3D structure of the molecule, and hardly any encode protein-ligand interactions.
Results: Here, we present a Protein-Ligand Extended Connectivity (PLEC) FP that implicitly encodes protein-ligand interactions by pairing the ECFP environments from the ligand and the protein. PLEC FPs were used to construct different machine learning models tailored for predicting protein-ligand affinities (pKi/d). Even the simplest linear model built on the PLEC FP achieved Rp = 0.817 on the PDBbind v2016 'core set', demonstrating its descriptive power.
Availability and implementation: The PLEC FP has been implemented in the Open Drug Discovery Toolkit (https://github.com/oddt/oddt).
Supplementary information: Supplementary data are available at Bioinformatics online.
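The pairing idea described in the abstract can be illustrated with a small, self-contained sketch. This is not the ODDT implementation: the function names `toy_plec` and `_hash_to_bit`, the string environment labels, the 4.5 Å cutoff, the 1024-bit size and the SHA-256 hashing are all assumptions chosen for illustration. Each atom carries a list of placeholder environment identifiers (one per depth, standing in for real ECFP identifiers), and every ligand-protein atom pair within the cutoff contributes one hashed bit per depth.

```python
import hashlib
import math


def _hash_to_bit(ligand_env, protein_env, size):
    """Map one (ligand environment, protein environment) pair to a bit index.

    Hypothetical helper: SHA-256 is used only to get a stable hash.
    """
    digest = hashlib.sha256(f"{ligand_env}|{protein_env}".encode()).digest()
    return int.from_bytes(digest[:8], "big") % size


def toy_plec(ligand_atoms, protein_atoms, cutoff=4.5, size=1024):
    """Return the sorted 'on' bit indices of a toy interaction fingerprint.

    Each atom is (xyz, envs), where envs lists one environment identifier
    per depth. The identifiers are placeholders for ECFP environments.
    """
    bits = set()
    for l_xyz, l_envs in ligand_atoms:
        for p_xyz, p_envs in protein_atoms:
            # Only atoms in contact (within the distance cutoff) are paired.
            if math.dist(l_xyz, p_xyz) > cutoff:
                continue
            # Pair environments depth by depth; if one side has fewer
            # depths, reuse its deepest environment (an assumption here).
            for depth in range(max(len(l_envs), len(p_envs))):
                l_env = l_envs[min(depth, len(l_envs) - 1)]
                p_env = p_envs[min(depth, len(p_envs) - 1)]
                bits.add(_hash_to_bit(l_env, p_env, size))
    return sorted(bits)


# Made-up example: one ligand atom in contact with one protein atom,
# and a second ligand atom too far away to contribute.
ligand = [
    ((0.0, 0.0, 0.0), ["N", "N(C)(C)"]),
    ((10.0, 0.0, 0.0), ["C", "C(C)(O)"]),
]
protein = [((3.0, 0.0, 0.0), ["O", "O(C)", "O(C(C)(N))"])]

fp = toy_plec(ligand, protein)
```

The resulting list of bit indices could be expanded into a fixed-length vector and passed to a regression model, as the abstract describes for affinity prediction. For real work, the implementation in the Open Drug Discovery Toolkit repository linked above is the authoritative reference.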
SUBMITTER: Wojcikowski M
PROVIDER: S-EPMC6477977 | biostudies-literature | 2019 Apr
REPOSITORIES: biostudies-literature
Maciej Wójcikowski, Michał Kukiełka, Marta M. Stepniewska-Dziubinska, Pawel Siedlecki
Bioinformatics (Oxford, England), 2019-04-01, issue 8