Unknown

Dataset Information

0

ChemMaps: Towards an approach for visualizing the chemical space based on adaptive satellite compounds.


ABSTRACT: We present a novel approach called ChemMaps for visualizing chemical space based on the similarity matrix of compound datasets generated with molecular fingerprints' similarity. The method uses a 'satellites' approach, where satellites are, in principle, molecules whose similarity to the rest of the molecules in the database provides sufficient information for generating a visualization of the chemical space. Such an approach could help make chemical space visualizations more efficient. We hereby describe a proof-of-principle application of the method to various databases that have different diversity measures. Unsurprisingly, we found the method works better with databases that have low 2D diversity. 3D diversity played a secondary role, although it seems to be more relevant as 2D diversity increases. For less diverse datasets, taking as few as 25% satellites seems to be sufficient for a fair depiction of the chemical space. We propose to iteratively increase the satellites number by a factor of 5% relative to the whole database, and stop when the new and the prior chemical space correlate highly. This Research Note represents a first exploratory step, prior to the full application of this method for several datasets.

SUBMITTER: Naveja JJ 

PROVIDER: S-EPMC5538041 | biostudies-literature | 2017

REPOSITORIES: biostudies-literature

altmetric image

Publications

ChemMaps: Towards an approach for visualizing the chemical space based on adaptive satellite compounds.

Naveja J Jesús JJ   Medina-Franco José L JL  

F1000Research 20170717


We present a novel approach called ChemMaps for visualizing chemical space based on the similarity matrix of compound datasets generated with molecular fingerprints' similarity. The method uses a 'satellites' approach, where satellites are, in principle, molecules whose similarity to the rest of the molecules in the database provides sufficient information for generating a visualization of the chemical space. Such an approach could help make chemical space visualizations more efficient. We hereb  ...[more]

Similar Datasets

| S-EPMC3140372 | biostudies-literature
| S-EPMC7593547 | biostudies-literature
| S-EPMC8588419 | biostudies-literature
| S-EPMC4344310 | biostudies-literature
| S-EPMC3248422 | biostudies-literature
| S-EPMC3993238 | biostudies-literature
| S-EPMC6807911 | biostudies-literature
| S-EPMC5458544 | biostudies-literature
| S-EPMC8199418 | biostudies-literature
| S-EPMC3670418 | biostudies-literature