Dataset Information

Hybrid semantic recommender system for chemical compounds in large-scale datasets.


ABSTRACT: The large and increasing number of chemical compounds poses challenges to the exploration of such datasets. In this work, we propose the use of recommender systems to identify compounds of interest to scientific researchers. Our approach consists of a hybrid recommender model suitable for implicit feedback datasets and focused on retrieving a list of items ranked by relevance. The model integrates collaborative-filtering algorithms for implicit feedback (Alternating Least Squares and Bayesian Personalized Ranking) and a new content-based algorithm that uses the semantic similarity between the chemical compounds in the ChEBI ontology. The algorithms were assessed on an implicit dataset of chemical compounds, CheRM-20, with more than 16,000 items (chemical compounds). The hybrid model improved the results of the collaborative-filtering algorithms by more than ten percentage points in most of the assessed evaluation metrics.
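
As a rough illustration of the hybrid idea described in the abstract, the sketch below blends a collaborative-filtering score with a content-based (semantic similarity) score for each compound and returns a ranked list. It is not the authors' implementation: the weighted-sum rule, the min-max rescaling, the `alpha` parameter, the function names, and the compound identifiers are all assumptions made for this example, and the input scores are made-up numbers rather than results from CheRM-20.

```python
from typing import Dict, List


def min_max(scores: Dict[str, float]) -> Dict[str, float]:
    """Rescale scores to [0, 1]; if all scores are equal, map them to 0.0."""
    lo, hi = min(scores.values()), max(scores.values())
    span = hi - lo
    return {k: (v - lo) / span if span else 0.0 for k, v in scores.items()}


def hybrid_rank(cf: Dict[str, float], cb: Dict[str, float],
                alpha: float = 0.5) -> List[str]:
    """Rank items by alpha * CF score + (1 - alpha) * content-based score.

    Assumption: a simple weighted sum of rescaled scores, chosen only
    for illustration. Items missing from either input are ignored.
    """
    cf_n, cb_n = min_max(cf), min_max(cb)
    items = cf_n.keys() & cb_n.keys()
    combined = {i: alpha * cf_n[i] + (1 - alpha) * cb_n[i] for i in items}
    # Highest combined score first; ties are broken by item name.
    return sorted(combined, key=lambda i: (-combined[i], i))


# Hypothetical scores for one user (illustrative values only).
cf_scores = {"compound_a": 0.9, "compound_b": 0.5, "compound_c": 0.1}
cb_scores = {"compound_a": 0.2, "compound_b": 0.8, "compound_c": 0.9}

ranking = hybrid_rank(cf_scores, cb_scores, alpha=0.5)
```

With `alpha=1.0` the ranking falls back to the collaborative-filtering order alone, and with `alpha=0.0` to the content-based order alone, which makes it easy to compare the hybrid against each component.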

SUBMITTER: Barros M 

PROVIDER: S-EPMC7903631 | biostudies-literature | 2021 Feb

REPOSITORIES: biostudies-literature

Publications

Hybrid semantic recommender system for chemical compounds in large-scale datasets.

Marcia Barros, Andre Moitinho, Francisco M. Couto

Journal of Cheminformatics, 2021-02-23, issue 1


Similar Datasets

| S-EPMC4494041 | biostudies-literature
| S-EPMC2944781 | biostudies-literature
| S-EPMC4380459 | biostudies-literature
| S-EPMC8693048 | biostudies-literature
| S-EPMC6902006 | biostudies-literature
| S-EPMC10353785 | biostudies-literature
| S-EPMC4131427 | biostudies-other
| S-EPMC4493645 | biostudies-literature
| S-EPMC6177139 | biostudies-literature
| S-EPMC9908893 | biostudies-literature