
Dataset Information


ML2Motif-Reliable extraction of discriminative sequence motifs from learning machines.


ABSTRACT: High prediction accuracies are not the only objective to consider when solving problems using machine learning. Instead, particular scientific applications require some explanation of the learned prediction function. For computational biology, positional oligomer importance matrices (POIMs) have been successfully applied to explain the decision of support vector machines (SVMs) using weighted-degree (WD) kernels. To extract relevant biological motifs from POIMs, the motifPOIM method has been devised and has shown promising results on real-world data. Our contribution in this paper is twofold: as an extension to POIMs, we propose gPOIM, a general measure of feature importance for arbitrary learning machines and feature sets (including, but not limited to, SVMs and CNNs) and devise a sampling strategy for efficient computation. As a second contribution, we derive a convex formulation of motifPOIMs that leads to more reliable motif extraction from gPOIMs. Empirical evaluations confirm the usefulness of our approach on artificially generated data as well as on real-world datasets.

SUBMITTER: Vidovic MM 

PROVIDER: S-EPMC5367830 | biostudies-literature | 2017

REPOSITORIES: biostudies-literature


Publications

ML2Motif-Reliable extraction of discriminative sequence motifs from learning machines.

Marina M-C Vidovic, Marius Kloft, Klaus-Robert Müller, Nico Görnitz

PLoS ONE, 2017-03-27, issue 3



Similar Datasets

| S-EPMC3605052 | biostudies-literature
| S-EPMC6436896 | biostudies-literature
| S-EPMC7494202 | biostudies-literature
| S-EPMC3250568 | biostudies-literature
| S-EPMC6941814 | biostudies-literature
| S-EPMC2848239 | biostudies-literature
| S-EPMC2826217 | biostudies-literature
| S-EPMC6636396 | biostudies-literature
| S-EPMC10684887 | biostudies-literature
| S-EPMC10538488 | biostudies-literature