Ontology highlight
ABSTRACT: Background
Bioinformatics data analysis often deals with additive mixtures of signals for which only class labels are known. Then, the overall goal is to estimate class related signals for data mining purposes. A convenient application is metabolic monitoring of patients using infrared spectroscopy. Within an infrared spectrum each single compound contributes quantitatively to the measurement.Results
In this work, we propose a novel factorization technique for additive signal factorization that allows learning from classified samples. We define a composed loss function for this task and analytically derive a closed form equation such that training a model reduces to searching for an optimal threshold vector. Our experiments, carried out on synthetic and clinical data, show a sensitivity of up to 0.958 and specificity of up to 0.841 for a 15-class problem of disease classification. Using class and regression information in parallel, our algorithm outperforms linear SVM for training cases having many classes and few data.Conclusions
The presented factorization method provides a simple and generative model and, therefore, represents a first step towards predictive factorization methods.
SUBMITTER: Henneges C
PROVIDER: S-EPMC3247165 | biostudies-literature | 2010 Nov
REPOSITORIES: biostudies-literature
Henneges Carsten C Laskov Pavel P Darmawan Endang E Backhaus Jürgen J Kammerer Bernd B Zell Andreas A
BMC bioinformatics 20101115
<h4>Background</h4>Bioinformatics data analysis often deals with additive mixtures of signals for which only class labels are known. Then, the overall goal is to estimate class related signals for data mining purposes. A convenient application is metabolic monitoring of patients using infrared spectroscopy. Within an infrared spectrum each single compound contributes quantitatively to the measurement.<h4>Results</h4>In this work, we propose a novel factorization technique for additive signal fac ...[more]