Unknown

Dataset Information

0

Kernel approaches for differential expression analysis of mass spectrometry-based metabolomics data.


ABSTRACT:

Background

Data generated from metabolomics experiments are different from other types of "-omics" data. For example, a common phenomenon in mass spectrometry (MS)-based metabolomics data is that the data matrix frequently contains missing values, which complicates some quantitative analyses. One way to tackle this problem is to treat them as absent. Hence there are two types of information that are available in metabolomics data: presence/absence of a metabolite and a quantitative value of the abundance level of a metabolite if it is present. Combining these two layers of information poses challenges to the application of traditional statistical approaches in differential expression analysis.

Results

In this article, we propose a novel kernel-based score test for the metabolomics differential expression analysis. In order to simultaneously capture both the continuous pattern and discrete pattern in metabolomics data, two new kinds of kernels are designed. One is the distance-based kernel and the other is the stratified kernel. While we initially describe the procedures in the case of single-metabolite analysis, we extend the methods to handle metabolite sets as well.

Conclusions

Evaluation based on both simulated data and real data from a liver cancer metabolomics study indicates that our kernel method has a better performance than some existing alternatives. An implementation of the proposed kernel method in the R statistical computing environment is available at http://works.bepress.com/debashis_ghosh/60/ .

SUBMITTER: Zhan X 

PROVIDER: S-EPMC4359587 | biostudies-literature | 2015 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Kernel approaches for differential expression analysis of mass spectrometry-based metabolomics data.

Zhan Xiang X   Patterson Andrew D AD   Ghosh Debashis D  

BMC bioinformatics 20150311


<h4>Background</h4>Data generated from metabolomics experiments are different from other types of "-omics" data. For example, a common phenomenon in mass spectrometry (MS)-based metabolomics data is that the data matrix frequently contains missing values, which complicates some quantitative analyses. One way to tackle this problem is to treat them as absent. Hence there are two types of information that are available in metabolomics data: presence/absence of a metabolite and a quantitative value  ...[more]

Similar Datasets

| S-EPMC8424977 | biostudies-literature
| S-EPMC5362739 | biostudies-literature
| S-EPMC5540702 | biostudies-literature
| S-EPMC7346099 | biostudies-literature
| S-EPMC3471364 | biostudies-literature
| S-EPMC5766532 | biostudies-literature
| S-EPMC10050028 | biostudies-literature
| S-EPMC7542974 | biostudies-literature
| S-EPMC7014187 | biostudies-literature
| S-EPMC8510447 | biostudies-literature