Unknown

Dataset Information

0

A data preprocessing strategy for metabolomics to reduce the mask effect in data analysis.


ABSTRACT: HighlightsDeveloped a data preprocessing strategy to cope with missing values and mask effects in data analysis from high variation of abundant metabolites.A new method- 'x-VAST' was developed to amend the measurement deviation enlargement.Applying the above strategy, several low abundant masked differential metabolites were rescued. Metabolomics is a booming research field. Its success highly relies on the discovery of differential metabolites by comparing different data sets (for example, patients vs. controls). One of the challenges is that differences of the low abundant metabolites between groups are often masked by the high variation of abundant metabolites. In order to solve this challenge, a novel data preprocessing strategy consisting of three steps was proposed in this study. In step 1, a 'modified 80%' rule was used to reduce effect of missing values; in step 2, unit-variance and Pareto scaling methods were used to reduce the mask effect from the abundant metabolites. In step 3, in order to fix the adverse effect of scaling, stability information of the variables deduced from intensity information and the class information, was used to assign suitable weights to the variables. When applying to an LC/MS based metabolomics dataset from chronic hepatitis B patients study and two simulated datasets, the mask effect was found to be partially eliminated and several new low abundant differential metabolites were rescued.

SUBMITTER: Yang J 

PROVIDER: S-EPMC4428451 | biostudies-literature | 2015

REPOSITORIES: biostudies-literature

altmetric image

Publications

A data preprocessing strategy for metabolomics to reduce the mask effect in data analysis.

Yang Jun J   Zhao Xinjie X   Lu Xin X   Lin Xiaohui X   Xu Guowang G  

Frontiers in molecular biosciences 20150202


HighlightsDeveloped a data preprocessing strategy to cope with missing values and mask effects in data analysis from high variation of abundant metabolites.A new method- 'x-VAST' was developed to amend the measurement deviation enlargement.Applying the above strategy, several low abundant masked differential metabolites were rescued. Metabolomics is a booming research field. Its success highly relies on the discovery of differential metabolites by comparing different data sets (for example, pati  ...[more]

Similar Datasets

| S-EPMC7431853 | biostudies-literature
| S-EPMC3471364 | biostudies-literature
| S-EPMC7602939 | biostudies-literature
| S-EPMC6352382 | biostudies-literature
| S-EPMC4052814 | biostudies-literature
| S-EPMC6139382 | biostudies-other
| S-EPMC4652620 | biostudies-literature
| PRJEB45035 | ENA
| S-EPMC4630088 | biostudies-literature
| S-EPMC10515959 | biostudies-literature