Dataset Information

Bi-clustering of metabolic data using matrix factorization tools.

ABSTRACT: Metabolic phenotyping technologies based on Nuclear Magnetic Spectroscopy (NMR) and Mass Spectrometry (MS) generate vast amounts of unrefined data from biological samples. Clustering strategies are frequently employed to provide insight into patterns of relationships between samples and metabolites. Here, we propose the use of a non-negative matrix factorization driven bi-clustering strategy for metabolic phenotyping data in order to discover subsets of interrelated metabolites that exhibit similar behaviour across subsets of samples. The proposed strategy incorporates bi-cross validation and statistical segmentation techniques to automatically determine the number and structure of bi-clusters. This alternative approach is in contrast to the widely used conventional clustering approaches that incorporate all molecular peaks for clustering in metabolic studies and require a priori specification of the number of clusters. We perform the comparative analysis of the proposed strategy with other bi-clustering approaches, which were developed in the context of genomics and transcriptomics research. We demonstrate the superior performance of the proposed bi-clustering strategy on both simulated (NMR) and real (MS) bacterial metabolic data.

SUBMITTER: Gu Q

PROVIDER: S-EPMC6297113 | biostudies-literature | 2018 Dec

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Bi-clustering of metabolic data using matrix factorization tools.

Gu Quan Q Veselkov Kirill K

Methods (San Diego, Calif.) 20180210

Metabolic phenotyping technologies based on Nuclear Magnetic Spectroscopy (NMR) and Mass Spectrometry (MS) generate vast amounts of unrefined data from biological samples. Clustering strategies are frequently employed to provide insight into patterns of relationships between samples and metabolites. Here, we propose the use of a non-negative matrix factorization driven bi-clustering strategy for metabolic phenotyping data in order to discover subsets of interrelated metabolites that exhibit simi ...[more]

PMID: 29438828

Dataset Information

Bi-clustering of metabolic data using matrix factorization tools.

Publications

Bi-clustering of metabolic data using matrix factorization tools.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

A Unified Bayesian Framework for Bi-overlapping-Clustering Multi-omics Data via Sparse Matrix Factorization.
| S-EPMC10766378 | biostudies-literature

Network-based integrative clustering of multiple types of genomic data using non-negative matrix factorization.
| S-EPMC7078030 | biostudies-literature

Single-cell data clustering based on sparse optimization and low-rank matrix factorization.
| S-EPMC8495739 | biostudies-literature

Integrative clustering of multi-level 'omic data based on non-negative matrix factorization algorithm.
| S-EPMC5411077 | biostudies-literature

Peak picking NMR spectral data using non-negative matrix factorization.
| S-EPMC3931316 | biostudies-literature

Applications of a Novel Clustering Approach Using Non-Negative Matrix Factorization to Environmental Research in Public Health.
| S-EPMC4881134 | biostudies-literature

Robust hypergraph regularized non-negative matrix factorization for sample clustering and feature selection in multi-view gene expression data.
| S-EPMC6805321 | biostudies-literature

Predicting epileptic seizures using nonnegative matrix factorization.
| S-EPMC7001919 | biostudies-literature

Protein complex detection via weighted ensemble clustering based on Bayesian nonnegative matrix factorization.
| S-EPMC3642239 | biostudies-literature

Sparse data embedding and prediction by tropical matrix factorization.
| S-EPMC7908717 | biostudies-literature