Dataset Information

Variational autoencoders learn transferrable representations of metabolomics data.

ABSTRACT: Dimensionality reduction approaches are commonly used for the deconvolution of high-dimensional metabolomics datasets into underlying core metabolic processes. However, current state-of-the-art methods are widely incapable of detecting nonlinearities in metabolomics data. Variational Autoencoders (VAEs) are a deep learning method designed to learn nonlinear latent representations which generalize to unseen data. Here, we trained a VAE on a large-scale metabolomics population cohort of human blood samples consisting of over 4500 individuals. We analyzed the pathway composition of the latent space using a global feature importance score, which demonstrated that latent dimensions represent distinct cellular processes. To demonstrate model generalizability, we generated latent representations of unseen metabolomics datasets on type 2 diabetes, acute myeloid leukemia, and schizophrenia and found significant correlations with clinical patient groups. Notably, the VAE representations showed stronger effects than latent dimensions derived by linear and non-linear principal component analysis. Taken together, we demonstrate that the VAE is a powerful method that learns biologically meaningful, nonlinear, and transferrable latent representations of metabolomics data.

SUBMITTER: Gomari DP

PROVIDER: S-EPMC9246987 | biostudies-literature | 2022 Jun

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Variational autoencoders learn transferrable representations of metabolomics data.

Gomari Daniel P DP Schweickart Annalise A Cerchietti Leandro L Paietta Elisabeth E Fernandez Hugo H Al-Amin Hassen H Suhre Karsten K Krumsiek Jan J

Communications biology 20220630 1

Dimensionality reduction approaches are commonly used for the deconvolution of high-dimensional metabolomics datasets into underlying core metabolic processes. However, current state-of-the-art methods are widely incapable of detecting nonlinearities in metabolomics data. Variational Autoencoders (VAEs) are a deep learning method designed to learn nonlinear latent representations which generalize to unseen data. Here, we trained a VAE on a large-scale metabolomics population cohort of human bloo ...[more]

PMID: 35773471

Dataset Information

Variational autoencoders learn transferrable representations of metabolomics data.

Publications

Variational autoencoders learn transferrable representations of metabolomics data.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Benchmarking variational AutoEncoders on cancer transcriptomics data.
| S-EPMC10553230 | biostudies-literature

Orientation-invariant autoencoders learn robust representations for shape profiling of cells and organelles.
| S-EPMC10838319 | biostudies-literature

Variational Autoencoders for Cancer Data Integration: Design Principles and Computational Practice.
| S-EPMC6917668 | biostudies-literature

Data augmentation using Variational Autoencoders for improvement of respiratory disease classification.
| S-EPMC9374267 | biostudies-literature

Variational embedding of protein folding simulations using Gaussian mixture variational autoencoders.
| S-EPMC8605902 | biostudies-literature

Detecting Respiratory Pathologies Using Convolutional Neural Networks and Variational Autoencoders for Unbalancing Data.
| S-EPMC7070339 | biostudies-literature

Adversarial and variational autoencoders improve metagenomic binning.
| S-EPMC10590447 | biostudies-literature

Generating functional protein variants with variational autoencoders.
| S-EPMC7946179 | biostudies-literature

Joint variational autoencoders for multimodal imputation and embedding.
| S-EPMC11340721 | biostudies-literature

Interpretable cardiac anatomy modeling using variational mesh autoencoders.
| S-EPMC9813669 | biostudies-literature