Unknown

Dataset Information

0

RVAgene: Generative modeling of gene expression time series data.


ABSTRACT:

Motivation

Methods to model dynamic changes in gene expression at a genome-wide level are not currently sufficient for large (temporally rich or single-cell) datasets. Variational autoencoders offer means to characterize large datasets and have been used effectively to characterize features of single-cell datasets. Here we extend these methods for use with gene expression time series data.

Results

We present RVAgene: a recurrent variational autoencoder to model gene expression dynamics. RVAgene learns to accurately and efficiently reconstruct temporal gene profiles. It also learns a low dimensional representation of the data via a recurrent encoder network that can be used for biological feature discovery, and from which we can generate new gene expression data by sampling the latent space. We test RVAgene on simulated and real biological datasets, including embryonic stem cell differentiation and kidney injury response dynamics. In all cases, RVAgene accurately reconstructed complex gene expression temporal profiles. Via cross validation, we show that a low-error latent space representation can be learnt using only a fraction of the data. Through clustering and gene ontology term enrichment analysis on the latent space, we demonstrate the potential of RVAgene for unsupervised discovery. In particular, RVAgene identifies new programs of shared gene regulation of Lox family genes in response to kidney injury.

Availability

RVAgene is available in Python, at gihub: https://github.com/maclean-lab/RVAgene; Zenodo archive: http://doi.org/10.5281/zenodo.4271097.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Mitra R 

PROVIDER: S-EPMC8504625 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC7449007 | biostudies-literature
| S-EPMC5649163 | biostudies-literature
| S-EPMC3524318 | biostudies-literature
| S-EPMC11330441 | biostudies-literature
| S-EPMC2743670 | biostudies-literature
| S-EPMC8163769 | biostudies-literature
| S-EPMC2453326 | biostudies-literature
| S-EPMC2720980 | biostudies-literature
| S-EPMC5259824 | biostudies-literature
| S-EPMC7647064 | biostudies-literature