Unknown

Dataset Information

0

A Poisson Log-Normal Model for Constructing Gene Covariation Network Using RNA-seq Data.


ABSTRACT: Constructing expression networks using transcriptomic data is an effective approach for studying gene regulation. A popular approach for constructing such a network is based on the Gaussian graphical model (GGM), in which an edge between a pair of genes indicates that the expression levels of these two genes are conditionally dependent, given the expression levels of all other genes. However, GGMs are not appropriate for non-Gaussian data, such as those generated in RNA-seq experiments. We propose a novel statistical framework that maximizes a penalized likelihood, in which the observed count data follow a Poisson log-normal distribution. To overcome the computational challenges, we use Laplace's method to approximate the likelihood and its gradients, and apply the alternating directions method of multipliers to find the penalized maximum likelihood estimates. The proposed method is evaluated and compared with GGMs using both simulated and real RNA-seq data. The proposed method shows improved performance in detecting edges that represent covarying pairs of genes, particularly for edges connecting low-abundant genes and edges around regulatory hubs.

SUBMITTER: Choi Y 

PROVIDER: S-EPMC5510689 | biostudies-literature | 2017 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

A Poisson Log-Normal Model for Constructing Gene Covariation Network Using RNA-seq Data.

Choi Yoonha Y   Coram Marc M   Peng Jie J   Tang Hua H  

Journal of computational biology : a journal of computational molecular cell biology 20170530 7


Constructing expression networks using transcriptomic data is an effective approach for studying gene regulation. A popular approach for constructing such a network is based on the Gaussian graphical model (GGM), in which an edge between a pair of genes indicates that the expression levels of these two genes are conditionally dependent, given the expression levels of all other genes. However, GGMs are not appropriate for non-Gaussian data, such as those generated in RNA-seq experiments. We propo  ...[more]

Similar Datasets

| S-EPMC6636065 | biostudies-literature
| S-EPMC6763381 | biostudies-literature
| S-EPMC2943596 | biostudies-literature
| S-EPMC3874726 | biostudies-literature
| S-EPMC3244770 | biostudies-literature
| S-EPMC8721966 | biostudies-literature
| S-EPMC4539434 | biostudies-literature
| S-EPMC3156954 | biostudies-literature
| S-EPMC6446481 | biostudies-literature
| S-EPMC3493127 | biostudies-literature