Unknown

Dataset Information

0

Inferring Regulatory Networks From Mixed Observational Data Using Directed Acyclic Graphs.


ABSTRACT: Construction of regulatory networks using cross-sectional expression profiling of genes is desired, but challenging. The Directed Acyclic Graph (DAG) provides a general framework to infer causal effects from observational data. However, most existing DAG methods assume that all nodes follow the same type of distribution, which prohibit a joint modeling of continuous gene expression and categorical variables. We present a new mixed DAG (mDAG) algorithm to infer the regulatory pathway from mixed observational data containing both continuous variables (e.g. expression of genes) and categorical variables (e.g. categorical phenotypes or single nucleotide polymorphisms). Our method can identify upstream causal factors and downstream effectors closely linked to a variable and generate hypotheses for causal direction of regulatory pathways. We propose a new permutation method to test the conditional independence of variables of mixed types, which is the key for mDAG. We also utilize an L 1 regularization in mDAG to ensure it can recover a large sparse DAG with limited sample size. We demonstrate through extensive simulations that mDAG outperforms two well-known methods in recovering the true underlying DAG. We apply mDAG to a cross-sectional immunological study of Chlamydia trachomatis infection and successfully infer the regularity network of cytokines. We also apply mDAG to a large cohort study, generating sensible mechanistic hypotheses underlying plasma adiponectin level. The R package mDAG is publicly available from CRAN at https://CRAN.R-project.org/package=mDAG.

SUBMITTER: Zhong W 

PROVIDER: S-EPMC7038820 | biostudies-literature | 2020

REPOSITORIES: biostudies-literature

altmetric image

Publications

Inferring Regulatory Networks From Mixed Observational Data Using Directed Acyclic Graphs.

Zhong Wujuan W   Dong Li L   Poston Taylor B TB   Darville Toni T   Spracklen Cassandra N CN   Wu Di D   Mohlke Karen L KL   Li Yun Y   Li Quefeng Q   Zheng Xiaojing X  

Frontiers in genetics 20200207


Construction of regulatory networks using cross-sectional expression profiling of genes is desired, but challenging. The Directed Acyclic Graph (DAG) provides a general framework to infer causal effects from observational data. However, most existing DAG methods assume that all nodes follow the same type of distribution, which prohibit a joint modeling of continuous gene expression and categorical variables. We present a new mixed DAG (mDAG) algorithm to infer the regulatory pathway from mixed o  ...[more]

Similar Datasets

| S-EPMC6176748 | biostudies-literature
2008-12-30 | GSE8880 | GEO
| S-EPMC2743182 | biostudies-literature
| S-EPMC6935350 | biostudies-literature
| S-EPMC4975686 | biostudies-literature
| S-EPMC3898602 | biostudies-literature
| S-EPMC8240035 | biostudies-literature
| S-EPMC7124493 | biostudies-literature
| S-EPMC4765882 | biostudies-other
| S-EPMC7787104 | biostudies-literature