Unknown

Dataset Information

0

MODEL FREE ESTIMATION OF GRAPHICAL MODEL USING GENE EXPRESSION DATA.


ABSTRACT: Graphical model is a powerful and popular approach to study high-dimensional omic data, such as genome-wide gene expression data. Nonlinear relations between genes are widely documented. However, partly due to sparsity of data points in high dimensional space (i.e., curse of dimensionality) and computational challenges, most available methods construct graphical models by testing linear relations. We propose to address this challenge by a two-step approach: first use a model-free approach to prioritize the neighborhood of each gene, then apply a non-parametric conditional independence testing method to refine such neighborhood estimation. Our method, named as "mofreds" (MOdel FRee Estimation of DAG Skeletons), seeks to estimate the skeleton of a directed acyclic graph (DAG) by this two-step approach. We studied the theoretical properties of mofreds, and evaluated its performance in extensive simulation settings. We found mofreds has substantially better performance than the state-of-the art method which is designed to detect linear relations of Gaussian graphical models. We applied mofreds to analyze gene expression data of breast cancer patients from The Cancer Genome Atlas (TCGA). We found that it discovers non-linear relationships among gene pairs that are missed by the Gaussian graphical model methods.

SUBMITTER: Yang J 

PROVIDER: S-EPMC8341558 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC8360145 | biostudies-literature
| S-EPMC2808166 | biostudies-literature
| S-EPMC4052785 | biostudies-literature
| S-EPMC8760643 | biostudies-literature
| S-EPMC4634229 | biostudies-literature
| S-EPMC4674863 | biostudies-literature
| S-EPMC4974017 | biostudies-literature
| S-EPMC6509820 | biostudies-literature
| S-EPMC5515703 | biostudies-literature
| S-EPMC4307846 | biostudies-literature