Unknown

Dataset Information

0

Gene set analysis with graph-embedded kernel association test.


ABSTRACT:

Motivation

Kernel-based association test (KAT) has been a popular approach to evaluate the association of expressions of a gene set (e.g. pathway) with a phenotypic trait. KATs rely on kernel functions which capture the sample similarity across multiple features, to capture potential linear or non-linear relationship among features in a gene set. When calculating the kernel functions, no network graphical information about the features is considered. While genes in a functional group (e.g. a pathway) are not independent in general due to regulatory interactions, incorporating regulatory network (or graph) information can potentially increase the power of KAT. In this work, we propose a graph-embedded kernel association test, termed gKAT. gKAT incorporates prior pathway knowledge when constructing a kernel function into hypothesis testing.

Results

We apply a diffusion kernel to capture any graph structures in a gene set, then incorporate such information to build a kernel function for further association test. We illustrate the geometric meaning of the approach. Through extensive simulation studies, we show that the proposed gKAT algorithm can improve testing power compared to the one without considering graph structures. Application to a real dataset further demonstrate the utility of the method.

Availability and implementation

The R code used for the analysis can be accessed at https://github.com/JialinQu/gKAT.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Qu J 

PROVIDER: S-EPMC8896609 | biostudies-literature | 2022 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Gene set analysis with graph-embedded kernel association test.

Qu Jialin J   Cui Yuehua Y  

Bioinformatics (Oxford, England) 20220301 6


<h4>Motivation</h4>Kernel-based association test (KAT) has been a popular approach to evaluate the association of expressions of a gene set (e.g. pathway) with a phenotypic trait. KATs rely on kernel functions which capture the sample similarity across multiple features, to capture potential linear or non-linear relationship among features in a gene set. When calculating the kernel functions, no network graphical information about the features is considered. While genes in a functional group (e.  ...[more]

Similar Datasets

| S-EPMC3441747 | biostudies-literature
| S-EPMC2429859 | biostudies-literature
| S-EPMC4158946 | biostudies-other
| S-EPMC8388036 | biostudies-literature
| S-EPMC6692773 | biostudies-literature
| S-EPMC6902471 | biostudies-literature
| S-EPMC8059024 | biostudies-literature
| S-EPMC3373190 | biostudies-literature
| S-EPMC4724299 | biostudies-literature
| S-EPMC5860324 | biostudies-literature