Dataset Information

Association testing of bisulfite-sequencing methylation data via a Laplace approximation.


ABSTRACT:

Motivation

Epigenome-wide association studies can provide novel insights into the regulation of genes involved in traits and diseases. The rapid emergence of bisulfite-sequencing technologies makes it possible to perform such genome-wide studies at single-nucleotide resolution. However, analysis of bisulfite-sequencing data poses statistical challenges owing to low and uneven sequencing depth, as well as the presence of confounding factors. The recently introduced Mixed model Association for Count data via data AUgmentation (MACAU) can address these challenges via a generalized linear mixed model when confounding can be encoded via a single variance component. However, MACAU cannot be used in the presence of multiple variance components. Additionally, MACAU uses a computationally expensive Markov Chain Monte Carlo (MCMC) procedure, which cannot directly approximate the model likelihood.

Results

We present a new method, Mixed model Association via a Laplace ApproXimation (MALAX), that is more computationally efficient than MACAU and can model multiple variance components. MALAX uses a Laplace approximation rather than an MCMC-based approximation, which makes it possible to approximate the model likelihood directly. Through an extensive analysis of simulated and real data, we demonstrate that MALAX successfully addresses the statistical challenges introduced by bisulfite sequencing while controlling for complex sources of confounding, and can be over 50% faster than the state of the art.
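
To illustrate what a Laplace approximation of a model likelihood looks like, the sketch below works through a one-dimensional toy case. It is not the MALAX implementation. It assumes a single site with y methylated reads out of r total reads, a logit link, a fixed offset mu, and one Gaussian random effect g with variance s2. All function and parameter names are hypothetical. The real method works with a vector of random effects whose covariance is built from one or more variance components, but the idea is the same: find the mode of the log joint density, then replace the integral over the random effect with a Gaussian integral centered at that mode.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def log_joint(g, y, r, mu, s2):
    """log[ Binomial(y | r, sigmoid(mu + g)) * Normal(g | 0, s2) ]."""
    eta = mu + g
    # Numerically stable log(1 + exp(eta)).
    log1pexp = max(eta, 0.0) + math.log1p(math.exp(-abs(eta)))
    log_binom = (math.lgamma(r + 1) - math.lgamma(y + 1)
                 - math.lgamma(r - y + 1) + y * eta - r * log1pexp)
    log_prior = -0.5 * math.log(2.0 * math.pi * s2) - 0.5 * g * g / s2
    return log_binom + log_prior


def laplace_log_marginal(y, r, mu, s2, n_iter=50, tol=1e-10):
    """Laplace approximation of log p(y); returns (log marginal, mode of g)."""
    g = 0.0
    for _ in range(n_iter):
        p = sigmoid(mu + g)
        grad = (y - r * p) - g / s2            # d/dg of the log joint
        hess = -r * p * (1.0 - p) - 1.0 / s2   # second derivative, always < 0
        step = grad / hess
        g -= step                              # Newton update toward the mode
        if abs(step) < tol:
            break
    p = sigmoid(mu + g)
    neg_hess = r * p * (1.0 - p) + 1.0 / s2
    # Gaussian integral around the mode: exp(h(g_hat)) * sqrt(2*pi / -h''(g_hat)).
    value = (log_joint(g, y, r, mu, s2)
             + 0.5 * math.log(2.0 * math.pi) - 0.5 * math.log(neg_hess))
    return value, g


def quadrature_log_marginal(y, r, mu, s2, lo=-12.0, hi=12.0, n=20001):
    """Brute-force trapezoid integration over g, used only as a reference."""
    h = (hi - lo) / (n - 1)
    total = 0.0
    for i in range(n):
        g = lo + i * h
        w = 0.5 if i in (0, n - 1) else 1.0
        total += w * math.exp(log_joint(g, y, r, mu, s2))
    return math.log(total * h)


if __name__ == "__main__":
    approx, mode = laplace_log_marginal(y=3, r=10, mu=0.0, s2=1.0)
    exact = quadrature_log_marginal(y=3, r=10, mu=0.0, s2=1.0)
    print("mode of g:", mode)
    print("Laplace log-likelihood:   ", approx)
    print("quadrature log-likelihood:", exact)
```

Running the script prints the Laplace value next to a brute-force quadrature value for the same toy model, which gives a feel for how close the approximation is at low read depth. The brute-force reference is feasible only because the toy has a single random effect.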

Availability and implementation

The full source code of MALAX is available at https://github.com/omerwe/MALAX .

Contact

omerw@cs.technion.ac.il or ehalperin@cs.ucla.edu.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Weissbrod O 

PROVIDER: S-EPMC5870555 | biostudies-literature | 2017 Jul

REPOSITORIES: biostudies-literature

Publications

Association testing of bisulfite-sequencing methylation data via a Laplace approximation.

Omer Weissbrod, Elior Rahmani, Regev Schweiger, Saharon Rosset, Eran Halperin

Bioinformatics (Oxford, England), 2017-07-01, issue 14


Similar Datasets

| S-EPMC6031184 | biostudies-literature
| S-EPMC8345583 | biostudies-literature
| S-EPMC4071208 | biostudies-literature
| S-EPMC8116545 | biostudies-literature
| S-EPMC8450090 | biostudies-literature
| S-EPMC3938178 | biostudies-literature
| S-EPMC5320668 | biostudies-literature
| S-EPMC8686677 | biostudies-literature
| S-EPMC7329512 | biostudies-literature
| S-EPMC4623491 | biostudies-literature