Dataset Information

Pulver: an R package for parallel ultra-rapid p-value computation for linear regression interaction terms.

ABSTRACT: Genome-wide association studies allow us to understand the genetics of complex diseases. Human metabolism provides information about the disease-causing mechanisms, so it is usual to investigate the associations between genetic variants and metabolite levels. However, only considering genetic variants and their effects on one trait ignores the possible interplay between different "omics" layers. Existing tools only consider single-nucleotide polymorphism (SNP)-SNP interactions, and no practical tool is available for large-scale investigations of the interactions between pairs of arbitrary quantitative variables.We developed an R package called pulver to compute p-values for the interaction term in a very large number of linear regression models. Comparisons based on simulated data showed that pulver is much faster than the existing tools. This is achieved by using the correlation coefficient to test the null-hypothesis, which avoids the costly computation of inversions. Additional tricks are a rearrangement of the order, when iterating through the different "omics" layers, and implementing this algorithm in the fast programming language C++. Furthermore, we applied our algorithm to data from the German KORA study to investigate a real-world problem involving the interplay among DNA methylation, genetic variants, and metabolite levels.The pulver package is a convenient and rapid tool for screening huge numbers of linear regression models for significant interaction terms in arbitrary pairs of quantitative variables. pulver is written in R and C++, and can be downloaded freely from CRAN at https://cran.r-project.org/web/packages/pulver/ .

SUBMITTER: Molnos S

PROVIDER: S-EPMC5622569 | biostudies-literature | 2017 Sep

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

pulver: an R package for parallel ultra-rapid p-value computation for linear regression interaction terms.

Molnos Sophie S Baumbach Clemens C Wahl Simone S Müller-Nurasyid Martina M Strauch Konstantin K Wang-Sattler Rui R Waldenberger Melanie M Meitinger Thomas T Adamski Jerzy J Kastenmüller Gabi G Suhre Karsten K Peters Annette A Grallert Harald H Theis Fabian J FJ Gieger Christian C

BMC bioinformatics 20170929 1

<h4>Background</h4>Genome-wide association studies allow us to understand the genetics of complex diseases. Human metabolism provides information about the disease-causing mechanisms, so it is usual to investigate the associations between genetic variants and metabolite levels. However, only considering genetic variants and their effects on one trait ignores the possible interplay between different "omics" layers. Existing tools only consider single-nucleotide polymorphism (SNP)-SNP interactions ...[more]

PMID: 28962546

Dataset Information

Pulver: an R package for parallel ultra-rapid p-value computation for linear regression interaction terms.

Publications

pulver: an R package for parallel ultra-rapid p-value computation for linear regression interaction terms.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

GOSim--an R-package for computation of information theoretic GO similarities between terms and gene products.
| S-EPMC1892785 | biostudies-literature

BatchMap: A parallel implementation of the OneMap R package for fast computation of F1 linkage maps in outcrossing species.
| S-EPMC5738033 | biostudies-literature

Combined Linear Interaction Energy and Alchemical Solvation Free-Energy Approach for Protein-Binding Affinity Computation.
| S-EPMC7017367 | biostudies-literature

GenMap: ultra-fast computation of genome mappability.
| S-EPMC7320602 | biostudies-literature

GWAS on your notebook: fast semi-parallel linear and logistic regression for genome-wide association studies.
| S-EPMC3695771 | biostudies-literature

TEffectR: an R package for studying the potential effects of transposable elements on gene expression with linear regression model.
| S-EPMC6899341 | biostudies-literature

Interaction terms in nonlinear models.
| S-EPMC3447245 | biostudies-literature

QuantumInformation.jl-A Julia package for numerical computation in quantum information theory.
| S-EPMC6306216 | biostudies-other

Benign overfitting in linear regression.
| S-EPMC7720150 | biostudies-literature

The advantages of linear information processing for cerebellar computation.
| S-EPMC2657437 | biostudies-literature