Dataset Information

Detecting genetic association through shortest paths in a bidirected graph.

ABSTRACT: Genome-wide association studies (GWASs) commonly use marginal association tests for each single-nucleotide polymorphism (SNP). Because these tests treat SNPs as independent, their power will be suboptimal for detecting SNPs hidden by linkage disequilibrium (LD). One way to improve power is to use a multiple regression model. However, the large number of SNPs preclude simultaneous fitting with multiple regression, and subset regression is infeasible because of an exorbitant number of candidate subsets. We therefore propose a new method for detecting hidden SNPs having significant yet weak marginal association in a multiple regression model. Our method begins by constructing a bidirected graph locally around each SNP that demonstrates a moderately sized marginal association signal, the focal SNPs. Vertexes correspond to SNPs, and adjacency between vertexes is defined by an LD measure. Subsequently, the method collects from each graph all shortest paths to the focal SNP. Finally, for each shortest path the method fits a multiple regression model to all the SNPs lying in the path and tests the significance of the regression coefficient corresponding to the terminal SNP in the path. Simulation studies show that the proposed method can detect susceptibility SNPs hidden by LD that go undetected with marginal association testing or with existing multivariate methods. When applied to real GWAS data from the Alzheimer's Disease Neuroimaging Initiative (ADNI), our method detected two groups of SNPs: one in a region containing the apolipoprotein E (APOE) gene, and another in a region close to the semaphorin 5A (SEMA5A) gene.

SUBMITTER: Ueki M

PROVIDER: S-EPMC5849262 | biostudies-literature | 2017 Sep

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Detecting genetic association through shortest paths in a bidirected graph.

Ueki Masao M Kawasaki Yoshinori Y Tamiya Gen G

Genetic epidemiology 20170619 6

Genome-wide association studies (GWASs) commonly use marginal association tests for each single-nucleotide polymorphism (SNP). Because these tests treat SNPs as independent, their power will be suboptimal for detecting SNPs hidden by linkage disequilibrium (LD). One way to improve power is to use a multiple regression model. However, the large number of SNPs preclude simultaneous fitting with multiple regression, and subset regression is infeasible because of an exorbitant number of candidate su ...[more]

PMID: 28626864

Dataset Information

Detecting genetic association through shortest paths in a bidirected graph.

Publications

Detecting genetic association through shortest paths in a bidirected graph.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Shortest Paths in Multiplex Networks.
| S-EPMC5438413 | biostudies-literature

DiversePathsJ: diverse shortest paths for bioimage analysis.
| S-EPMC5860364 | biostudies-literature

Simulating SIR processes on networks using weighted shortest paths.
| S-EPMC5920074 | biostudies-literature

Efficient prediction of reaction paths through molecular graph and reaction network analysis.
| S-EPMC5887236 | biostudies-other

Two betweenness centrality measures based on Randomized Shortest Paths.
| S-EPMC4738330 | biostudies-other

A trainable clustering algorithm based on shortest paths from density peaks.
| S-EPMC7051829 | biostudies-literature

Estimation and update of betweenness centrality with progressive algorithm and shortest paths approximation.
| S-EPMC10564764 | biostudies-literature

Drug-drug interaction extraction via hierarchical RNNs on sequence and shortest dependency paths.
| S-EPMC6030919 | biostudies-literature

A shortest-path graph kernel for estimating gene product semantic similarity.
| S-EPMC3161911 | biostudies-literature

Neo4j graph dataset of cycling paths in Slovenia.
| S-EPMC10293952 | biostudies-literature