Dataset Information

FHSA-SED: Two-Locus Model Detection for Genome-Wide Association Study with Harmony Search Algorithm.


ABSTRACT: MOTIVATION: The two-locus model is a typical significant disease model to be identified in genome-wide association studies (GWAS). Owing to the intensive computational burden and the diversity of disease models, existing methods suffer from low detection power, high computation cost, and a preference for some types of disease models. METHOD: In this study, two scoring functions (the Bayesian-network-based K2-score and the Gini-score) are used to characterize a pair of SNP loci as a candidate model. The two criteria are applied simultaneously to improve identification power and to tackle the preference for particular disease models. The harmony search algorithm (HSA) is improved to quickly find the most likely candidate models among all two-locus models; within it, a local search algorithm with a two-dimensional tabu table is introduced to avoid repeatedly evaluating disease models that have a strong marginal effect. Finally, the G-test statistic is used to further test the candidate models. RESULTS: We evaluate our method, named FHSA-SED, on 82 simulated datasets and a real AMD dataset, and compare it with two typical, recently developed methods (MACOED and CSE) that are based on swarm intelligence search algorithms. The simulation results indicate that our method outperforms the two compared algorithms in terms of detection power, computation time, number of evaluations, sensitivity (TPR), specificity (SPC), positive predictive value (PPV), and accuracy (ACC). On the AMD dataset, our method identified two SNPs (rs3775652 and rs10511467) that may also be associated with the disease.
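To make the scoring and testing steps in the abstract more concrete, here is a minimal sketch of how a Gini-style score and a G-test statistic could be computed for one pair of SNPs. It assumes genotypes coded as 0/1/2 and case/control labels coded as 0/1. The function names (`contingency`, `gini_score`, `g_statistic`) are hypothetical, and the formulas are the textbook definitions, which may differ in detail from those used in FHSA-SED. The K2-score, the harmony search, and the tabu table are not covered here.

```python
import math

def contingency(snp_a, snp_b, labels):
    """Count samples per (genotype combination, class).

    Assumes genotypes are coded 0/1/2 and labels 0 (control) / 1 (case),
    giving a 9 x 2 table indexed by 3 * a + b.
    """
    table = [[0, 0] for _ in range(9)]
    for a, b, y in zip(snp_a, snp_b, labels):
        table[3 * a + b][y] += 1
    return table

def gini_score(table):
    """Weighted Gini impurity of the class label across genotype combinations.

    Lower values mean the genotype combinations separate cases from controls
    more cleanly (textbook definition, assumed here for illustration).
    """
    n = sum(sum(row) for row in table)
    score = 0.0
    for row in table:
        r = sum(row)
        if r == 0:
            continue  # empty genotype combination contributes nothing
        score += (r / n) * (1.0 - sum((c / r) ** 2 for c in row))
    return score

def g_statistic(table):
    """G = 2 * sum(O * ln(O / E)) over non-empty cells.

    Expected counts E come from the row and column marginals.
    """
    n = sum(sum(row) for row in table)
    col = [sum(row[j] for row in table) for j in range(2)]
    g = 0.0
    for row in table:
        r = sum(row)
        for j, observed in enumerate(row):
            if observed > 0:
                expected = r * col[j] / n
                g += 2.0 * observed * math.log(observed / expected)
    return g

# Toy example: genotype combination fully determines case/control status.
toy = contingency([0, 0, 1, 1], [0, 0, 1, 1], [0, 0, 1, 1])
print(gini_score(toy), g_statistic(toy))
```

In a screening setting, a low Gini score would flag a SNP pair as a candidate, and the G statistic would then be compared against a chi-square distribution to obtain a p-value. That last step is left out here to keep the sketch free of external dependencies.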

SUBMITTER: Tuo S 

PROVIDER: S-EPMC4807955 | biostudies-literature | 2016

REPOSITORIES: biostudies-literature

Publications

FHSA-SED: Two-Locus Model Detection for Genome-Wide Association Study with Harmony Search Algorithm.

Tuo S, Zhang J, Yuan X, Zhang Y, Liu Z

PLoS ONE, 2016-03-25, issue 3


Similar Datasets

| S-EPMC1570380 | biostudies-literature
| S-EPMC7516787 | biostudies-literature
| S-EPMC10575495 | biostudies-literature
| S-EPMC3116488 | biostudies-literature
| S-EPMC9541083 | biostudies-literature
| S-EPMC2824680 | biostudies-literature
| S-EPMC8041068 | biostudies-literature
| S-EPMC5308866 | biostudies-literature
| S-EPMC3563403 | biostudies-literature
| S-EPMC5599559 | biostudies-literature