Dataset Information

To control false positives in gene-gene interaction analysis: two novel conditional entropy-based approaches.


ABSTRACT: Genome-wide analysis of gene-gene interactions has been recognized as a powerful avenue for identifying the missing genetic components that cannot be detected by current single-point association analysis. Recently, several model-free methods (e.g. the commonly used information-based metrics and several logistic regression-based metrics) were developed for detecting non-linear dependence between genetic loci, but they are at risk of an inflated false positive rate, in particular when the main effects at one or both loci are salient. In this study, we propose two conditional entropy-based metrics to address this limitation. Extensive simulations demonstrated that, provided the disease is rare, the two proposed metrics consistently maintained the correct false positive rate. In the scenarios for a common disease, the proposed metrics achieved better or comparable control of false positive error compared with four previously proposed model-free metrics. In terms of power, our methods outperformed several competing metrics in a range of common disease models. Furthermore, in real data analyses, both metrics succeeded in detecting interactions and were competitive with the originally reported results or with the logistic regression approaches. In conclusion, the proposed conditional entropy-based metrics are promising alternatives to current model-based approaches for detecting genuine epistatic effects.
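To make the terminology concrete, the following is a minimal sketch of a generic information-based interaction score of the kind the abstract refers to as "commonly used": the information gain I(A,B;D) - I(A;D) - I(B;D), built from empirical conditional entropies. It is not the two metrics proposed in the paper, whose definitions are not given in this record. All function names and the toy genotype data are assumptions made for illustration only.

```python
from collections import Counter
from math import log2


def entropy(values):
    """Empirical Shannon entropy (in bits) of a sequence of hashable values."""
    n = len(values)
    return -sum((c / n) * log2(c / n) for c in Counter(values).values())


def conditional_entropy(target, given):
    """H(target | given) = H(target, given) - H(given)."""
    return entropy(list(zip(target, given))) - entropy(given)


def mutual_information(target, given):
    """I(target; given) = H(target) - H(target | given)."""
    return entropy(target) - conditional_entropy(target, given)


def information_gain(locus_a, locus_b, disease):
    """Generic interaction score: I(A,B;D) - I(A;D) - I(B;D).

    Illustrative only; NOT the conditional entropy-based metrics
    proposed in the paper.
    """
    joint = list(zip(locus_a, locus_b))
    return (mutual_information(disease, joint)
            - mutual_information(disease, locus_a)
            - mutual_information(disease, locus_b))


# Hypothetical toy data: disease status is the XOR of two binary loci,
# i.e. a pure interaction with no main effect at either locus.
locus_a = [0, 0, 1, 1] * 25
locus_b = [0, 1, 0, 1] * 25
disease = [a ^ b for a, b in zip(locus_a, locus_b)]

ig = information_gain(locus_a, locus_b, disease)
print(f"information gain: {ig:.3f} bits")
```

In this toy case neither locus alone carries any information about disease status, so the whole signal is attributed to the interaction. The abstract's concern is the opposite situation: when one or both loci have strong main effects, scores of this kind can be inflated.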

SUBMITTER: Zuo X 

PROVIDER: S-EPMC3858311 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

Publications

To control false positives in gene-gene interaction analysis: two novel conditional entropy-based approaches.

Zuo Xiaoyu, Rao Shaoqi, Fan An, Lin Meihua, Li Haoli, Zhao Xiaolei, Qin Jiheng

PLoS One, 2013-12-10, issue 12


Similar Datasets

S-EPMC3674525 | biostudies-literature
S-EPMC5287229 | biostudies-literature
S-EPMC10516709 | biostudies-literature
S-EPMC2643841 | biostudies-literature
S-EPMC3045793 | biostudies-literature
S-EPMC2747693 | biostudies-literature
S-EPMC6067744 | biostudies-literature
S-EPMC4027514 | biostudies-literature
S-EPMC6773557 | biostudies-literature
S-EPMC7612315 | biostudies-literature