Unknown

Dataset Information

0

A forest-based approach to identifying gene and gene gene interactions.


ABSTRACT: Multiple genes, gene-by-gene interactions, and gene-by-environment interactions are believed to underlie most complex diseases. However, such interactions are difficult to identify. Although there have been recent successes in identifying genetic variants for complex diseases, it still remains difficult to identify gene-gene and gene-environment interactions. To overcome this difficulty, we propose a forest-based approach and a concept of variable importance. The proposed approach is demonstrated by simulation study for its validity and illustrated by a real data analysis for its use. Analyses of both real data and simulated data based on published genetic models show the effectiveness of our approach. For example, our analysis of a published data set on age-related macular degeneration (AMD) not only confirmed a known genetic variant (P value = 2E-6) for AMD, but also revealed an unreported haplotype surrounding single-nucleotide polymorphism (SNP) rs10272438 on chromosome 7 that was significantly associated with AMD (P value = 0.0024). These significance levels are obtained after the consideration for a large number of SNPs. Thus, the importance of this work is twofold: it proposes a powerful and flexible method to identify high-risk haplotypes and their interactions and reveals a potentially protective variant for AMD.

SUBMITTER: Chen X 

PROVIDER: S-EPMC2148267 | biostudies-literature | 2007 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

A forest-based approach to identifying gene and gene gene interactions.

Chen Xiang X   Liu Ching-Ti CT   Zhang Meizhuo M   Zhang Heping H  

Proceedings of the National Academy of Sciences of the United States of America 20071128 49


Multiple genes, gene-by-gene interactions, and gene-by-environment interactions are believed to underlie most complex diseases. However, such interactions are difficult to identify. Although there have been recent successes in identifying genetic variants for complex diseases, it still remains difficult to identify gene-gene and gene-environment interactions. To overcome this difficulty, we propose a forest-based approach and a concept of variable importance. The proposed approach is demonstrate  ...[more]

Similar Datasets

| S-EPMC3715501 | biostudies-literature
| S-EPMC2799940 | biostudies-literature
| S-EPMC4356212 | biostudies-literature
| S-EPMC4822295 | biostudies-literature
| S-EPMC6013347 | biostudies-literature
| S-EPMC6808442 | biostudies-literature
| S-EPMC5771864 | biostudies-literature
| S-EPMC6533537 | biostudies-literature
| S-EPMC3203928 | biostudies-literature
| S-EPMC10828680 | biostudies-literature