Dataset Information

Genome-wide genetic heterogeneity discovery with categorical covariates.


ABSTRACT:

Motivation

Genetic heterogeneity is the phenomenon that distinct genetic variants may give rise to the same phenotype. The recently introduced algorithm Fast Automatic Interval Search (FAIS) enables the genome-wide search of candidate regions for genetic heterogeneity in the form of any contiguous sequence of variants, and achieves high computational efficiency and statistical power. Although FAIS can test all possible genomic regions for association with a phenotype, a key limitation is its inability to correct for confounders such as gender or population structure, which may lead to numerous false-positive associations.
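
To make "any contiguous sequence of variants" concrete, the following is a minimal Python sketch of the search space only. It assumes binary genotypes and assumes each interval is summarised by a single meta-marker that is 1 when a sample carries at least one variant inside the interval; the function name `interval_meta_markers` is hypothetical. This brute-force enumeration is quadratic in the number of variants and does not reproduce whatever makes FAIS computationally efficient.

```python
from typing import Dict, List, Tuple


def interval_meta_markers(genotypes: List[List[int]]) -> Dict[Tuple[int, int], List[int]]:
    """Enumerate every contiguous interval of variants (hypothetical helper).

    genotypes[i][j] is 1 if sample i carries variant j, else 0 (assumed encoding).
    Returns a dict mapping (start, end), both inclusive, to a per-sample 0/1 list
    that is 1 when the sample carries at least one variant in the interval.
    """
    n_samples = len(genotypes)
    n_variants = len(genotypes[0]) if genotypes else 0
    markers: Dict[Tuple[int, int], List[int]] = {}
    for start in range(n_variants):
        # Running logical OR over the variants seen so far in this interval.
        current = [0] * n_samples
        for end in range(start, n_variants):
            current = [c | row[end] for c, row in zip(current, genotypes)]
            markers[(start, end)] = current
    return markers


if __name__ == "__main__":
    toy = [[1, 0, 0], [0, 0, 1], [0, 0, 0]]
    for interval, marker in interval_meta_markers(toy).items():
        print(interval, marker)
```

In the toy input, no single variant is shared by the first two samples, but the interval covering all three variants marks both of them, which is the kind of signal a search over regions is meant to pick up.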

Results

We propose FastCMH, a method that overcomes this problem by properly accounting for categorical confounders, while still retaining statistical power and computational efficiency. Experiments comparing FastCMH with FAIS and multiple kinds of burden tests on simulated data, as well as on human and Arabidopsis samples, demonstrate that FastCMH can drastically reduce genomic inflation and discover associations that are missed by standard burden tests.
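
The abstract does not spell out the statistical test. As an illustration of how a categorical confounder can be accounted for, here is a minimal Python sketch of the classical Cochran-Mantel-Haenszel (CMH) test, assuming (as the name FastCMH suggests) that this is the underlying statistic. It builds one 2x2 table per covariate category for a binary marker and a binary phenotype, uses no continuity correction, and is not the package's implementation; the function name `cmh_test` is hypothetical.

```python
import math
from typing import Hashable, Sequence, Tuple


def cmh_test(marker: Sequence[int],
             phenotype: Sequence[int],
             covariate: Sequence[Hashable]) -> Tuple[float, float]:
    """Cochran-Mantel-Haenszel test for a binary marker and binary phenotype,
    stratified by a categorical covariate. Returns (statistic, p_value)."""
    # Per stratum: [a, n1, m1, n] = [marker=1 and case, cases, marker=1, samples].
    strata = {}
    for x, y, c in zip(marker, phenotype, covariate):
        cell = strata.setdefault(c, [0, 0, 0, 0])
        cell[0] += x & y
        cell[1] += y
        cell[2] += x
        cell[3] += 1

    deviation = 0.0  # sum over strata of (observed - expected) count
    variance = 0.0   # sum over strata of the hypergeometric variance
    for a, n1, m1, n in strata.values():
        if n < 2:
            continue  # a single-sample stratum carries no information
        deviation += a - n1 * m1 / n
        variance += n1 * (n - n1) * m1 * (n - m1) / (n * n * (n - 1))

    if variance == 0.0:
        return 0.0, 1.0  # no stratum is informative: nothing to test
    statistic = deviation ** 2 / variance
    # Survival function of a chi-squared distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(statistic / 2.0))
    return statistic, p_value


if __name__ == "__main__":
    # Marker and phenotype both track the covariate, but are unrelated within it.
    x = [1, 1, 1, 1, 0, 0, 0, 0]
    y = [1, 1, 1, 0, 0, 0, 0, 1]
    c = ["A", "A", "A", "A", "B", "B", "B", "B"]
    print(cmh_test(x, y, c))
```

In the usage example the marker is constant within each covariate category, so every stratum contributes zero variance and the sketch reports no evidence of association. An unstratified test on the same data would see marker and phenotype co-occurring and could flag a spurious hit, which is the failure mode the abstract attributes to uncorrected confounding.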

Availability and implementation

An R package fastcmh is available on CRAN and the source code can be found at: https://www.bsse.ethz.ch/mlcb/research/bioinformatics-and-computational-biology/fastcmh.html.

Contact

felipe.llinares@bsse.ethz.ch.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Llinares-Lopez F 

PROVIDER: S-EPMC5870548 | biostudies-literature | 2017 Jun

REPOSITORIES: biostudies-literature

Publications

Genome-wide genetic heterogeneity discovery with categorical covariates.

Llinares-López F, Papaxanthos L, Bodenham D, Roqueiro D, Borgwardt K

Bioinformatics (Oxford, England), 2017-06-01, issue 12


Similar Datasets

S-EPMC3714402 | biostudies-literature
S-EPMC2670970 | biostudies-literature
S-EPMC4046680 | biostudies-literature
S-EPMC3910100 | biostudies-literature
S-EPMC2866100 | biostudies-literature
S-EPMC4559912 | biostudies-literature
S-EPMC6239891 | biostudies-literature
S-EPMC5449251 | biostudies-literature
S-EPMC4320269 | biostudies-literature
S-EPMC5320544 | biostudies-literature