Unknown

Dataset Information

0

Multiple linear combination (MLC) regression tests for common variants adapted to linkage disequilibrium structure.


ABSTRACT: By jointly analyzing multiple variants within a gene, instead of one at a time, gene-based multiple regression can improve power, robustness, and interpretation in genetic association analysis. We investigate multiple linear combination (MLC) test statistics for analysis of common variants under realistic trait models with linkage disequilibrium (LD) based on HapMap Asian haplotypes. MLC is a directional test that exploits LD structure in a gene to construct clusters of closely correlated variants recoded such that the majority of pairwise correlations are positive. It combines variant effects within the same cluster linearly, and aggregates cluster-specific effects in a quadratic sum of squares and cross-products, producing a test statistic with reduced degrees of freedom (df) equal to the number of clusters. By simulation studies of 1000 genes from across the genome, we demonstrate that MLC is a well-powered and robust choice among existing methods across a broad range of gene structures. Compared to minimum P-value, variance-component, and principal-component methods, the mean power of MLC is never much lower than that of other methods, and can be higher, particularly with multiple causal variants. Moreover, the variation in gene-specific MLC test size and power across 1000 genes is less than that of other methods, suggesting it is a complementary approach for discovery in genome-wide analysis. The cluster construction of the MLC test statistics helps reveal within-gene LD structure, allowing interpretation of clustered variants as haplotypic effects, while multiple regression helps to distinguish direct and indirect associations.

SUBMITTER: Yoo YJ 

PROVIDER: S-EPMC5245123 | biostudies-literature | 2017 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Multiple linear combination (MLC) regression tests for common variants adapted to linkage disequilibrium structure.

Yoo Yun Joo YJ   Sun Lei L   Poirier Julia G JG   Paterson Andrew D AD   Bull Shelley B SB  

Genetic epidemiology 20161125 2


By jointly analyzing multiple variants within a gene, instead of one at a time, gene-based multiple regression can improve power, robustness, and interpretation in genetic association analysis. We investigate multiple linear combination (MLC) test statistics for analysis of common variants under realistic trait models with linkage disequilibrium (LD) based on HapMap Asian haplotypes. MLC is a directional test that exploits LD structure in a gene to construct clusters of closely correlated varian  ...[more]

Similar Datasets

| S-EPMC2732754 | biostudies-literature
| S-EPMC2427310 | biostudies-literature
| S-EPMC6684625 | biostudies-literature
| S-EPMC4655331 | biostudies-literature
| S-EPMC1457009 | biostudies-literature
| S-EPMC4172344 | biostudies-literature
| S-EPMC7236654 | biostudies-literature
| S-EPMC3875850 | biostudies-literature
| S-EPMC3708889 | biostudies-literature
| S-EPMC5993419 | biostudies-literature