Unknown

Dataset Information

0

Illustrating, Quantifying, and Correcting for Bias in Post-hoc Analysis of Gene-Based Rare Variant Tests of Association.


ABSTRACT: To date, gene-based rare variant testing approaches have focused on aggregating information across sets of variants to maximize statistical power in identifying genes showing significant association with diseases. Beyond identifying genes that are associated with diseases, the identification of causal variant(s) in those genes and estimation of their effect is crucial for planning replication studies and characterizing the genetic architecture of the locus. However, we illustrate that straightforward single-marker association statistics can suffer from substantial bias introduced by conditioning on gene-based test significance, due to the phenomenon often referred to as "winner's curse." We illustrate the ramifications of this bias on variant effect size estimation and variant prioritization/ranking approaches, outline parameters of genetic architecture that affect this bias, and propose a bootstrap resampling method to correct for this bias. We find that our correction method significantly reduces the bias due to winner's curse (average two-fold decrease in bias, p < 2.2 × 10-6) and, consequently, substantially improves mean squared error and variant prioritization/ranking. The method is particularly helpful in adjustment for winner's curse effects when the initial gene-based test has low power and for relatively more common, non-causal variants. Adjustment for winner's curse is recommended for all post-hoc estimation and ranking of variants after a gene-based test. Further work is necessary to continue seeking ways to reduce bias and improve inference in post-hoc analysis of gene-based tests under a wide variety of genetic architectures.

SUBMITTER: Grinde KE 

PROVIDER: S-EPMC5603735 | biostudies-literature | 2017

REPOSITORIES: biostudies-literature

altmetric image

Publications

Illustrating, Quantifying, and Correcting for Bias in Post-hoc Analysis of Gene-Based Rare Variant Tests of Association.

Grinde Kelsey E KE   Arbet Jaron J   Green Alden A   O'Connell Michael M   Valcarcel Alessandra A   Westra Jason J   Tintle Nathan N  

Frontiers in genetics 20170914


To date, gene-based rare variant testing approaches have focused on aggregating information across sets of variants to maximize statistical power in identifying genes showing significant association with diseases. Beyond identifying genes that are associated with diseases, the identification of causal variant(s) in those genes and estimation of their effect is crucial for planning replication studies and characterizing the genetic architecture of the locus. However, we illustrate that straightfo  ...[more]

Similar Datasets

| S-EPMC4085641 | biostudies-literature
| S-EPMC4127117 | biostudies-literature
| S-EPMC3718063 | biostudies-literature
| S-EPMC4121482 | biostudies-literature
| S-EPMC3440237 | biostudies-literature
| S-EPMC3939031 | biostudies-literature
| S-EPMC4968883 | biostudies-literature
| S-EPMC3420665 | biostudies-literature
| S-EPMC3701690 | biostudies-literature
| S-EPMC6329454 | biostudies-literature