Dataset Information

Estimating effect sizes in genome-wide association studies.


ABSTRACT: Knowledge of the proportion of markers without effects (p(0)) and of the effect sizes in large-scale genetic studies is important for understanding the basic properties of the data and for applications such as controlling false discoveries and designing adequately powered replication studies. Many p(0) estimators have been proposed. However, high-dimensional data sets typically comprise a wide range of effect sizes, and it is unclear whether an estimated p(0) relates to the whole range, including markers with very small effects, or only to the markers with large effects. In this article we develop an estimation procedure that can be used in all scenarios where the distribution of the test statistic under the alternative can be characterized by a single parameter (e.g. the non-centrality parameter of the non-central chi-square or F distribution). The procedure first estimates the largest effect in the data set, then the second largest, then the third largest, and so on. It stops when the effect sizes become so small that they can no longer be estimated precisely for the given sample size. Once the individual effect sizes are estimated, they can be used to calculate an interpretable estimate of p(0). Thus, by repeatedly estimating a single parameter, our method yields both an interpretable estimate of p(0) and estimates of the effect sizes present in the whole marker set. Simulations suggest that the effects are estimated precisely, with only a small upward bias. The R code that computes the effect estimates is freely downloadable from the website: http://www.people.vcu.edu/~jbukszar/.
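
The abstract does not say how the estimated effects are converted into p(0). As a rough illustration only, and not the authors' procedure, the Python sketch below assumes p(0) is taken as one minus the share of markers counted as having an effect. The function name `p0_from_effects`, the optional `min_effect` cut-off, and all numbers are hypothetical. It shows why such an estimate is interpretable: the effect sizes it refers to are stated explicitly.

```python
def p0_from_effects(effect_estimates, n_markers, min_effect=0.0):
    """Share of markers NOT counted as having an effect >= min_effect.

    Assumption (not taken from the paper): p(0) is computed as
    1 - (number of counted effects) / (total number of markers).
    """
    if n_markers <= 0:
        raise ValueError("n_markers must be positive")
    # Count only the estimated effects at or above the chosen cut-off.
    n_effects = sum(1 for e in effect_estimates if e >= min_effect)
    if n_effects > n_markers:
        raise ValueError("more effects than markers")
    return 1.0 - n_effects / n_markers


# Hypothetical effect estimates (e.g. non-centrality parameters),
# ordered from largest to smallest as in the sequential procedure.
estimates = [30.0, 22.5, 15.0, 9.0, 6.5]

# p(0) relative to every effect that could be estimated.
print(p0_from_effects(estimates, n_markers=1000))

# p(0) relative to the larger effects only.
print(p0_from_effects(estimates, n_markers=1000, min_effect=10.0))
```

Raising `min_effect` counts fewer markers as having effects, so the resulting p(0) is larger. This is the ambiguity the abstract describes for estimators that do not state which effect sizes they cover.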

SUBMITTER: Bukszar J 

PROVIDER: S-EPMC3923086 | biostudies-literature | 2010 May

REPOSITORIES: biostudies-literature

Publications

Estimating effect sizes in genome-wide association studies.

Bukszár József, van den Oord Edwin J C G

Behavior Genetics, 2010-01-06, issue 3


Similar Datasets

S-EPMC5330672 | biostudies-literature
S-EPMC3059431 | biostudies-literature
S-EPMC4489336 | biostudies-other
S-EPMC4321952 | biostudies-literature
S-EPMC8237646 | biostudies-literature
S-EPMC4291231 | biostudies-literature
S-EPMC6723621 | biostudies-literature
S-EPMC4417323 | biostudies-literature
S-EPMC5007749 | biostudies-other
S-EPMC2984437 | biostudies-literature