Dataset Information

Estimating linkage disequilibrium from genotypes under Hardy-Weinberg equilibrium.


ABSTRACT:

Background

Measures of linkage disequilibrium (LD) play a key role in a wide range of applications, from disease association to demographic history estimation. The true population LD cannot be measured directly; it can only be inferred from genetic samples, which are unavoidably subject to measurement error. Previous results on r² (a measure of LD), such as its bias due to finite sample size and its variance, were derived for the special case in which the true population LD is zero. These results generally do not hold for non-zero population LD values, which are more common in real genetic data.
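
As an illustration of the quantity discussed above, r² for two biallelic loci is conventionally defined as D²/(pA(1 − pA)pB(1 − pB)), where D = pAB − pA·pB. The sketch below computes it from known haplotype and allele frequencies. The function and argument names are illustrative assumptions and do not come from the paper.

```python
def r_squared(p_ab, p_a, p_b):
    """Return r^2 between two biallelic loci.

    p_ab : frequency of the haplotype carrying allele A and allele B
    p_a  : frequency of allele A at the first locus
    p_b  : frequency of allele B at the second locus
    (Hypothetical names, chosen for this example only.)
    """
    if not (0.0 < p_a < 1.0 and 0.0 < p_b < 1.0):
        # r^2 is undefined when either locus is monomorphic.
        raise ValueError("both loci must be polymorphic")
    d = p_ab - p_a * p_b  # coefficient of linkage disequilibrium D
    return d * d / (p_a * (1.0 - p_a) * p_b * (1.0 - p_b))


if __name__ == "__main__":
    # Complete association: allele A always travels with allele B.
    print(r_squared(0.5, 0.5, 0.5))
    # Independence: haplotype frequency equals the product of allele frequencies.
    print(r_squared(0.25, 0.5, 0.5))
```

In practice the frequencies are themselves estimated from a finite sample, which is the source of the bias and variance the abstract refers to.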

Results

This work generalises the estimation of r² to all levels of LD, for both phased and unphased data. First, we provide new formulae for the effect of finite sample size on the observed r² values. Second, we find a new empirical formula for the variance of the observed r², equal to 2E[r²](1 − E[r²])/n, where n is the diploid sample size. Third, we propose a new routine, Constrained ML, a likelihood-based method that directly estimates haplotype frequencies and r² from diploid genotypes under Hardy-Weinberg equilibrium. While serving the same purpose as the pre-existing Expectation-Maximisation algorithm, the new routine can have better convergence and is simpler to use. A new likelihood-ratio test is also introduced to test for the absence of a particular haplotype. Extensive simulations are run to support these findings.
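
The two quantities mentioned above can be sketched in a few lines. The first function evaluates the empirical variance formula exactly as stated in the abstract. The second is a minimal version of the classic Expectation-Maximisation algorithm for two biallelic loci, which is the pre-existing method the abstract compares against; it is not the authors' Constrained ML routine, whose details are not given here. The 3×3 count layout, the starting values, and all names are assumptions made for this example.

```python
def r2_variance(expected_r2, n):
    """Empirical variance of observed r^2: 2 * E[r2] * (1 - E[r2]) / n.

    n is the diploid sample size, as in the abstract.
    """
    if n <= 0:
        raise ValueError("n must be positive")
    return 2.0 * expected_r2 * (1.0 - expected_r2) / n


def em_haplotype_freqs(counts, tol=1e-10, max_iter=1000):
    """Classic EM estimate of haplotype frequencies (AB, Ab, aB, ab).

    counts[i][j] = number of individuals carrying i copies of allele A
    at locus 1 and j copies of allele B at locus 2 (assumed layout).
    Assumes Hardy-Weinberg equilibrium, so only double heterozygotes
    (counts[1][1]) have ambiguous phase.
    """
    n = sum(sum(row) for row in counts)
    if n == 0:
        raise ValueError("no individuals in the table")
    # Haplotype counts that can be read off unambiguously.
    c_AB = 2 * counts[2][2] + counts[2][1] + counts[1][2]
    c_Ab = 2 * counts[2][0] + counts[2][1] + counts[1][0]
    c_aB = 2 * counts[0][2] + counts[1][2] + counts[0][1]
    c_ab = 2 * counts[0][0] + counts[1][0] + counts[0][1]
    n_dh = counts[1][1]  # double heterozygotes
    p = [0.25, 0.25, 0.25, 0.25]  # arbitrary starting point (assumption)
    for _ in range(max_iter):
        # E-step: probability that a double heterozygote is AB/ab
        # rather than Ab/aB, given the current frequencies.
        cis, trans = p[0] * p[3], p[1] * p[2]
        w = cis / (cis + trans) if cis + trans > 0 else 0.5
        # M-step: re-count haplotypes with the expected phase split.
        new = [
            (c_AB + w * n_dh) / (2 * n),
            (c_Ab + (1 - w) * n_dh) / (2 * n),
            (c_aB + (1 - w) * n_dh) / (2 * n),
            (c_ab + w * n_dh) / (2 * n),
        ]
        converged = max(abs(a - b) for a, b in zip(new, p)) < tol
        p = new
        if converged:
            break
    return p


if __name__ == "__main__":
    table = [[10, 0, 0], [0, 20, 0], [0, 0, 10]]  # made-up counts
    print(em_haplotype_freqs(table))
    print(r2_variance(0.5, 100))
```

EM results can depend on the starting values, which is one reason a direct likelihood-based alternative may be attractive. The estimated frequencies can then be converted into r² using the standard definition.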

Conclusion

Most inferences on LD will benefit from our new findings, from point and interval estimation to hypothesis testing. Genetic analyses utilising r² information will become more accurate as a result.

SUBMITTER: Hui TJ 

PROVIDER: S-EPMC7045472 | biostudies-literature | 2020 Feb

REPOSITORIES: biostudies-literature

Publications

Estimating linkage disequilibrium from genotypes under Hardy-Weinberg equilibrium.

Tin-Yu J. Hui, Austin Burt

BMC Genetics, 2020-02-26, issue 1


Similar Datasets

| S-EPMC4141651 | biostudies-literature
| S-EPMC1698709 | biostudies-other
| S-EPMC4357990 | biostudies-literature
| S-EPMC4138174 | biostudies-literature
| S-EPMC2986695 | biostudies-literature
| S-EPMC5671567 | biostudies-literature
| S-EPMC9250657 | biostudies-literature
| S-EPMC6827367 | biostudies-literature
| S-EPMC4725899 | biostudies-other
| S-EPMC7083100 | biostudies-literature