Unknown

Dataset Information

0

Examining the effect of linkage disequilibrium between markers on the Type I error rate and power of nonparametric multipoint linkage analysis of two-generation and multigenerational pedigrees in the presence of missing genotype data.


ABSTRACT: Because most multipoint linkage analysis programs currently assume linkage equilibrium between markers when inferring parental haplotypes, ignoring linkage disequilibrium (LD) may inflate the Type I error rate. We investigated the effect of LD on the Type I error rate and power of nonparametric multipoint linkage analysis of two-generation and multigenerational multiplex families. Using genome-wide single nucleotide polymorphism (SNP) data from the Collaborative Study of the Genetics of Alcoholism, we modified the original data set into 30 total data sets in order to consider six different patterns of missing data for five different levels of SNP density. To assess power, we designed simulated traits based on existing marker genotypes. For the Type I error rate, we simulated 1,000 qualitative traits from random distributions, unlinked to any of the marker data. Overall, the different levels of SNP density examined here had only small effects on power (except sibpair data). Missing data had a substantial effect on power, with more completely genotyped pedigrees yielding the highest power (except sibpair data). Most of the missing data patterns did not cause large increases in the Type I error rate if the SNP markers were more than 0.3 cM apart. However, in a dense 0.25-cM map, removing genotypes on founders and/or founders and parents in the middle generation caused substantial inflation of the Type I error rate, which corresponded to the increasing proportion of persons with missing data. Results also showed that long high-LD blocks have severe effects on Type I error rates.

SUBMITTER: Kim Y 

PROVIDER: S-EPMC2216429 | biostudies-literature | 2008 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Examining the effect of linkage disequilibrium between markers on the Type I error rate and power of nonparametric multipoint linkage analysis of two-generation and multigenerational pedigrees in the presence of missing genotype data.

Kim Yoonhee Y   Duggal Priya P   Gillanders Elizabeth M EM   Kim Ho H   Bailey-Wilson Joan E JE  

Genetic epidemiology 20080101 1


Because most multipoint linkage analysis programs currently assume linkage equilibrium between markers when inferring parental haplotypes, ignoring linkage disequilibrium (LD) may inflate the Type I error rate. We investigated the effect of LD on the Type I error rate and power of nonparametric multipoint linkage analysis of two-generation and multigenerational multiplex families. Using genome-wide single nucleotide polymorphism (SNP) data from the Collaborative Study of the Genetics of Alcoholi  ...[more]

Similar Datasets

| S-EPMC1785316 | biostudies-literature
| S-EPMC263833 | biostudies-other
| S-EPMC2997478 | biostudies-literature
| S-EPMC1378058 | biostudies-literature
| S-EPMC4879118 | biostudies-literature
| S-EPMC1235540 | biostudies-literature
| S-EPMC1866695 | biostudies-literature
| S-EPMC1852728 | biostudies-literature
| S-EPMC4143626 | biostudies-literature
| S-EPMC4012494 | biostudies-literature