Unknown

Dataset Information

0

Genotypic discrepancies arising from imputation.


ABSTRACT: The ideal genetic analysis of family data would include whole genome sequence on all family members. A strategy of combining sequence data from a subset of key individuals with inexpensive, genome-wide association study (GWAS) chip genotypes on all individuals to infer sequence level genotypes throughout the families has been suggested as a highly accurate alternative. This strategy was followed by the Genetic Analysis Workshop 18 data providers. We examined the quality of the imputation to identify potential consequences of this strategy by comparing discrepancies between GWAS genotype calls and imputed calls for the same variants. Overall, the inference and imputation process worked very well. However, we find that discrepancies occurred at an increased rate when imputation was used to infer missing data in sequenced individuals. Although this may be an artifact of this particular instantiation of these analytic methods, there may be general genetic or algorithmic reasons to avoid trying to fill in missing sequence data. This is especially true given the risk of false positives and reduction in power for family-based transmission tests when founders are incorrectly imputed as heterozygotes. Finally, we note a higher rate of discrepancies when unsequenced individuals are inferred using sequenced individuals from other pedigrees drawn from the same admixed population.

SUBMITTER: Hinrichs AL 

PROVIDER: S-EPMC4143754 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature

altmetric image

Publications

Genotypic discrepancies arising from imputation.

Hinrichs Anthony L AL   Culverhouse Robert C RC   Suarez Brian K BK  

BMC proceedings 20140617 Suppl 1 Genetic Analysis Workshop 18Vanessa Olmo


The ideal genetic analysis of family data would include whole genome sequence on all family members. A strategy of combining sequence data from a subset of key individuals with inexpensive, genome-wide association study (GWAS) chip genotypes on all individuals to infer sequence level genotypes throughout the families has been suggested as a highly accurate alternative. This strategy was followed by the Genetic Analysis Workshop 18 data providers. We examined the quality of the imputation to iden  ...[more]

Similar Datasets

| S-EPMC2795971 | biostudies-literature
| S-EPMC10103531 | biostudies-literature
| S-EPMC8138701 | biostudies-literature
| S-EPMC10460491 | biostudies-literature
| S-EPMC3126435 | biostudies-literature
| PRJEB61632 | ENA
| PRJNA75845 | ENA
| S-EPMC5660667 | biostudies-literature
| S-EPMC7062756 | biostudies-literature
| S-EPMC9167364 | biostudies-literature