Unknown

Dataset Information

0

Characterizing linkage disequilibrium and evaluating imputation power of human genomic insertion-deletion polymorphisms.


ABSTRACT:

Background

Indels are an important cause of human variation and central to the study of human disease. The 1000 Genomes Project Low-Coverage Pilot identified over 1.3 million indels shorter than 50 bp, of which over 890 were identified as potentially disruptive variants. Yet, despite their ubiquity, the local genomic characteristics of indels remain unexplored.

Results

Herein we describe population- and minor allele frequency-based differences in linkage disequilibrium and imputation characteristics for indels included in the 1000 Genomes Project Low-Coverage Pilot for the CEU, YRI and CHB+JPT populations. Common indels were well tagged by nearby SNPs in all studied populations, and were also tagged at a similar rate to common SNPs. Both neutral and functionally deleterious common indels were imputed with greater than 95% concordance from HapMap Phase 3 and OMNI SNP sites. Further, 38 to 56% of low frequency indels were tagged by low frequency SNPs. We were able to impute heterozygous low frequency indels with over 50% concordance. Lastly, our analysis also revealed evidence of ascertainment bias. This bias prevents us from extending the applicability of our results to highly polymorphic indels that could not be identified in the Low-Coverage Pilot.

Conclusions

Although further scope exists to improve the imputation of low frequency indels, our study demonstrates that there are already ample opportunities to retrospectively impute indels for prior genome-wide association studies and to incorporate indel imputation into future case/control studies.

SUBMITTER: Lu JT 

PROVIDER: S-EPMC3334570 | biostudies-literature | 2012 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Characterizing linkage disequilibrium and evaluating imputation power of human genomic insertion-deletion polymorphisms.

Lu James T JT   Wang Yi Y   Gibbs Richard A RA   Yu Fuli F  

Genome biology 20120229 2


<h4>Background</h4>Indels are an important cause of human variation and central to the study of human disease. The 1000 Genomes Project Low-Coverage Pilot identified over 1.3 million indels shorter than 50 bp, of which over 890 were identified as potentially disruptive variants. Yet, despite their ubiquity, the local genomic characteristics of indels remain unexplored.<h4>Results</h4>Herein we describe population- and minor allele frequency-based differences in linkage disequilibrium and imputat  ...[more]

Similar Datasets

| S-EPMC2997478 | biostudies-literature
| S-EPMC3909073 | biostudies-literature
| S-EPMC2013689 | biostudies-literature
| S-EPMC1852728 | biostudies-literature
| S-EPMC2672168 | biostudies-literature
| S-EPMC4012494 | biostudies-literature
| S-EPMC1785334 | biostudies-literature
| S-EPMC3096569 | biostudies-literature
| S-EPMC4493124 | biostudies-literature
| S-EPMC1915086 | biostudies-other