
Dataset Information


Inclusion of Population-specific Reference Panel from India to the 1000 Genomes Phase 3 Panel Improves Imputation Accuracy.


ABSTRACT: Imputation is a computational method, based on the principle of haplotype sharing, that allows enrichment of genome-wide association study datasets. It depends on the haplotype structure of the population and the density of the genotype data. The 1000 Genomes Project led to the generation of imputation reference panels which have been used globally. However, recent studies have shown that population-specific panels provide better enrichment of genome-wide variants. We compared imputation accuracy using the 1000 Genomes phase 3 reference panel and a panel generated from genome-wide data on 407 individuals from Western India (WIP). The concordance of imputed variants was cross-checked with next-generation re-sequencing data on a subset of genomic regions. Further, using genome-wide data from 1880 individuals, we demonstrate that WIP performs better than the 1000 Genomes phase 3 panel and, when merged with it, significantly improves imputation accuracy across the minor allele frequency range. We also show that imputation using only the South Asian component of the 1000 Genomes phase 3 panel performs as well as the merged panel, making it a computationally less intensive option. Thus, our study stresses that imputation accuracy using the 1000 Genomes phase 3 panel can be further improved by including population-specific reference panels from South Asia.
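The abstract mentions checking the concordance of imputed variants against re-sequencing data and comparing accuracy across the minor allele frequency (MAF) range. The record does not give the exact metric or binning the authors used, so the following Python is only a minimal sketch of one common way to do such a comparison. The function names, the dosage encoding (0, 1, or 2 copies of the alternate allele), and the bin edges are all assumptions made for illustration.

```python
from collections import defaultdict


def minor_allele_frequency(dosages):
    """Return the MAF for one variant from a list of 0/1/2 genotype dosages."""
    alt_freq = sum(dosages) / (2 * len(dosages))
    return min(alt_freq, 1 - alt_freq)


def concordance_by_maf_bin(imputed, truth, bin_edges=(0.01, 0.05, 0.5)):
    """Fraction of imputed genotype calls that match the truth set, per MAF bin.

    imputed, truth: dicts mapping variant ID -> list of dosages (same sample order).
    bin_edges: upper bounds of the MAF bins (hypothetical choices, not from the paper).
    """
    matches = defaultdict(int)
    totals = defaultdict(int)
    for variant, true_calls in truth.items():
        if variant not in imputed:
            continue  # only score variants present in both sets
        maf = minor_allele_frequency(true_calls)
        # Assign the variant to the first bin whose upper bound covers its MAF.
        upper = next((edge for edge in bin_edges if maf <= edge), bin_edges[-1])
        for imputed_call, true_call in zip(imputed[variant], true_calls):
            totals[upper] += 1
            matches[upper] += int(imputed_call == true_call)
    return {edge: matches[edge] / totals[edge] for edge in totals}


# Toy data for illustration only; these are not results from the study.
truth = {"rs1": [0, 0, 0, 1], "rs2": [0, 1, 2, 1]}
imputed = {"rs1": [0, 0, 0, 0], "rs2": [0, 1, 2, 1]}
result = concordance_by_maf_bin(imputed, truth, bin_edges=(0.2, 0.5))
```

Running the same function once per reference panel (for example WIP alone, 1000 Genomes phase 3 alone, and the merged panel) would give per-bin values that can be compared side by side. Published comparisons often also report a squared correlation between imputed dosage and true genotype, which this sketch does not compute.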

SUBMITTER: Ahmad M 

PROVIDER: S-EPMC5532257 | biostudies-literature | 2017 Jul

REPOSITORIES: biostudies-literature


Publications

Inclusion of Population-specific Reference Panel from India to the 1000 Genomes Phase 3 Panel Improves Imputation Accuracy.

Ahmad M, Sinha A, Ghosh S, Kumar V, Davila S, Yajnik CS, Chandak GR

Scientific Reports, 2017-07-27, issue 1



Similar Datasets

| S-EPMC4686825 | biostudies-literature
| S-EPMC4580532 | biostudies-literature
| S-EPMC5177868 | biostudies-literature
| S-EPMC6805399 | biostudies-other
2015-07-01 | GSE70188 | GEO
| S-EPMC6933688 | biostudies-literature
| S-EPMC4338501 | biostudies-literature
| S-EPMC5520064 | biostudies-literature
| S-EPMC3703942 | biostudies-literature
2015-07-01 | E-GEOD-70188 | biostudies-arrayexpress