Unknown

Dataset Information

0

A population-specific reference panel empowers genetic studies of Anabaptist populations.


ABSTRACT: Genotype imputation is a powerful strategy for achieving the large sample sizes required for identification of variants underlying complex phenotypes, but imputation of rare variants remains problematic. Genetically isolated populations offer one solution, however population-specific reference panels are needed to assure optimal imputation accuracy and allele frequency estimation. Here we report the Anabaptist Genome Reference Panel (AGRP), the first whole-genome catalogue of variants and phased haplotypes in people of Amish and Mennonite ancestry. Based on high-depth whole-genome sequence (WGS) from 265 individuals, the AGRP contains >12?M high-confidence single nucleotide variants and short indels, of which ~12.5% are novel. These Anabaptist-specific variants were more deleterious than variants with comparable frequencies observed in the 1000 Genomes panel. About 43,000 variants showed enriched allele frequencies in AGRP, consistent with drift. When combined with the 1000 Genomes Project reference panel, the AGRP substantially improved imputation, especially for rarer variants. The AGRP is freely available to researchers through an imputation server.

SUBMITTER: Hou L 

PROVIDER: S-EPMC5519631 | biostudies-literature | 2017 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications


Genotype imputation is a powerful strategy for achieving the large sample sizes required for identification of variants underlying complex phenotypes, but imputation of rare variants remains problematic. Genetically isolated populations offer one solution, however population-specific reference panels are needed to assure optimal imputation accuracy and allele frequency estimation. Here we report the Anabaptist Genome Reference Panel (AGRP), the first whole-genome catalogue of variants and phased  ...[more]

Similar Datasets

| S-EPMC8571350 | biostudies-literature
2020-12-10 | GSE126018 | GEO
| S-EPMC3683990 | biostudies-literature
| S-EPMC5532257 | biostudies-literature
2020-02-28 | GSE117850 | GEO
| S-EPMC4631106 | biostudies-literature
| PRJEB20550 | ENA
| S-EPMC7671345 | biostudies-literature
| S-EPMC6685638 | biostudies-literature
| S-EPMC10593948 | biostudies-literature