Unknown

Dataset Information

0

Spatial localization of recent ancestors for admixed individuals.


ABSTRACT: Ancestry analysis from genetic data plays a critical role in studies of human disease and evolution. Recent work has introduced explicit models for the geographic distribution of genetic variation and has shown that such explicit models yield superior accuracy in ancestry inference over nonmodel-based methods. Here we extend such work to introduce a method that models admixture between ancestors from multiple sources across a geographic continuum. We devise efficient algorithms based on hidden Markov models to localize on a map the recent ancestors (e.g., grandparents) of admixed individuals, joint with assigning ancestry at each locus in the genome. We validate our methods by using empirical data from individuals with mixed European ancestry from the Population Reference Sample study and show that our approach is able to localize their recent ancestors within an average of 470 km of the reported locations of their grandparents. Furthermore, simulations from real Population Reference Sample genotype data show that our method attains high accuracy in localizing recent ancestors of admixed individuals in Europe (an average of 550 km from their true location for localization of two ancestries in Europe, four generations ago). We explore the limits of ancestry localization under our approach and find that performance decreases as the number of distinct ancestries and generations since admixture increases. Finally, we build a map of expected localization accuracy across admixed individuals according to the location of origin within Europe of their ancestors.

SUBMITTER: Yang WY 

PROVIDER: S-EPMC4267945 | biostudies-literature | 2014 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Spatial localization of recent ancestors for admixed individuals.

Yang Wen-Yun WY   Platt Alexander A   Chiang Charleston Wen-Kai CW   Eskin Eleazar E   Novembre John J   Pasaniuc Bogdan B  

G3 (Bethesda, Md.) 20141103 12


Ancestry analysis from genetic data plays a critical role in studies of human disease and evolution. Recent work has introduced explicit models for the geographic distribution of genetic variation and has shown that such explicit models yield superior accuracy in ancestry inference over nonmodel-based methods. Here we extend such work to introduce a method that models admixture between ancestors from multiple sources across a geographic continuum. We devise efficient algorithms based on hidden M  ...[more]

Similar Datasets

| S-EPMC3030550 | biostudies-other
| S-EPMC3245293 | biostudies-literature
| S-EPMC3990492 | biostudies-literature
| S-EPMC8582322 | biostudies-literature
| S-EPMC8168010 | biostudies-literature
| S-EPMC4737291 | biostudies-other
| S-EPMC9104695 | biostudies-literature
| S-EPMC7365470 | biostudies-literature
| S-EPMC7159207 | biostudies-literature
| S-EPMC7118071 | biostudies-literature