Dataset Information

Inference of Population Structure from Time-Series Genotype Data.


ABSTRACT: Sequencing ancient DNA can offer direct probing of population history. Yet, such data are commonly analyzed with standard tools that assume DNA samples are all contemporary. We present DyStruct, a model and inference algorithm for inferring shared ancestry from temporally sampled genotype data. DyStruct explicitly incorporates temporal dynamics by modeling individuals as mixtures of unobserved populations whose allele frequencies drift over time. We develop an efficient inference algorithm for our model using stochastic variational inference. On simulated data, we show that DyStruct outperforms the current state of the art when individuals are sampled over time. Using a dataset of 296 modern and 80 ancient samples, we demonstrate DyStruct is able to capture a well-supported admixture event of steppe ancestry into modern Europe. We further apply DyStruct to a genome-wide dataset of 2,067 modern and 262 ancient samples used to study the origin of farming in the Near East. We show that DyStruct provides new insight into population history when compared with alternate approaches, within feasible run time.
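The modeling idea in the abstract (individuals as mixtures of unobserved populations whose allele frequencies drift over time) can be illustrated with a small simulation. This is a minimal sketch, not DyStruct's implementation: the Gaussian approximation to drift, the clipping to [0, 1], the binomial genotype draw, and every name and parameter value (`drift_step`, `simulate_frequencies`, `sample_genotype`, `pop_size`, the mixture weights) are assumptions made for illustration.

```python
import random


def drift_step(freq, pop_size, rng):
    """Advance one generation of drift (assumed Gaussian approximation)."""
    # Standard deviation of the allele-frequency change in a diploid
    # population of size pop_size; it is zero once the allele is fixed or lost.
    sd = (freq * (1.0 - freq) / (2.0 * pop_size)) ** 0.5
    return min(1.0, max(0.0, rng.gauss(freq, sd)))


def simulate_frequencies(initial, pop_size, generations, rng):
    """Return trajectory[t][k]: frequency in population k at generation t."""
    trajectory = [list(initial)]
    for _ in range(generations):
        trajectory.append([drift_step(f, pop_size, rng) for f in trajectory[-1]])
    return trajectory


def sample_genotype(mixture, freqs, rng):
    """Draw a diploid genotype (0, 1, or 2 allele copies) for one individual."""
    # The individual's allele probability is the mixture-weighted average of
    # the population frequencies at the generation the individual was sampled.
    p = sum(w * f for w, f in zip(mixture, freqs))
    return sum(1 for _ in range(2) if rng.random() < p)


rng = random.Random(0)
trajectory = simulate_frequencies([0.2, 0.8], pop_size=1000, generations=50, rng=rng)

# A hypothetical early sample drawn mostly from population 0, and a later,
# evenly admixed sample drawn after 40 more generations of drift.
ancient = sample_genotype([0.9, 0.1], trajectory[10], rng)
modern = sample_genotype([0.5, 0.5], trajectory[50], rng)
```

The point of the sketch is that the two samples are generated from different frequency vectors (`trajectory[10]` versus `trajectory[50]`), which is the temporal information that tools assuming contemporary samples ignore.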

SUBMITTER: Joseph TA 

PROVIDER: S-EPMC6698887 | biostudies-literature | 2019 Aug

REPOSITORIES: biostudies-literature

Publications

Inference of Population Structure from Time-Series Genotype Data.

Joseph TA, Pe'er I

American Journal of Human Genetics, 2019-06-27, issue 2



Similar Datasets

| S-EPMC3114728 | biostudies-literature
| S-EPMC10237648 | biostudies-literature
| S-EPMC8796374 | biostudies-literature
| S-EPMC3266881 | biostudies-literature
| S-EPMC5860468 | biostudies-other
| S-EPMC7505465 | biostudies-literature
| S-EPMC6836738 | biostudies-literature
| S-EPMC5896118 | biostudies-literature
| S-EPMC4295839 | biostudies-literature
| S-EPMC5704982 | biostudies-literature