Unknown

Dataset Information

0

Robust and scalable inference of population history from hundreds of unphased whole genomes.


ABSTRACT: It has recently been demonstrated that inference methods based on genealogical processes with recombination can uncover past population history in unprecedented detail. However, these methods scale poorly with sample size, limiting resolution in the recent past, and they require phased genomes, which contain switch errors that can catastrophically distort the inferred history. Here we present SMC++, a new statistical tool capable of analyzing orders of magnitude more samples than existing methods while requiring only unphased genomes (its results are independent of phasing). SMC++ can jointly infer population size histories and split times in diverged populations, and it employs a novel spline regularization scheme that greatly reduces estimation error. We apply SMC++ to analyze sequence data from over a thousand human genomes in Africa and Eurasia, hundreds of genomes from a Drosophila melanogaster population in Africa, and tens of genomes from zebra finch and long-tailed finch populations in Australia.

SUBMITTER: Terhorst J 

PROVIDER: S-EPMC5470542 | biostudies-literature | 2017 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Robust and scalable inference of population history from hundreds of unphased whole genomes.

Terhorst Jonathan J   Kamm John A JA   Song Yun S YS  

Nature genetics 20161226 2


It has recently been demonstrated that inference methods based on genealogical processes with recombination can uncover past population history in unprecedented detail. However, these methods scale poorly with sample size, limiting resolution in the recent past, and they require phased genomes, which contain switch errors that can catastrophically distort the inferred history. Here we present SMC++, a new statistical tool capable of analyzing orders of magnitude more samples than existing method  ...[more]

Similar Datasets

| S-EPMC7730797 | biostudies-literature
| S-EPMC3154645 | biostudies-literature
| S-EPMC4285295 | biostudies-literature
| S-EPMC4251999 | biostudies-literature
| S-EPMC10311346 | biostudies-literature
| S-EPMC4326465 | biostudies-literature
| S-EPMC3872189 | biostudies-other
| S-EPMC7303978 | biostudies-literature
| S-EPMC3189801 | biostudies-literature
| S-EPMC5243798 | biostudies-literature