Dataset Information

Measuring Genetic Differentiation from Pool-seq Data.


ABSTRACT: The advent of high-throughput sequencing and genotyping technologies enables the comparison of patterns of polymorphisms at a very large number of markers. While the characterization of genetic structure from individual sequencing data remains expensive for many nonmodel species, it has been shown that sequencing pools of individual DNAs (Pool-seq) represents an attractive and cost-effective alternative. However, analyzing sequence read counts from a DNA pool instead of individual genotypes raises statistical challenges in deriving correct estimates of genetic differentiation. In this article, we provide a method-of-moments estimator of F_ST for Pool-seq data, based on an analysis-of-variance framework. We show, by means of simulations, that this new estimator is unbiased and outperforms previously proposed estimators. We evaluate the robustness of our estimator to model misspecification, such as sequencing errors and uneven contributions of individual DNAs to the pools. Finally, by reanalyzing published Pool-seq data of different ecotypes of the prickly sculpin Cottus asper, we show how the use of an unbiased F_ST estimator may question the interpretation of population structure inferred from previous analyses.
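
The statistical challenge mentioned in the abstract can be illustrated with a small simulation. The sketch below is not the estimator from the article. It assumes a simple two-stage sampling model (gene copies drawn into a pool, then reads drawn from the pool) and uses a naive F_ST computed directly from read frequencies as a stand-in, to show why an uncorrected estimate is inflated even when the pools come from the same population. The function names `simulate_pool_reads` and `naive_fst`, and all parameter values, are hypothetical.

```python
import random


def simulate_pool_reads(pop_freq, pool_size, coverage, rng):
    """Two-stage sampling (assumed model): gene copies into the pool, then reads."""
    # Stage 1: draw pool_size gene copies from a population with allele frequency pop_freq.
    copies = sum(rng.random() < pop_freq for _ in range(pool_size))
    pool_freq = copies / pool_size
    # Stage 2: draw `coverage` reads from the pooled DNA.
    alt_reads = sum(rng.random() < pool_freq for _ in range(coverage))
    return alt_reads, coverage


def naive_fst(read_counts):
    """Naive F_ST = (H_T - H_S) / H_T from raw read frequencies (no sampling correction)."""
    freqs = [alt / cov for alt, cov in read_counts]
    mean_p = sum(freqs) / len(freqs)
    h_t = 2 * mean_p * (1 - mean_p)                          # total heterozygosity
    h_s = sum(2 * p * (1 - p) for p in freqs) / len(freqs)   # mean within-pool heterozygosity
    return 0.0 if h_t == 0 else (h_t - h_s) / h_t


if __name__ == "__main__":
    rng = random.Random(1)
    # Two pools sampled from the SAME population, so the true differentiation is zero.
    values = []
    for _ in range(2000):
        pools = [simulate_pool_reads(0.5, 20, 30, rng) for _ in range(2)]
        values.append(naive_fst(pools))
    # The average is positive: both sampling stages add variance that the
    # naive statistic mistakes for differentiation between pools.
    print(sum(values) / len(values))
```

Under these assumptions the naive statistic is never negative at a single locus, so its average over loci is biased upward. A method-of-moments estimator of the kind described in the abstract aims to remove the contribution of both sampling stages.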

SUBMITTER: Hivert V 

PROVIDER: S-EPMC6116966 | biostudies-literature | 2018 Sep

REPOSITORIES: biostudies-literature

Publications

Measuring Genetic Differentiation from Pool-seq Data.

Valentin Hivert, Raphaël Leblois, Eric J. Petit, Mathieu Gautier, Renaud Vitalis

Genetics, 2018-07-30, issue 1


Similar Datasets

| S-EPMC5850601 | biostudies-literature
| S-EPMC5121893 | biostudies-literature
| S-EPMC5100849 | biostudies-literature
| S-EPMC8251607 | biostudies-literature
| S-EPMC5865117 | biostudies-literature
| S-EPMC5996037 | biostudies-literature
| S-EPMC4744716 | biostudies-literature
| S-EPMC4359753 | biostudies-literature
| S-EPMC5499643 | biostudies-literature
| S-EPMC5026257 | biostudies-literature