Dataset Information

Detecting selective sweeps from pooled next-generation sequencing samples.


ABSTRACT: Due to its cost effectiveness, next-generation sequencing of pools of individuals (Pool-Seq) is becoming a popular strategy for characterizing variation in population samples. Because Pool-Seq provides genome-wide SNP frequency data, it is possible to use them for demographic inference and/or the identification of selective sweeps. Here, we introduce a statistical method that is designed to detect selective sweeps from pooled data by accounting for statistical challenges associated with Pool-Seq, namely sequencing errors and random sampling among chromosomes. This allows for an efficient use of the information: all base calls are included in the analysis, but the higher credibility of regions with higher coverage and base calls with better quality scores is accounted for. Computer simulations show that our method efficiently detects sweeps even at very low coverage (0.5× per chromosome). Indeed, the power of detecting sweeps is similar to what we could expect from sequences of individual chromosomes. Since the inference of selective sweeps is based on the allele frequency spectrum (AFS), we also provide a method to accurately estimate the AFS provided that the quality scores for the sequence reads are reliable. Applying our approach to Pool-Seq data from Drosophila melanogaster, we identify several selective sweep signatures on chromosome X that include some previously well-characterized sweeps like the wapl region.
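The idea of weighting each base call by its quality score can be illustrated with a small sketch. This is not the authors' implementation: it assumes a biallelic site, treats a Phred score Q as an error probability 10^(-Q/10) that flips the observed allele, and assumes each read is drawn uniformly at random from the pooled chromosomes. The function names `phred_to_error` and `pool_count_posterior` are hypothetical and chosen only for illustration.

```python
import math


def phred_to_error(q):
    """Convert a Phred quality score to an error probability (assumes q > 0)."""
    return 10 ** (-q / 10)


def pool_count_posterior(calls, quals, n_chrom):
    """Posterior over the derived-allele count k = 0..n_chrom in the pool.

    calls   -- list of 0/1 base calls (1 = derived allele observed)
    quals   -- Phred quality score for each call
    n_chrom -- number of chromosomes in the pool
    A uniform prior over k is assumed (an illustrative simplification).
    """
    eps = 1e-12  # guard against log(0) for extreme quality scores
    log_liks = []
    for k in range(n_chrom + 1):
        f = k / n_chrom  # true derived-allele frequency in the pool
        ll = 0.0
        for call, q in zip(calls, quals):
            e = phred_to_error(q)
            # A derived call is either a correct read of a derived chromosome
            # or a miscalled read of an ancestral chromosome.
            p_derived = f * (1 - e) + (1 - f) * e
            p_derived = min(max(p_derived, eps), 1 - eps)
            ll += math.log(p_derived if call == 1 else 1 - p_derived)
        log_liks.append(ll)
    # Normalise in log space for numerical stability.
    m = max(log_liks)
    weights = [math.exp(ll - m) for ll in log_liks]
    total = sum(weights)
    return [w / total for w in weights]
```

Under these assumptions, a low-quality call leaves the posterior nearly flat while a high-quality call shifts it strongly, which is the sense in which every base call is used but better-supported calls count for more.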

SUBMITTER: Boitard S 

PROVIDER: S-EPMC3424412 | biostudies-literature | 2012 Sep

REPOSITORIES: biostudies-literature

Publications

Detecting selective sweeps from pooled next-generation sequencing samples.

Boitard S, Schlötterer C, Nolte V, Pandey RV, Futschik A

Molecular Biology and Evolution, 2012-03-12, issue 9


Similar Datasets

| S-EPMC5037392 | biostudies-literature
| S-EPMC3268604 | biostudies-literature
| S-EPMC4148959 | biostudies-literature
| S-EPMC1206276 | biostudies-other
| S-EPMC3861164 | biostudies-literature
| S-EPMC5001557 | biostudies-literature
| S-EPMC3477622 | biostudies-literature
| S-EPMC3630221 | biostudies-literature
2017-04-03 | PXD003804 | Pride
| S-EPMC3675123 | biostudies-literature