Dataset Information

The effects of microsatellite selection on linked sequence diversity.


ABSTRACT: The genome-wide scan for selection is an important method for identifying loci involved in adaptive evolution. However, theory that underlies standard scans for selection assumes a simple mutation model. In particular, recurrent mutation of the selective target is not considered. Although this assumption is reasonable for single-nucleotide variants (SNVs), a microsatellite targeted by selection will reliably violate this assumption due to high mutation rate. Moreover, the mutation rate of microsatellites is generally high enough to ensure that recurrent mutation is pervasive rather than occasional. It is therefore unclear if positive selection targeting microsatellites can be detected using standard scanning statistics. Examples of functional variation at microsatellites underscore the significance of understanding the genomic effects of microsatellite selection. Here, we investigate the joint effects of selection and complex mutation on linked sequence diversity, comparing simulations of microsatellite selection and SNV-based selective sweeps. We find that selection on microsatellites is generally difficult to detect using popular summaries of the site frequency spectrum, and, under certain conditions, using popular methods such as the integrated haplotype statistic and SweepFinder. However, comparisons of the number of haplotypes (K) and segregating sites (S) often provide considerable power to detect selection on microsatellites. We apply this knowledge to a scan of autosomes in the human CEU population (CEPH population sampled from Utah). In addition to the most commonly reported targets of selection in European populations, we identify numerous novel genomic regions that bear highly anomalous haplotype configurations. Using one of these regions (intron 1 of MAGI2) as an example, we show that the anomalous configuration is coincident with a perfect CA repeat of length 22. We conclude that standard genome-wide scans will commonly fail to detect mutationally complex targets of selection but that comparisons of K and S will, in many cases, facilitate their identification.
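
The two quantities compared in the abstract, the number of distinct haplotypes (K) and the number of segregating sites (S), can be computed from a sample of aligned haplotypes as in the minimal sketch below. The function name, the encoding of haplotypes as equal-length strings, and the toy sample are assumptions made for illustration only; this record does not describe how the authors compare K and S, so the sketch stops at computing the two counts.

```python
from typing import Sequence, Tuple


def count_haplotypes_and_segregating_sites(haplotypes: Sequence[str]) -> Tuple[int, int]:
    """Return (K, S) for a sample of aligned haplotypes.

    Hypothetical helper: each haplotype is assumed to be a string of
    allele codes (e.g. "0" ancestral, "1" derived), all of equal length.
    """
    if not haplotypes:
        return 0, 0

    length = len(haplotypes[0])
    if any(len(h) != length for h in haplotypes):
        raise ValueError("all haplotypes must have the same length")

    # K: number of distinct haplotypes in the sample.
    k = len(set(haplotypes))

    # S: number of sites (columns) at which more than one allele is present.
    s = sum(1 for column in zip(*haplotypes) if len(set(column)) > 1)

    return k, s


# Toy sample of four haplotypes over five sites (made-up data).
sample = ["00100", "00100", "01100", "01101"]
k, s = count_haplotypes_and_segregating_sites(sample)
print(k, s)  # prints "3 2"
```

In the toy sample, three distinct haplotypes are present and two of the five sites vary, so K = 3 and S = 2.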

SUBMITTER: Haasl RJ 

PROVIDER: S-EPMC4122932 | biostudies-literature

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC9071562 | biostudies-literature
| S-EPMC10713635 | biostudies-literature
| S-EPMC7747977 | biostudies-literature
| S-EPMC4990265 | biostudies-literature
| S-EPMC10120885 | biostudies-literature
2023-02-01 | PXD035592 | Pride
2023-02-01 | PXD035776 | Pride
2023-02-01 | PXD035431 | Pride
2023-02-01 | PXD035536 | Pride
2023-02-01 | PXD035670 | Pride