Dataset Information

Population genomic inference of recombination rates and hotspots.


ABSTRACT: As more human genomic data become available, fine-scale recombination rate variation can be inferred on a genome-wide scale. Current statistical methods to infer recombination rates that can be applied to moderate, or large, genomic regions are limited to approximated likelihoods. Here, we develop a Bayesian full-likelihood method using Markov Chain Monte Carlo (MCMC) to estimate background recombination rates and hotspots. The probability model is inspired by the observed patterns of recombination at several genomic regions analyzed in sperm-typing studies. Posterior probabilities and Bayes factors of recombination hotspots along chromosomes are inferred. For moderate-size genomic regions (e.g., with <100 SNPs), the full-likelihood method is used. Larger regions are split into subintervals (typically each having between 20 and 50 markers). The likelihood is approximated based on the genealogies for each subinterval. The background recombination rates, hotspots, and parameters are evaluated by using a parallel computing approach and assuming shared parameters across the subintervals. Simulation analyses show that our method can accurately estimate the variation in recombination rates across genomic regions. In particular, clusters of hotspots can be distinguished even though weaker hotspots are present. The method is applied to SNP data from the HLA region, the MS32, and chromosome 19.
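The partitioning and Bayes-factor steps described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the target subinterval size of 35 markers, and the prior hotspot probability passed to `bayes_factor` are assumptions made for illustration. The only details taken from the abstract are the thresholds (regions with fewer than 100 SNPs are analyzed whole, larger regions are split into subintervals of roughly 20 to 50 markers). The Bayes factor uses the standard definition, posterior odds divided by prior odds.

```python
def split_into_subintervals(n_markers, target=35, min_size=20, max_size=50):
    """Return (start, end) index pairs, end exclusive, covering all markers.

    Regions with fewer than 100 markers are returned as a single block,
    mirroring the abstract's full-likelihood case. The target size of 35
    is a hypothetical default, not a value from the paper.
    """
    if n_markers <= 0:
        return []
    if n_markers < 100:
        return [(0, n_markers)]

    # Pick a number of chunks so that each holds about `target` markers.
    n_chunks = max(1, round(n_markers / target))
    base, extra = divmod(n_markers, n_chunks)

    intervals = []
    start = 0
    for i in range(n_chunks):
        # Spread the remainder over the first `extra` chunks.
        size = base + (1 if i < extra else 0)
        if not (min_size <= size <= max_size):
            raise ValueError("chunk size outside the allowed range")
        intervals.append((start, start + size))
        start += size
    return intervals


def bayes_factor(posterior_prob, prior_prob):
    """Bayes factor for a hotspot: posterior odds divided by prior odds."""
    for p in (posterior_prob, prior_prob):
        if not 0.0 < p < 1.0:
            raise ValueError("probabilities must lie strictly between 0 and 1")
    posterior_odds = posterior_prob / (1.0 - posterior_prob)
    prior_odds = prior_prob / (1.0 - prior_prob)
    return posterior_odds / prior_odds


if __name__ == "__main__":
    print(split_into_subintervals(100))
    print(bayes_factor(0.9, 0.5))
```

In the method itself the posterior hotspot probabilities would come from the MCMC samples; here they are plain inputs so that the arithmetic stays visible.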

SUBMITTER: Wang Y 

PROVIDER: S-EPMC2669376 | biostudies-literature | 2009 Apr

REPOSITORIES: biostudies-literature

Publications

Population genomic inference of recombination rates and hotspots.

Wang Ying, Rannala Bruce

Proceedings of the National Academy of Sciences of the United States of America, 2009-04-02, issue 15


Similar Datasets

| S-EPMC4682399 | biostudies-literature
| S-EPMC4256775 | biostudies-literature
2011-03-16 | E-GEOD-25656 | biostudies-arrayexpress
2011-03-16 | GSE25656 | GEO
| S-EPMC1201364 | biostudies-literature
| S-EPMC3218859 | biostudies-literature
| S-EPMC8582322 | biostudies-literature
| S-EPMC3389972 | biostudies-literature
| S-EPMC4889653 | biostudies-literature
| S-EPMC4315300 | biostudies-literature