Unknown

Dataset Information

0

Bayesian haplotype inference for multiple linked single-nucleotide polymorphisms.


ABSTRACT: Haplotypes have gained increasing attention in the mapping of complex-disease genes, because of the abundance of single-nucleotide polymorphisms (SNPs) and the limited power of conventional single-locus analyses. It has been shown that haplotype-inference methods such as Clark's algorithm, the expectation-maximization algorithm, and a coalescence-based iterative-sampling algorithm are fairly effective and economical alternatives to molecular-haplotyping methods. To contend with some weaknesses of the existing algorithms, we propose a new Monte Carlo approach. In particular, we first partition the whole haplotype into smaller segments. Then, we use the Gibbs sampler both to construct the partial haplotypes of each segment and to assemble all the segments together. Our algorithm can accurately and rapidly infer haplotypes for a large number of linked SNPs. By using a wide variety of real and simulated data sets, we demonstrate the advantages of our Bayesian algorithm, and we show that it is robust to the violation of Hardy-Weinberg equilibrium, to the presence of missing data, and to occurrences of recombination hotspots.

SUBMITTER: Niu T 

PROVIDER: S-EPMC448439 | biostudies-other | 2002 Jan

REPOSITORIES: biostudies-other

Similar Datasets

| S-EPMC1196396 | biostudies-literature
| S-EPMC3827885 | biostudies-other
| S-EPMC2892635 | biostudies-other
| S-EPMC3058059 | biostudies-literature
| S-EPMC2638261 | biostudies-literature
| S-EPMC3730463 | biostudies-literature
| S-EPMC1866685 | biostudies-literature
| S-EPMC5915226 | biostudies-literature
| S-EPMC3241355 | biostudies-literature
| S-EPMC5039928 | biostudies-literature