Unknown

Dataset Information

0

Genome-wide compatible SNP intervals and their properties.


ABSTRACT: Intraspecific genomes can be subdivided into blocks with limited diversity. Understanding the distribution and structure of these blocks will help to unravel many biological problems including the identification of genes associated with complex diseases, finding the ancestral origins of a given population, and localizing regions of historical recombination, gene conversion, and homoplasy. We present methods for partitioning a genome into blocks for which there are no apparent recombinations, thus providing parsimonious sets of compatible genome intervals based on the four-gamete test. Our contribution is a thorough analysis of the problem of dividing a genome into compatible intervals, in terms of its computational complexity, and by providing an achievable lower-bound on the minimal number of intervals required to cover an entire data set. In general, such minimal interval partitions are not unique. However, we identify properties that are common to every possible solution. We also define the notion of an interval set that achieves the interval lower-bound, yet maximizes interval overlap. We demonstrate algorithms for partitioning both haplotype data from inbred mice as well as outbred heterozygous genotype data using extensions of the standard four-gamete test. These methods allow our algorithms to be applied to a wide range of genomic data sets.

SUBMITTER: Wang J 

PROVIDER: S-EPMC5690570 | biostudies-literature | 2010 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

Genome-wide compatible SNP intervals and their properties.

Wang Jeremy J   de Villena Fernando Pardo-Manual FP   Moore Kyle J KJ   Wang Wei W   Zhang Qi Q   McMillan Leonard L  

The 2010 ACM International Conference on Bioinformatics and Computational Biology : ACM-BCB 2010 : Niagara Falls, New York, U.S.A., August 2-4, 2010. ACM International Conference on Bioinformatics and Computational Biology (1st : 2010 :.. 20100801


Intraspecific genomes can be subdivided into blocks with limited diversity. Understanding the distribution and structure of these blocks will help to unravel many biological problems including the identification of genes associated with complex diseases, finding the ancestral origins of a given population, and localizing regions of historical recombination, gene conversion, and homoplasy. We present methods for partitioning a genome into blocks for which there are no apparent recombinations, thu  ...[more]

Similar Datasets

| S-EPMC6813082 | biostudies-other
| S-EPMC3871583 | biostudies-literature
| S-EPMC4072903 | biostudies-literature
| S-EPMC3022824 | biostudies-literature
| S-EPMC4938657 | biostudies-literature
| S-EPMC3483552 | biostudies-literature
| S-EPMC6655603 | biostudies-literature
| S-EPMC343282 | biostudies-literature
| S-EPMC5698929 | biostudies-literature
| S-EPMC3722675 | biostudies-literature