Unknown

Dataset Information

0

HaploBlocker: Creation of Subgroup-Specific Haplotype Blocks and Libraries.


ABSTRACT: The concept of haplotype blocks has been shown to be useful in genetics. Fields of application range from the detection of regions under positive selection to statistical methods that make use of dimension reduction. We propose a novel approach ("HaploBlocker") for defining and inferring haplotype blocks that focuses on linkage instead of the commonly used population-wide measures of linkage disequilibrium. We define a haplotype block as a sequence of genetic markers that has a predefined minimum frequency in the population, and only haplotypes with a similar sequence of markers are considered to carry that block, effectively screening a dataset for group-wise identity-by-descent. From these haplotype blocks, we construct a haplotype library that represents a large proportion of genetic variability with a limited number of blocks. Our method is implemented in the associated R-package HaploBlocker, and provides flexibility not only to optimize the structure of the obtained haplotype library for subsequent analyses, but also to handle datasets of different marker density and genetic diversity. By using haplotype blocks instead of single nucleotide polymorphisms (SNPs), local epistatic interactions can be naturally modeled, and the reduced number of parameters enables a wide variety of new methods for further genomic analyses such as genomic prediction and the detection of selection signatures. We illustrate our methodology with a dataset comprising 501 doubled haploid lines in a European maize landrace genotyped at 501,124 SNPs. With the suggested approach, we identified 2991 haplotype blocks with an average length of 2685 SNPs that together represent 94% of the dataset.

SUBMITTER: Pook T 

PROVIDER: S-EPMC6707469 | biostudies-literature | 2019 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

HaploBlocker: Creation of Subgroup-Specific Haplotype Blocks and Libraries.

Pook Torsten T   Schlather Martin M   de Los Campos Gustavo G   Mayer Manfred M   Schoen Chris Carolin CC   Simianer Henner H  

Genetics 20190531 4


The concept of haplotype blocks has been shown to be useful in genetics. Fields of application range from the detection of regions under positive selection to statistical methods that make use of dimension reduction. We propose a novel approach ("HaploBlocker") for defining and inferring haplotype blocks that focuses on linkage instead of the commonly used population-wide measures of linkage disequilibrium. We define a haplotype block as a sequence of genetic markers that has a predefined minimu  ...[more]

Similar Datasets

| S-EPMC7243190 | biostudies-literature
| S-EPMC2783737 | biostudies-literature
| S-EPMC4027187 | biostudies-literature
| S-EPMC4395795 | biostudies-literature
| S-EPMC7754423 | biostudies-literature
| S-EPMC1181911 | biostudies-literature
| S-EPMC1285181 | biostudies-literature
| S-EPMC2851959 | biostudies-literature
| S-EPMC430919 | biostudies-literature
| S-EPMC1488870 | biostudies-literature