Unknown

Dataset Information

0

Family-Based Haplotype Estimation and Allele Dosage Correction for Polyploids Using Short Sequence Reads.


ABSTRACT: DNA sequence reads contain information about the genomic variants located on a single chromosome. By extracting and extending this information using the overlaps between the reads, the haplotypes of an individual can be obtained. Using parent-offspring relationships in a population can considerably improve the quality of the haplotypes obtained from short reads, as pedigree information can be used to correct for spurious overlaps (due to sequencing errors) and insufficient overlaps (due to short read lengths, low genomic variation and shallow coverage). We developed a novel method, PopPoly, to estimate polyploid haplotypes in an F1-population from short sequence data by taking into consideration the transmission of the haplotypes from the parents to the offspring. In addition, this information is employed to improve genotype dosage estimation and to call missing genotypes in the population. Through simulations, we compare PopPoly to other haplotyping methods and show its better performance. We evaluate PopPoly by applying it to a tetraploid potato cross at nine genomic regions involved in tuber formation.

SUBMITTER: Motazedi E 

PROVIDER: S-EPMC6477055 | biostudies-literature | 2019

REPOSITORIES: biostudies-literature

altmetric image

Publications

Family-Based Haplotype Estimation and Allele Dosage Correction for Polyploids Using Short Sequence Reads.

Motazedi Ehsan E   Maliepaard Chris C   Finkers Richard R   Visser Richard R   de Ridder Dick D  

Frontiers in genetics 20190416


DNA sequence reads contain information about the genomic variants located on a single chromosome. By extracting and extending this information using the overlaps between the reads, the haplotypes of an individual can be obtained. Using parent-offspring relationships in a population can considerably improve the quality of the haplotypes obtained from short reads, as pedigree information can be used to correct for spurious overlaps (due to sequencing errors) and insufficient overlaps (due to short  ...[more]

Similar Datasets

| S-EPMC3791270 | biostudies-literature
| S-EPMC5932196 | biostudies-other
| S-EPMC7080815 | biostudies-literature
| S-EPMC4720449 | biostudies-literature
| S-EPMC3495688 | biostudies-literature
| S-EPMC3092772 | biostudies-literature
| S-EPMC4674864 | biostudies-literature
| S-EPMC3188795 | biostudies-literature
| S-EPMC3460743 | biostudies-literature
| S-EPMC9750119 | biostudies-literature