Unknown

Dataset Information

0

Inferring combined CNV/SNP haplotypes from genotype data.


ABSTRACT:

Motivation

Copy number variations (CNVs) are increasingly recognized as an substantial source of individual genetic variation, and hence there is a growing interest in investigating the evolutionary history of CNVs as well as their impact on complex disease susceptibility. CNV/SNP haplotypes are critical for this research, but although many methods have been proposed for inferring integer copy number, few have been designed for inferring CNV haplotypic phase and none of these are applicable at genome-wide scale. Here, we present a method for inferring missing CNV genotypes, predicting CNV allelic configuration and for inferring CNV haplotypic phase from SNP/CNV genotype data. Our method, implemented in the software polyHap v2.0, is based on a hidden Markov model, which models the joint haplotype structure between CNVs and SNPs. Thus, haplotypic phase of CNVs and SNPs are inferred simultaneously. A sampling algorithm is employed to obtain a measure of confidence/credibility of each estimate.

Results

We generated diploid phase-known CNV-SNP genotype datasets by pairing male X chromosome CNV-SNP haplotypes. We show that polyHap provides accurate estimates of missing CNV genotypes, allelic configuration and CNV haplotypic phase on these datasets. We applied our method to a non-simulated dataset-a region on Chromosome 2 encompassing a short deletion. The results confirm that polyHap's accuracy extends to real-life datasets.

Availability

Our method is implemented in version 2.0 of the polyHap software package and can be downloaded from http://www.imperial.ac.uk/medicine/people/l.coin.

SUBMITTER: Su SY 

PROVIDER: S-EPMC2913665 | biostudies-literature | 2010 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Inferring combined CNV/SNP haplotypes from genotype data.

Su Shu-Yi SY   Asher Julian E JE   Jarvelin Marjo-Riita MR   Froguel Phillipe P   Blakemore Alexandra I F AI   Balding David J DJ   Coin Lachlan J M LJ  

Bioinformatics (Oxford, England) 20100420 11


<h4>Motivation</h4>Copy number variations (CNVs) are increasingly recognized as an substantial source of individual genetic variation, and hence there is a growing interest in investigating the evolutionary history of CNVs as well as their impact on complex disease susceptibility. CNV/SNP haplotypes are critical for this research, but although many methods have been proposed for inferring integer copy number, few have been designed for inferring CNV haplotypic phase and none of these are applica  ...[more]

Similar Datasets

| S-EPMC4491549 | biostudies-literature
| S-EPMC5809101 | biostudies-literature
| S-EPMC3492655 | biostudies-literature
| S-EPMC8317106 | biostudies-literature
| S-EPMC3276117 | biostudies-literature
2012-09-07 | GSE40698 | GEO
| S-EPMC4162913 | biostudies-literature
| S-EPMC3852919 | biostudies-literature
2012-09-07 | E-GEOD-40698 | biostudies-arrayexpress
| S-EPMC3491382 | biostudies-literature