Unknown

Dataset Information

0

HaplotypeCN: copy number haplotype inference with Hidden Markov Model and localized haplotype clustering.


ABSTRACT: Copy number variation (CNV) has been reported to be associated with disease and various cancers. Hence, identifying the accurate position and the type of CNV is currently a critical issue. There are many tools targeting on detecting CNV regions, constructing haplotype phases on CNV regions, or estimating the numerical copy numbers. However, none of them can do all of the three tasks at the same time. This paper presents a method based on Hidden Markov Model to detect parent specific copy number change on both chromosomes with signals from SNP arrays. A haplotype tree is constructed with dynamic branch merging to model the transition of the copy number status of the two alleles assessed at each SNP locus. The emission models are constructed for the genotypes formed with the two haplotypes. The proposed method can provide the segmentation points of the CNV regions as well as the haplotype phasing for the allelic status on each chromosome. The estimated copy numbers are provided as fractional numbers, which can accommodate the somatic mutation in cancer specimens that usually consist of heterogeneous cell populations. The algorithm is evaluated on simulated data and the previously published regions of CNV of the 270 HapMap individuals. The results were compared with five popular methods: PennCNV, genoCN, COKGEN, QuantiSNP and cnvHap. The application on oral cancer samples demonstrates how the proposed method can facilitate clinical association studies. The proposed algorithm exhibits comparable sensitivity of the CNV regions to the best algorithm in our genome-wide study and demonstrates the highest detection rate in SNP dense regions. In addition, we provide better haplotype phasing accuracy than similar approaches. The clinical association carried out with our fractional estimate of copy numbers in the cancer samples provides better detection power than that with integer copy number states.

SUBMITTER: Lin YJ 

PROVIDER: S-EPMC4029584 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature

altmetric image

Publications

HaplotypeCN: copy number haplotype inference with Hidden Markov Model and localized haplotype clustering.

Lin Yen-Jen YJ   Chen Yu-Tin YT   Hsu Shu-Ni SN   Peng Chien-Hua CH   Tang Chuan-Yi CY   Yen Tzu-Chen TC   Hsieh Wen-Ping WP  

PloS one 20140521 5


Copy number variation (CNV) has been reported to be associated with disease and various cancers. Hence, identifying the accurate position and the type of CNV is currently a critical issue. There are many tools targeting on detecting CNV regions, constructing haplotype phases on CNV regions, or estimating the numerical copy numbers. However, none of them can do all of the three tasks at the same time. This paper presents a method based on Hidden Markov Model to detect parent specific copy number  ...[more]

Similar Datasets

| S-EPMC4866742 | biostudies-literature
| S-EPMC6236906 | biostudies-literature
| S-EPMC3371636 | biostudies-literature
| S-EPMC8550639 | biostudies-literature
| S-EPMC4277924 | biostudies-literature
| S-EPMC1874617 | biostudies-literature
| S-EPMC10370020 | biostudies-literature
| S-EPMC7455056 | biostudies-literature
| S-EPMC2265661 | biostudies-other
| S-EPMC4867884 | biostudies-other