Unknown

Dataset Information

0

Detection of common copy number variation with application to population clustering from next generation sequencing data.


ABSTRACT: Copy number variation (CNV) is a structural variation in human genome that has been associated with many complex diseases. In this paper we present a method to detect common copy number variation from next generation sequencing data. First, copy number variations are detected from each individual sample, which is formulated as a total variation penalized least square problem. Second, the common copy number discovery from multiple samples is obtained using source separation techniques such as the non-negative matrix factorization (NMF). Finally, the method is applied to population clustering. The results on real data analysis show that two family trio with different ancestries can be clustered into two ethnic groups based on their common CNVs, demonstrating the potential of the proposed method for application to population genetics.

SUBMITTER: Duan J 

PROVIDER: S-EPMC4154475 | biostudies-literature | 2012

REPOSITORIES: biostudies-literature

altmetric image

Publications

Detection of common copy number variation with application to population clustering from next generation sequencing data.

Duan Junbo J   Zhang Ji-Gang JG   Deng Hong-Wen HW   Wang Yu-Ping YP  

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference 20120101


Copy number variation (CNV) is a structural variation in human genome that has been associated with many complex diseases. In this paper we present a method to detect common copy number variation from next generation sequencing data. First, copy number variations are detected from each individual sample, which is formulated as a total variation penalized least square problem. Second, the common copy number discovery from multiple samples is obtained using source separation techniques such as the  ...[more]

Similar Datasets

| S-EPMC4021345 | biostudies-literature
| S-EPMC3604020 | biostudies-literature
| S-EPMC3317159 | biostudies-literature
| S-EPMC5427176 | biostudies-literature
| S-EPMC4504183 | biostudies-literature
| S-EPMC5655909 | biostudies-other
| S-EPMC8406611 | biostudies-literature
| S-EPMC4111851 | biostudies-literature
| S-EPMC5418319 | biostudies-literature
| S-EPMC6829143 | biostudies-literature