Unknown

Dataset Information

0

Identifying disease-associated copy number variations by a doubly penalized regression model.


ABSTRACT: Copy number variation (CNV) of DNA plays an important role in the development of many diseases. However, due to the irregularity and sparsity of the CNVs, studying the association between CNVs and a disease outcome or a trait can be challenging. Up to now, not many methods have been proposed in the literature for this problem. Most of the current researchers reply on an ad hoc two-stage procedure by first identifying CNVs in each individual genome and then performing an association test using these identified CNVs. This potentially leads to information loss and as a result a lower power to identify disease associated CNVs. In this article, we describe a new method that combines the two steps into a single coherent model to identify the common CNV across patients that are associated with certain diseases. We use a double penalty model to capture CNVs' association with both the intensities and the disease trait. We validate its performance in simulated datasets and a data example on platinum resistance and CNV in ovarian cancer genome.

SUBMITTER: Cheng Y 

PROVIDER: S-EPMC6663092 | biostudies-literature | 2018 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Identifying disease-associated copy number variations by a doubly penalized regression model.

Cheng Yichen Y   Dai James Y JY   Wang Xiaoyu X   Kooperberg Charles C  

Biometrics 20180612 4


Copy number variation (CNV) of DNA plays an important role in the development of many diseases. However, due to the irregularity and sparsity of the CNVs, studying the association between CNVs and a disease outcome or a trait can be challenging. Up to now, not many methods have been proposed in the literature for this problem. Most of the current researchers reply on an ad hoc two-stage procedure by first identifying CNVs in each individual genome and then performing an association test using th  ...[more]

Similar Datasets

| S-EPMC4676147 | biostudies-literature
| S-EPMC8278790 | biostudies-literature
| S-EPMC2585631 | biostudies-other
| S-EPMC5771864 | biostudies-literature
| S-EPMC6559655 | biostudies-literature
| S-EPMC5110597 | biostudies-literature
| S-EPMC2731494 | biostudies-literature
| S-EPMC3997817 | biostudies-literature
| S-EPMC3330746 | biostudies-literature
| S-EPMC5287955 | biostudies-literature