Dataset Information

Genome-wide algorithm for detecting CNV associations with diseases.

ABSTRACT:

Background

SNP genotyping arrays have been developed to characterize single-nucleotide polymorphisms (SNPs) and DNA copy number variations (CNVs). Nonparametric and model-based statistical algorithms have been developed to detect CNVs from SNP data using the marker intensities. However, these algorithms lack specificity to detect small CNVs owing to the high false positive rate when calling CNVs based on the intensity values. Therefore, the resulting association tests lack power even if the CNVs affecting disease risk are common. An alternative procedure called PennCNV uses information from both the marker intensities as well as the genotypes and therefore has increased sensitivity.

Results

By using the hidden Markov model (HMM) implemented in PennCNV to derive the probabilities of different copy number states which we subsequently used in a logistic regression model, we developed a new genome-wide algorithm to detect CNV associations with diseases. We compared this new method with association test applied to the most probable copy number state for each individual that is provided by PennCNV after it performs an initial HMM analysis followed by application of the Viterbi algorithm, which removes information about copy number probabilities. In one of our simulation studies, we showed that for large CNVs (number of SNPs ? 10), the association tests based on PennCNV calls gave more significant results, but the new algorithm retained high power. For small CNVs (number of SNPs <10), the logistic algorithm provided smaller average p-values (e.g., p = 7.54e - 17 when relative risk RR = 3.0) in all the scenarios and could capture signals that PennCNV did not (e.g., p = 0.020 when RR = 3.0). From a second set of simulations, we showed that the new algorithm is more powerful in detecting disease associations with small CNVs (number of SNPs ranging from 3 to 5) under different penetrance models (e.g., when RR = 3.0, for relatively weak signals, power = 0.8030 comparing to 0.2879 obtained from the association tests based on PennCNV calls). The new method was implemented in software GWCNV. It is freely available at http://gwcnv.sourceforge.net, distributed under a GPL license.

Conclusions

We conclude that the new algorithm is more sensitive and can be more powerful in detecting CNV associations with diseases than the existing HMM algorithm, especially when the CNV association signal is weak and a limited number of SNPs are located in the CNV.

SUBMITTER: Xu Y

PROVIDER: S-EPMC3173460 | biostudies-literature | 2011 Aug

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Genome-wide algorithm for detecting CNV associations with diseases.

Xu Yaji Y Peng Bo B Fu Yunxin Y Amos Christopher I CI

BMC bioinformatics 20110809

<h4>Background</h4>SNP genotyping arrays have been developed to characterize single-nucleotide polymorphisms (SNPs) and DNA copy number variations (CNVs). Nonparametric and model-based statistical algorithms have been developed to detect CNVs from SNP data using the marker intensities. However, these algorithms lack specificity to detect small CNVs owing to the high false positive rate when calling CNVs based on the intensity values. Therefore, the resulting association tests lack power even if ...[more]

PMID: 21827692

Dataset Information

Genome-wide algorithm for detecting CNV associations with diseases.

Background

Results

Conclusions

Publications

Genome-wide algorithm for detecting CNV associations with diseases.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

A FAST ALGORITHM FOR DETECTING GENE-GENE INTERACTIONS IN GENOME-WIDE ASSOCIATION STUDIES.
| S-EPMC4595934 | biostudies-literature

MetaPhat: Detecting and Decomposing Multivariate Associations From Univariate Genome-Wide Association Statistics.
| S-EPMC7242752 | biostudies-literature

GStream: improving SNP and CNV coverage on genome-wide association studies.
| S-EPMC3700900 | biostudies-literature

Gender differences in CNV burden do not confound schizophrenia CNV associations.
| S-EPMC4869015 | biostudies-literature

Genome-wide copy number variation (CNV) in patients with autoimmune Addison's disease.
| S-EPMC3166911 | biostudies-literature

X-CNV: genome-wide prediction of the pathogenicity of copy number variations.
| S-EPMC8375180 | biostudies-literature

GWAS3D: Detecting human regulatory variants by integrative analysis of genome-wide associations, chromosome interactions and histone modifications.
| S-EPMC3692118 | biostudies-literature

Continuing difficulties in interpreting CNV data: lessons from a genome-wide CNV association study of Australian HNPCC/lynch syndrome patients.
| S-EPMC3626775 | biostudies-literature

Detecting associations of rare variants with common diseases: collapsing or haplotyping?
| S-EPMC4570202 | biostudies-literature

JAX-CNV: A Whole-genome Sequencing-based Algorithm for Copy Number Detection at Clinical Grade Level.
| S-EPMC10225484 | biostudies-literature