Unknown

Dataset Information

0

Reducing system noise in copy number data using principal components of self-self hybridizations.


ABSTRACT: Genomic copy number variation underlies genetic disorders such as autism, schizophrenia, and congenital heart disease. Copy number variations are commonly detected by array based comparative genomic hybridization of sample to reference DNAs, but probe and operational variables combine to create correlated system noise that degrades detection of genetic events. To correct for this we have explored hybridizations in which no genetic signal is expected, namely "self-self" hybridizations (SSH) comparing DNAs from the same genome. We show that SSH trap a variety of correlated system noise present also in sample-reference (test) data. Through singular value decomposition of SSH, we are able to determine the principal components (PCs) of this noise. The PCs themselves offer deep insights into the sources of noise, and facilitate detection of artifacts. We present evidence that linear and piecewise linear correction of test data with the PCs does not introduce detectable spurious signal, yet improves signal-to-noise metrics, reduces false positives, and facilitates copy number determination.

SUBMITTER: Lee YH 

PROVIDER: S-EPMC3271883 | biostudies-literature | 2012 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Reducing system noise in copy number data using principal components of self-self hybridizations.

Lee Yoon-ha YH   Ronemus Michael M   Kendall Jude J   Lakshmi B B   Leotta Anthony A   Levy Dan D   Esposito Diane D   Grubor Vladimir V   Ye Kenny K   Wigler Michael M   Yamrom Boris B  

Proceedings of the National Academy of Sciences of the United States of America 20111229 3


Genomic copy number variation underlies genetic disorders such as autism, schizophrenia, and congenital heart disease. Copy number variations are commonly detected by array based comparative genomic hybridization of sample to reference DNAs, but probe and operational variables combine to create correlated system noise that degrades detection of genetic events. To correct for this we have explored hybridizations in which no genetic signal is expected, namely "self-self" hybridizations (SSH) compa  ...[more]

Similar Datasets

| S-EPMC3872138 | biostudies-literature
2011-06-09 | GSE23682 | GEO
| S-EPMC1994778 | biostudies-literature
| S-EPMC5494116 | biostudies-literature
| S-EPMC4408558 | biostudies-literature
| S-EPMC1501050 | biostudies-literature
| S-EPMC2992445 | biostudies-literature
| S-EPMC3636030 | biostudies-literature
| S-EPMC4583840 | biostudies-literature
| S-EPMC3962763 | biostudies-literature