Unknown

Dataset Information

0

Systematic prediction and validation of breakpoints associated with copy-number variants in the human genome.


ABSTRACT: Copy-number variants (CNVs) are an abundant form of genetic variation in humans. However, approaches for determining exact CNV breakpoint sequences (physical deletion or duplication boundaries) across individuals, crucial for associating genotype to phenotype, have been lacking so far, and the vast majority of CNVs have been reported with approximate genomic coordinates only. Here, we report an approach, called BreakPtr, for fine-mapping CNVs (available from http://breakptr.gersteinlab.org). We statistically integrate both sequence characteristics and data from high-resolution comparative genome hybridization experiments in a discrete-valued, bivariate hidden Markov model. Incorporation of nucleotide-sequence information allows us to take into account the fact that recently duplicated sequences (e.g., segmental duplications) often coincide with breakpoints. In anticipation of an upcoming increase in CNV data, we developed an iterative, "active" approach to initially scoring with a preliminary model, performing targeted validations, retraining the model, and then rescoring, and a flexible parameterization system that intuitively collapses from a full model of 2,503 parameters to a core one of only 10. Using our approach, we accurately mapped >400 breakpoints on chromosome 22 and a region of chromosome 11, refining the boundaries of many previously approximately mapped CNVs. Four predicted breakpoints flanked known disease-associated deletions. We validated an additional four predicted CNV breakpoints by sequencing. Overall, our results suggest a predictive resolution of approximately 300 bp. This level of resolution enables more precise correlations between CNVs and across individuals than previously possible, allowing the study of CNV population frequencies. Further, it enabled us to demonstrate a clear Mendelian pattern of inheritance for one of the CNVs.

SUBMITTER: Korbel JO 

PROVIDER: S-EPMC1891248 | biostudies-literature | 2007 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Systematic prediction and validation of breakpoints associated with copy-number variants in the human genome.

Korbel Jan O JO   Urban Alexander Eckehart AE   Grubert Fabian F   Du Jiang J   Royce Thomas E TE   Starr Peter P   Zhong Guoneng G   Emanuel Beverly S BS   Weissman Sherman M SM   Snyder Michael M   Gerstein Mark B MB  

Proceedings of the National Academy of Sciences of the United States of America 20070605 24


Copy-number variants (CNVs) are an abundant form of genetic variation in humans. However, approaches for determining exact CNV breakpoint sequences (physical deletion or duplication boundaries) across individuals, crucial for associating genotype to phenotype, have been lacking so far, and the vast majority of CNVs have been reported with approximate genomic coordinates only. Here, we report an approach, called BreakPtr, for fine-mapping CNVs (available from http://breakptr.gersteinlab.org). We  ...[more]

Similar Datasets

| S-EPMC5500957 | biostudies-literature
| S-EPMC3112242 | biostudies-literature
| S-EPMC2876824 | biostudies-other
| S-EPMC4118997 | biostudies-literature
| S-EPMC5720705 | biostudies-other
| S-EPMC4160470 | biostudies-literature
| S-EPMC6101599 | biostudies-other
| S-EPMC3409366 | biostudies-literature
2020-10-05 | GSE114131 | GEO
| S-EPMC11266557 | biostudies-literature