Unknown

Dataset Information

0

Identifying Human Genome-Wide CNV, LOH and UPD by Targeted Sequencing of Selected Regions.


ABSTRACT: Copy-number variations (CNV), loss of heterozygosity (LOH), and uniparental disomy (UPD) are large genomic aberrations leading to many common inherited diseases, cancers, and other complex diseases. An integrated tool to identify these aberrations is essential in understanding diseases and in designing clinical interventions. Previous discovery methods based on whole-genome sequencing (WGS) require very high depth of coverage on the whole genome scale, and are cost-wise inefficient. Another approach, whole exome genome sequencing (WEGS), is limited to discovering variations within exons. Thus, we are lacking efficient methods to detect genomic aberrations on the whole genome scale using next-generation sequencing technology. Here we present a method to identify genome-wide CNV, LOH and UPD for the human genome via selectively sequencing a small portion of genome termed Selected Target Regions (SeTRs). In our experiments, the SeTRs are covered by 99.73%~99.95% with sufficient depth. Our developed bioinformatics pipeline calls genome-wide CNVs with high confidence, revealing 8 credible events of LOH and 3 UPD events larger than 5M from 15 individual samples. We demonstrate that genome-wide CNV, LOH and UPD can be detected using a cost-effective SeTRs sequencing approach, and that LOH and UPD can be identified using just a sample grouping technique, without using a matched sample or familial information.

SUBMITTER: Wang Y 

PROVIDER: S-EPMC4412667 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature

altmetric image

Publications


Copy-number variations (CNV), loss of heterozygosity (LOH), and uniparental disomy (UPD) are large genomic aberrations leading to many common inherited diseases, cancers, and other complex diseases. An integrated tool to identify these aberrations is essential in understanding diseases and in designing clinical interventions. Previous discovery methods based on whole-genome sequencing (WGS) require very high depth of coverage on the whole genome scale, and are cost-wise inefficient. Another appr  ...[more]

Similar Datasets

| S-EPMC6288940 | biostudies-literature
| S-EPMC10558996 | biostudies-literature
| S-EPMC5577042 | biostudies-literature
| S-EPMC3348062 | biostudies-literature
| S-EPMC3201882 | biostudies-literature
| S-EPMC3173460 | biostudies-literature
| S-EPMC6791317 | biostudies-literature
| S-EPMC9255983 | biostudies-literature
| S-EPMC4059462 | biostudies-literature
| S-EPMC3677876 | biostudies-literature