Unknown

Dataset Information

0

Personalized copy number and segmental duplication maps using next-generation sequencing.


ABSTRACT: Despite their importance in gene innovation and phenotypic variation, duplicated regions have remained largely intractable owing to difficulties in accurately resolving their structure, copy number and sequence content. We present an algorithm (mrFAST) to comprehensively map next-generation sequence reads, which allows for the prediction of absolute copy-number variation of duplicated segments and genes. We examine three human genomes and experimentally validate genome-wide copy number differences. We estimate that, on average, 73-87 genes vary in copy number between any two individuals and find that these genic differences overwhelmingly correspond to segmental duplications (odds ratio = 135; P < 2.2 x 10(-16)). Our method can distinguish between different copies of highly identical genes, providing a more accurate assessment of gene content and insight into functional constraint without the limitations of array-based technology.

SUBMITTER: Alkan C 

PROVIDER: S-EPMC2875196 | biostudies-literature | 2009 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications


Despite their importance in gene innovation and phenotypic variation, duplicated regions have remained largely intractable owing to difficulties in accurately resolving their structure, copy number and sequence content. We present an algorithm (mrFAST) to comprehensively map next-generation sequence reads, which allows for the prediction of absolute copy-number variation of duplicated segments and genes. We examine three human genomes and experimentally validate genome-wide copy number differenc  ...[more]

Similar Datasets

| S-EPMC4021345 | biostudies-literature
| S-EPMC3317159 | biostudies-literature
| S-EPMC2574762 | biostudies-literature
| S-EPMC2867831 | biostudies-literature
| S-EPMC7954749 | biostudies-literature
| S-EPMC4061055 | biostudies-literature
| S-EPMC5655909 | biostudies-other
2008-05-10 | GSE11369 | GEO
| S-EPMC4344483 | biostudies-literature
| S-EPMC5427176 | biostudies-literature