Unknown

Dataset Information

0

A human genome structural variation sequencing resource reveals insights into mutational mechanisms.


ABSTRACT: Understanding the prevailing mutational mechanisms responsible for human genome structural variation requires uniformity in the discovery of allelic variants and precision in terms of breakpoint delineation. We develop a resource based on capillary end sequencing of 13.8 million fosmid clones from 17 human genomes and characterize the complete sequence of 1054 large structural variants corresponding to 589 deletions, 384 insertions, and 81 inversions. We analyze the 2081 breakpoint junctions and infer potential mechanism of origin. Three mechanisms account for the bulk of germline structural variation: microhomology-mediated processes involving short (2-20 bp) stretches of sequence (28%), nonallelic homologous recombination (22%), and L1 retrotransposition (19%). The high quality and long-range continuity of the sequence reveals more complex mutational mechanisms, including repeat-mediated inversions and gene conversion, that are most often missed by other methods, such as comparative genomic hybridization, single nucleotide polymorphism microarrays, and next-generation sequencing.

SUBMITTER: Kidd JM 

PROVIDER: S-EPMC3026629 | biostudies-literature | 2010 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

A human genome structural variation sequencing resource reveals insights into mutational mechanisms.

Kidd Jeffrey M JM   Graves Tina T   Newman Tera L TL   Fulton Robert R   Hayden Hillary S HS   Malig Maika M   Kallicki Joelle J   Kaul Rajinder R   Wilson Richard K RK   Eichler Evan E EE  

Cell 20101101 5


Understanding the prevailing mutational mechanisms responsible for human genome structural variation requires uniformity in the discovery of allelic variants and precision in terms of breakpoint delineation. We develop a resource based on capillary end sequencing of 13.8 million fosmid clones from 17 human genomes and characterize the complete sequence of 1054 large structural variants corresponding to 589 deletions, 384 insertions, and 81 inversions. We analyze the 2081 breakpoint junctions and  ...[more]

Similar Datasets

| S-EPMC3695511 | biostudies-literature
| S-EPMC2674581 | biostudies-literature
| S-EPMC3179416 | biostudies-literature
| S-EPMC4227286 | biostudies-literature
| S-EPMC7118897 | biostudies-literature
| S-EPMC3785719 | biostudies-literature
| S-EPMC3202281 | biostudies-literature
| S-EPMC8082928 | biostudies-literature
| S-EPMC3089435 | biostudies-literature
| S-EPMC3615480 | biostudies-literature