Unknown

Dataset Information

0

The fine-scale architecture of structural variants in 17 mouse genomes.


ABSTRACT: BACKGROUND:Accurate catalogs of structural variants (SVs) in mammalian genomes are necessary to elucidate the potential mechanisms that drive SV formation and to assess their functional impact. Next generation sequencing methods for SV detection are an advance on array-based methods, but are almost exclusively limited to four basic types: deletions, insertions, inversions and copy number gains. RESULTS:By visual inspection of 100 Mbp of genome to which next generation sequence data from 17 inbred mouse strains had been aligned, we identify and interpret 21 paired-end mapping patterns, which we validate by PCR. These paired-end mapping patterns reveal a greater diversity and complexity in SVs than previously recognized. In addition, Sanger-based sequence analysis of 4,176 breakpoints at 261 SV sites reveal additional complexity at approximately a quarter of structural variants analyzed. We find micro-deletions and micro-insertions at SV breakpoints, ranging from 1 to 107 bp, and SNPs that extend breakpoint micro-homology and may catalyze SV formation. CONCLUSIONS:An integrative approach using experimental analyses to train computational SV calling is essential for the accurate resolution of the architecture of SVs. We find considerable complexity in SV formation; about a quarter of SVs in the mouse are composed of a complex mixture of deletion, insertion, inversion and copy number gain. Computational methods can be adapted to identify most paired-end mapping patterns.

SUBMITTER: Yalcin B 

PROVIDER: S-EPMC3439969 | biostudies-literature | 2012

REPOSITORIES: biostudies-literature

altmetric image

Publications

The fine-scale architecture of structural variants in 17 mouse genomes.

Yalcin Binnaz B   Wong Kim K   Bhomra Amarjit A   Goodson Martin M   Keane Thomas M TM   Adams David J DJ   Flint Jonathan J  

Genome biology 20120101 3


<h4>Background</h4>Accurate catalogs of structural variants (SVs) in mammalian genomes are necessary to elucidate the potential mechanisms that drive SV formation and to assess their functional impact. Next generation sequencing methods for SV detection are an advance on array-based methods, but are almost exclusively limited to four basic types: deletions, insertions, inversions and copy number gains.<h4>Results</h4>By visual inspection of 100 Mbp of genome to which next generation sequence dat  ...[more]

Similar Datasets

| S-EPMC1779558 | biostudies-literature
2008-05-01 | E-GEOD-10008 | biostudies-arrayexpress
2008-05-01 | E-GEOD-10037 | biostudies-arrayexpress
2008-05-01 | GSE10008 | GEO
2008-05-01 | GSE10037 | GEO
| S-EPMC6664780 | biostudies-literature
| S-EPMC6720376 | biostudies-literature
2018-02-12 | GSE92291 | GEO
2008-01-23 | E-GEOD-9831 | biostudies-arrayexpress
| S-EPMC6499320 | biostudies-literature