Unknown

Dataset Information

0

Combination of short-read, long-read, and optical mapping assemblies reveals large-scale tandem repeat arrays with population genetic implications.


ABSTRACT: Accurate and contiguous genome assembly is key to a comprehensive understanding of the processes shaping genomic diversity and evolution. Yet, it is frequently constrained by constitutive heterochromatin, usually characterized by highly repetitive DNA. As a key feature of genome architecture associated with centromeric and subtelomeric regions, it locally influences meiotic recombination. In this study, we assess the impact of large tandem repeat arrays on the recombination rate landscape in an avian speciation model, the Eurasian crow. We assembled two high-quality genome references using single-molecule real-time sequencing (long-read assembly [LR]) and single-molecule optical maps (optical map assembly [OM]). A three-way comparison including the published short-read assembly (SR) constructed for the same individual allowed assessing assembly properties and pinpointing misassemblies. By combining information from all three assemblies, we characterized 36 previously unidentified large repetitive regions in the proximity of sequence assembly breakpoints, the majority of which contained complex arrays of a 14-kb satellite repeat or its 1.2-kb subunit. Using whole-genome population resequencing data, we estimated the population-scaled recombination rate (?) and found it to be significantly reduced in these regions. These findings are consistent with an effect of low recombination in regions adjacent to centromeric or subtelomeric heterochromatin and add to our understanding of the processes generating widespread heterogeneity in genetic diversity and differentiation along the genome. By combining three different technologies, our results highlight the importance of adding a layer of information on genome structure that is inaccessible to each approach independently.

SUBMITTER: Weissensteiner MH 

PROVIDER: S-EPMC5411765 | biostudies-literature | 2017 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

Combination of short-read, long-read, and optical mapping assemblies reveals large-scale tandem repeat arrays with population genetic implications.

Weissensteiner Matthias H MH   Pang Andy W C AWC   Bunikis Ignas I   Höijer Ida I   Vinnere-Petterson Olga O   Suh Alexander A   Wolf Jochen B W JBW  

Genome research 20170330 5


Accurate and contiguous genome assembly is key to a comprehensive understanding of the processes shaping genomic diversity and evolution. Yet, it is frequently constrained by constitutive heterochromatin, usually characterized by highly repetitive DNA. As a key feature of genome architecture associated with centromeric and subtelomeric regions, it locally influences meiotic recombination. In this study, we assess the impact of large tandem repeat arrays on the recombination rate landscape in an  ...[more]

Similar Datasets

| S-EPMC9174224 | biostudies-literature
| S-EPMC8812927 | biostudies-literature
| S-EPMC5425171 | biostudies-other
| S-EPMC6218588 | biostudies-literature
2018-06-08 | GSE115454 | GEO
| S-EPMC8145836 | biostudies-literature
| S-EPMC9215042 | biostudies-literature
| S-EPMC3919575 | biostudies-literature
| S-EPMC3044310 | biostudies-literature
| S-EPMC7539535 | biostudies-literature