Unknown

Dataset Information

0

Genomic abundance is not predictive of tandem repeat localization in grass genomes.


ABSTRACT: Highly repetitive regions have historically posed a challenge when investigating sequence variation and content. High-throughput sequencing has enabled researchers to use whole-genome shotgun sequencing to estimate the abundance of repetitive sequence, and these methodologies have been recently applied to centromeres. Previous research has investigated variation in centromere repeats across eukaryotes, positing that the highest abundance tandem repeat in a genome is often the centromeric repeat. To test this assumption, we used shotgun sequencing and a bioinformatic pipeline to identify common tandem repeats across a number of grass species. We find that de novo assembly and subsequent abundance ranking of repeats can successfully identify tandem repeats with homology to known tandem repeats. Fluorescent in-situ hybridization shows that de novo assembly and ranking of repeats from non-model taxa identifies chromosome domains rich in tandem repeats both near pericentromeres and elsewhere in the genome.

SUBMITTER: Bilinski P 

PROVIDER: S-EPMC5453492 | biostudies-literature | 2017

REPOSITORIES: biostudies-literature

altmetric image

Publications

Genomic abundance is not predictive of tandem repeat localization in grass genomes.

Bilinski Paul P   Han Yonghua Y   Hufford Matthew B MB   Lorant Anne A   Zhang Pingdong P   Estep Matt C MC   Jiang Jiming J   Ross-Ibarra Jeffrey J  

PloS one 20170601 6


Highly repetitive regions have historically posed a challenge when investigating sequence variation and content. High-throughput sequencing has enabled researchers to use whole-genome shotgun sequencing to estimate the abundance of repetitive sequence, and these methodologies have been recently applied to centromeres. Previous research has investigated variation in centromere repeats across eukaryotes, positing that the highest abundance tandem repeat in a genome is often the centromeric repeat.  ...[more]

Similar Datasets

| S-EPMC3371701 | biostudies-literature
| S-EPMC3488214 | biostudies-literature
| S-EPMC7918261 | biostudies-literature
| S-EPMC5673627 | biostudies-literature
| S-EPMC2248175 | biostudies-literature
| S-EPMC5438981 | biostudies-literature
| PRJEB21659 | ENA
| S-EPMC187525 | biostudies-literature
| S-EPMC3543359 | biostudies-literature
| S-EPMC10199239 | biostudies-literature