Project description:Centromeres are functionally conserved chromosomal loci essential for proper chromosome segregation during cell division, yet they show high sequence diversity across species. A near universal feature of centromeres is the presence of repetitive sequences, such as satellites and transposable elements (TEs). Because of their rapidly evolving karyotypes, gibbons represent a compelling model to investigate divergence of functional centromere sequences across short evolutionary timescales. Previously, we identified a novel composite retrotransposon, LAVA, that is exclusive to gibbons and expanded within the centromere regions of one gibbon genus, Hoolock. In this study, we use ChIP-seq, RNA-seq and fluorescence in situ hybridization to comprehensively investigate the repeat content of centromeres of the four extant gibbon genera (Hoolock, Hylobates, Nomascus and Siamang). We find that CENP-A nucleosomes and the DNA-protein interface with the inner kinetochore are enriched in retroelements in all gibbon genera, rather than satellite DNA. We find that LAVA in Hoolock is enriched in the centromeres of most chromosomes and shows centromere- and species-specific sequence and structural differences compared to other genera, potentially as a result of its co-option to a centromeric function. In contrast, we found that a centromeric retroelement-derived macrosatellite, SST1, corresponds with chromosome breakpoint reuse across gibbons and shows high sequence conservation across genera. Finally, using de novo assembly of centromere-specific sequences, we determine that transcripts originating from gibbon centromeres recapitulate species-specific TE diversity. Combined, our data reveals dynamic, species-specific shifts in repeat content that define gibbon centromeres and coincide with the extensive karyotypic diversity observed within this lineage.
Project description:We used small RNA-Seq to characterize heart non-coding RNAs in 85 inbred strains of mice from the Hybrid Mouse Diversity Panel (HMDP), a reference resource population for cardiovascular and metabolic traits.