Project description:The utility of RADseq in an experimental setting is also demonstrated, based on our chasacterisation of an APOBEC mutation signature in an APOBEC3A transfected mouse cell line. 0D5 cells, derived from SSM3 cells, were co-transfected with a mixture containing pcDNA3.1 vectors expressing either APOBEC3A or APOBEC3B (kindly donated by Vincent Caval), pcDNA3.1 construct expressing deaminase null APOBEC3A linked to a uracil deglycosylase construct and a plasmid encoding mutant GFP and WT mCherry that is a reporter for APOBEC mutagenesis. Cells were grown, and gDNA extracted, prior to preparation of RADseq libraries using a PstI- MspI double-digest. Libraries underwent a Pippin Prep to select fragments in the size range of 220-520 bp (genomic sequence plus 148 bp of adapters). Single-end sequencing (1x101bp) was performed on an Illumina NovaSeq6000 utilizing v1.5 chemistry. Reads were aligned to mm10 using bwa mem and variants called using the GATK4 pipeline.
Project description:Centromeres are functionally conserved chromosomal loci essential for proper chromosome segregation during cell division, yet they show high sequence diversity across species. A near universal feature of centromeres is the presence of repetitive sequences, such as satellites and transposable elements (TEs). Because of their rapidly evolving karyotypes, gibbons represent a compelling model to investigate divergence of functional centromere sequences across short evolutionary timescales. Previously, we identified a novel composite retrotransposon, LAVA, that is exclusive to gibbons and expanded within the centromere regions of one gibbon genus, Hoolock. In this study, we use ChIP-seq, RNA-seq and fluorescence in situ hybridization to comprehensively investigate the repeat content of centromeres of the four extant gibbon genera (Hoolock, Hylobates, Nomascus and Siamang). We find that CENP-A nucleosomes and the DNA-protein interface with the inner kinetochore are enriched in retroelements in all gibbon genera, rather than satellite DNA. We find that LAVA in Hoolock is enriched in the centromeres of most chromosomes and shows centromere- and species-specific sequence and structural differences compared to other genera, potentially as a result of its co-option to a centromeric function. In contrast, we found that a centromeric retroelement-derived macrosatellite, SST1, corresponds with chromosome breakpoint reuse across gibbons and shows high sequence conservation across genera. Finally, using de novo assembly of centromere-specific sequences, we determine that transcripts originating from gibbon centromeres recapitulate species-specific TE diversity. Combined, our data reveals dynamic, species-specific shifts in repeat content that define gibbon centromeres and coincide with the extensive karyotypic diversity observed within this lineage.