Project description:Humans exhibit significant phenotypic differences from other great apes, yet pinpointing the underlying genetic changes has been limited by incomplete reference genomes and a reliance on a single assembly to represent a species. We aligned 20 telomere-to-telomere (T2T) assemblies spanning great ape evolution and variation to define 1,596 consensus human ancestor quickly evolved regions (Consensus HAQERs), regions that diverged rapidly between the human-chimpanzee ancestor and an ancestral node of modern humans. Unlike prior HAQER sets based on single assemblies for a species, Consensus HAQERs incorporate population variation, reducing the likelihood of intraspecies variation appearing to be interspecies divergence. These regions contain signatures of elevated mutation rates, ancient positive selection, bivalent regulatory function, and are enriched in disease-linked loci, often emerging in previously inaccessible repetitive DNA. Through multiplex, single-cell enhancer assays, we identify HAQERs as active enhancers in the developing brain and cardiomyocytes, highlighting their potential contributions to human-specific gene regulation.
Project description:We used an approach combining PacBio data and published Illumina reads to de novo assemble D. busckii contigs. We generated Hi-C data from D. busckii embryos to order these contigs into chromosome-length scaffolds. For D. virilis we generated Hi-C data to order and orient the published Dvir_caf1 scaffolds into chromosome-length assemblies. Furthermore, we compared Hi-C matrices from these two new assemblies with D. melanogaster with respect to synteny blocks and dosage compensation as a chromosome-wide gene-regulatory mechanism.