Project description:Nearly complete sequences of simian immunodeficiency viruses (SIVs) infecting 18 different nonhuman primate species in sub-Saharan Africa have now been reported; yet, our understanding of the origins, evolutionary history, and geographic distribution of these viruses still remains fragmentary. Here, we report the molecular characterization of a lentivirus (SIVdeb) naturally infecting De Brazza's monkeys (Cercopithecus neglectus). Complete SIVdeb genomes (9,158 and 9227 bp in length) were amplified from uncultured blood mononuclear cell DNA of two wild-caught De Brazza's monkeys from Cameroon. In addition, partial pol sequences (650 bp) were amplified from four offspring of De Brazza's monkeys originally caught in the wild in Uganda. Full-length (9068 bp) and partial pol (650 bp) SIVsyk sequences were also amplified from Sykes's monkeys (Cercopithecus albogularis) from Kenya. Analysis of these sequences identified a new SIV clade (SIVdeb), which differed from previously characterized SIVs at 40 to 50% of sites in Pol protein sequences. The viruses most closely related to SIVdeb were SIVsyk and members of the SIVgsn/SIVmus/SIVmon group of viruses infecting greater spot-nosed monkeys (Cercopithecus nictitans), mustached monkeys (Cercopithecus cephus), and mona monkeys (Cercopithecus mona), respectively. In phylogenetic trees of concatenated protein sequences, SIVdeb, SIVsyk, and SIVgsn/SIVmus/SIVmon clustered together, and this relationship was highly significant in all major coding regions. Members of this virus group also shared the same number of cysteine residues in their extracellular envelope glycoprotein and a high-affinity AIP1 binding site (YPD/SL) in their p6 Gag protein, as well as a unique transactivation response element in their viral long terminal repeat; however, SIVdeb and SIVsyk, unlike SIVgsn, SIVmon, and SIVmus, did not encode a vpu gene. These data indicate that De Brazza's monkeys are naturally infected with SIVdeb, that this infection is prevalent in different areas of the species' habitat, and that geographically diverse SIVdeb strains cluster in a single virus group. The consistent clustering of SIVdeb with SIVsyk and the SIVmon/SIVmus/SIVgsn group also suggests that these viruses have evolved from a common ancestor that likely infected a Cercopithecus host in the distant past. The vpu gene appears to have been acquired by a subset of these Cercopithecus viruses after the divergence of SIVdeb and SIVsyk.

Project description:The naked mole-rat (NMR; Heterocephalus glaber) has recently gained considerable attention in the scientific community for its unique potential to unveil novel insights in the fields of medicine, biochemistry, and evolution. NMRs exhibit unique adaptations that include protracted fertility, cancer resistance, eusociality, and anoxia. This suite of adaptations is not found in other rodent species, suggesting that interrogating conserved and accelerated regions in the NMR genome will find regions of the NMR genome fundamental to their unique adaptations. However, the current NMR genome assembly has limits that make studying structural variations, heterozygosity, and non-coding adaptations challenging. We present a complete diploid naked-mole rat genome assembly by integrating long-read and 10X-linked read genome sequencing of a male NMR and its parents, and Hi-C sequencing in the NMR hypothalamus (N=2). Reads were identified as maternal, paternal or ambiguous (TrioCanu). We then polished genomes with Flye, Racon and Medaka. Assemblies were then scaffolded using the following tools in order: Scaff10X, Salsa2, 3d-DNA, Minimap2-alignment between assemblies, and the Juicebox Assembly Tools. We then subjected the assemblies to another round of polishing, including short-read polishing with Freebayes. We assembled the NMR mitochondrial genome with mitoVGP. Y chromosome contigs were identified by aligning male and female 10X linked reads to the paternal genome and finding male-biased contigs not present in the maternal genome. Contigs were assembled with publicly available male NMR Fibroblast Hi-C-seq data (SRR820318). Both assemblies have their sex chromosome haplotypes merged so that both assemblies have a high-quality X and Y chromosome. Finally, assemblies were evaluated with Quast, BUSCO, and Merqury, which all reported the base-pair quality and contiguity of both assemblies as high-quality. The assembly will next be annotated by Ensembl using public RNA-seq data from multiple tissues (SRP061363). Together, this assembly will provide a high-quality resource to the NMR and comparative genomics communities.

			Action	DRS
		Other
	JABBCR01.dat.gz	Other
	JABBCR01.fasta.gz	Fasta.gz
	JABBCR01.master.dat	Other
	SRR11411754_1.fastq.gz	Fastqsanger.gz

Dataset Information

Cercopithecus mona

Dataset's files

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets