Project description:In recent years, RNA silencing has been studied extensively as a conserved regulatory process in plants. In antiviral silencing, the double-stranded RNA intermediates formed during the replication of RNA viruses are recognized and processed into abundant overlapping viral siRNAs (viRNAs). The cloned viRNAs can therefore be assembled back into viral contigs, an approach that has recently been exploited to identify new viruses and their genome sequences. To rapidly obtain the complete genome sequence of BYSMV, we carried out deep sequencing of small RNAs from healthy and BYSMV-infected wheat. Thirteen contigs were assembled from overlapping viRNAs that were present only in the infected wheat. BLAST results showed that ten contigs shared about 96% identity with the reported L gene of BYSMV isolate Zanjan-1. Viral assembly from the BYSMV-infected wheat plants was used to obtain the full-length genome and characterise the viral siRNAs.
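The subtraction-then-assembly idea described above can be illustrated with a minimal sketch. It assumes reads are plain strings, keeps only sequences absent from the healthy library, and merges them with a naive greedy suffix-prefix overlap. The function names, the `min_overlap` value, and the toy reads are hypothetical; the record does not name the assembler used, and real viRNA assembly would rely on a dedicated short-read assembler.

```python
def infected_only(infected_reads, healthy_reads):
    """Keep unique reads seen in the infected library but not the healthy one."""
    healthy = set(healthy_reads)
    return sorted({r for r in infected_reads if r not in healthy})


def overlap(a, b, min_len):
    """Length of the longest suffix of a that equals a prefix of b (0 if < min_len)."""
    for k in range(min(len(a), len(b)), min_len - 1, -1):
        if a.endswith(b[:k]):
            return k
    return 0


def greedy_assemble(reads, min_overlap=3):
    """Repeatedly merge the pair of sequences with the longest overlap."""
    # Drop reads fully contained in another read; they add no new sequence.
    contigs = [r for r in reads if not any(r != o and r in o for o in reads)]
    contigs = list(dict.fromkeys(contigs))
    while True:
        best = (0, None, None)
        for i, a in enumerate(contigs):
            for j, b in enumerate(contigs):
                if i != j:
                    k = overlap(a, b, min_overlap)
                    if k > best[0]:
                        best = (k, i, j)
        k, i, j = best
        if k == 0:
            return contigs  # no remaining pair overlaps by min_overlap or more
        merged = contigs[i] + contigs[j][k:]
        contigs = [c for n, c in enumerate(contigs) if n not in (i, j)] + [merged]


# Toy data (hypothetical, much shorter than real 21-24 nt viRNAs).
infected = ["ATGCGT", "GCGTAC", "GTACGG", "TTTTTT"]
healthy = ["TTTTTT"]
contigs = greedy_assemble(infected_only(infected, healthy), min_overlap=3)
print(contigs)
```

Greedy merging is quadratic in the number of reads and ignores sequencing errors, so it serves only to show why overlapping viRNAs can tile a viral genome.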
Project description:The naked mole-rat (NMR; Heterocephalus glaber) has recently gained considerable attention in the scientific community for its potential to yield novel insights in medicine, biochemistry, and evolution. NMRs exhibit unusual adaptations that include protracted fertility, cancer resistance, eusociality, and anoxia tolerance. This suite of adaptations is not found in other rodent species, suggesting that interrogating conserved and accelerated regions in the NMR genome will reveal regions fundamental to these adaptations. However, the current NMR genome assembly has limitations that make studying structural variations, heterozygosity, and non-coding adaptations challenging. We present a complete diploid naked mole-rat genome assembly generated by integrating long-read and 10X linked-read genome sequencing of a male NMR and its parents with Hi-C sequencing of the NMR hypothalamus (N=2). Reads were identified as maternal, paternal or ambiguous (TrioCanu). We then polished the genomes with Flye, Racon and Medaka. Assemblies were scaffolded using the following tools in order: Scaff10X, Salsa2, 3d-DNA, Minimap2 alignment between assemblies, and the Juicebox Assembly Tools. We then subjected the assemblies to another round of polishing, including short-read polishing with Freebayes. We assembled the NMR mitochondrial genome with mitoVGP. Y chromosome contigs were identified by aligning male and female 10X linked reads to the paternal genome and finding male-biased contigs not present in the maternal genome. Contigs were assembled with publicly available male NMR fibroblast Hi-C-seq data (SRR820318). The sex chromosome haplotypes were merged so that both assemblies have a high-quality X and Y chromosome. Finally, assemblies were evaluated with Quast, BUSCO, and Merqury, all of which indicated high base-pair quality and contiguity for both assemblies.
The assembly will next be annotated by Ensembl using public RNA-seq data from multiple tissues (SRP061363). Together, this assembly will provide a high-quality resource to the NMR and comparative genomics communities.
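The male-biased contig screen mentioned above can be sketched as a simple depth-ratio filter. This is an illustrative sketch only: it assumes per-contig mean read depths for the male and female libraries have already been computed and normalised for library size, and the thresholds, pseudocount, function name, and contig names are hypothetical rather than taken from the record.

```python
def male_biased_contigs(male_depth, female_depth,
                        min_ratio=4.0, min_male_depth=5.0, pseudocount=0.5):
    """Return contigs whose male/female depth ratio suggests Y linkage.

    male_depth and female_depth map contig name -> normalised mean depth.
    All thresholds are illustrative assumptions, not values from the study.
    """
    flagged = []
    for contig, m in male_depth.items():
        f = female_depth.get(contig, 0.0)
        # The pseudocount avoids division by zero when no female reads align.
        ratio = (m + pseudocount) / (f + pseudocount)
        if m >= min_male_depth and ratio >= min_ratio:
            flagged.append(contig)
    return sorted(flagged)


# Toy depths (hypothetical): ctg2 is covered in the male only.
male = {"ctg1": 30.0, "ctg2": 28.0, "ctg3": 15.0, "ctg4": 2.0}
female = {"ctg1": 31.0, "ctg2": 0.0, "ctg3": 30.0}
print(male_biased_contigs(male, female))
```

In this toy input, a contig at roughly half the female depth in the male (`ctg3`) behaves like X-linked sequence and is not flagged, while a low-coverage contig (`ctg4`) is excluded by the minimum-depth guard.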
Project description:Traveler's diarrhea (TD) is caused by enterotoxigenic Escherichia coli (ETEC), other pathogenic Gram-negative bacteria, norovirus and some parasites. Nevertheless, standard diagnostic methods fail to identify pathogens in more than 30% of TD patients, so new pathogens or groups of pathogens may be causative agents of disease. We performed a comprehensive metagenomic study of the fecal microbiomes from 23 TD patients and seven healthy travelers, all of whom tested negative for the known etiologic agents of TD in standard tests. Metagenomic reads were assembled and the resulting contigs were subjected to semi-manual binning to assemble independent genomes from metagenomic pools. Taxonomic and functional annotations were conducted to assist identification of putative pathogens. We extracted 560 draft genomes, 320 of which were complete enough to be characterized as cellular genomes and 160 of which were bacteriophage genomes. We made predictions of the etiology of disease in individual subjects based on the properties and features of the recovered cellular genomes. Three subtypes of samples were observed. First, four patients had low-diversity metagenomes that were dominated by one or more pathogenic E. coli strains. Annotation allowed prediction of the pathogenic type in most cases. Second, five patients were co-infected with E. coli and other members of the Enterobacteriaceae, including antibiotic-resistant Enterobacter, Klebsiella, and Citrobacter. Finally, several samples contained genomes that represented dark matter. In one of these samples we identified a TM7 genome that clustered phylogenetically with a strain isolated from wastewater and carries genes encoding potential virulence factors. We also observed a very high proportion of bacteriophage reads in some samples. The relative abundance of phage was significantly higher in healthy travelers than in TD patients.
Our assembly-based analysis revealed that diarrhea is often polymicrobial and includes members of the Enterobacteriaceae not normally associated with TD, and implicated a new member of the TM7 phylum as a potential player in diarrheal disease.
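The phage comparison between healthy travelers and TD patients can be sketched as follows. The sketch assumes each sample is summarised by a count of phage-assigned reads and a total read count, and it uses the Mann-Whitney U statistic as one plausible way to compare the two groups; the record does not state which test was used, and the function names and read counts below are hypothetical.

```python
def relative_abundance(phage_reads, total_reads):
    """Fraction of a sample's reads assigned to bacteriophage genomes."""
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    return phage_reads / total_reads


def mann_whitney_u(xs, ys):
    """U statistic for xs versus ys: pairs where x > y count 1, ties count 0.5."""
    u = 0.0
    for x in xs:
        for y in ys:
            if x > y:
                u += 1.0
            elif x == y:
                u += 0.5
    return u


# Toy (phage_reads, total_reads) pairs; hypothetical, not data from the study.
healthy = [relative_abundance(p, t) for p, t in [(300, 1000), (250, 1000)]]
patients = [relative_abundance(p, t) for p, t in [(50, 1000), (100, 1000), (280, 1000)]]

u = mann_whitney_u(healthy, patients)
print(u, "out of a maximum of", len(healthy) * len(patients))
```

A U value close to the product of the two group sizes indicates that phage fractions in the first group tend to exceed those in the second; turning it into a p-value requires the reference distribution, which is omitted here.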