Project description:In the last few decades, reverse genetic and high throughput approaches have been frequently applied to the mouse (Mus musculus) to understand how genes function in tissues/organs and during development in a mammalian system. Despite these efforts, the associated phenotypes for the majority of mouse genes remained to be fully characterized. Here, we performed an integrated transcriptome-phenome analysis by identifying coexpressed gene modules based on tissue transcriptomes profiled with each of various platforms and functionally interpreting these modules using the mouse phenotypic data. Consequently, >15,000 mouse genes were linked with at least one of the 47 tissue functions that were examined. Specifically, our approach predicted >50 genes previously unknown to be involved in mice (Mus musculus) visual functions. Fifteen genes were selected for further analysis based on their potential biomedical relevance and compatibility with further experimental validation. Gene-specific morpholinos were introduced into zebrafish (Danio rerio) to target their corresponding orthologs. Quantitative assessments of phenotypes of developing eyes confirmed predicted eye-related functions of 13 out of the 15 genes examined. These novel eye genes include: Adal, Ankrd33, Car14, Ccdc126, Dhx32, Dkk3, Fam169a, Grifin, Kcnj14, Lrit2, Ppef2, Ppm1n, and Wdr17. The results highlighted the potential for this phenome-based approach to assist the experimental design of mutating and phenotyping mouse genes that aims to fully reveal the functional landscape of mammalian genomes.
Project description:The contemporary surge in metagenomic sequencing has transformed knowledge of viral diversity in wildlife. However, evaluating which newly discovered viruses pose sufficient risk of infecting humans to merit detailed laboratory characterization and surveillance remains largely speculative. Machine learning algorithms have been developed to address this imbalance by ranking the relative likelihood of human infection based on viral genome sequences, but are not yet routinely applied to viruses at the time of their discovery. Here, we characterized viral genomes detected through metagenomic sequencing of feces and saliva from common vampire bats (Desmodus rotundus) and used these data as a case study in evaluating zoonotic potential using molecular sequencing data. Of 58 detected viral families, including 17 which infect mammals, the only known zoonosis detected was rabies virus; however, additional genomes were detected from the families Hepeviridae, Coronaviridae, Reoviridae, Astroviridae and Picornaviridae, all of which contain human-infecting species. In phylogenetic analyses, novel vampire bat viruses most frequently grouped with other bat viruses that are not currently known to infect humans. In agreement, machine learning models built from only phylogenetic information ranked all novel viruses similarly, yielding little insight into zoonotic potential. In contrast, genome composition-based machine learning models estimated different levels of zoonotic potential, even for closely related viruses, categorizing one out of four detected hepeviruses and two out of three picornaviruses as having high priority for further research. We highlight the value of evaluating zoonotic potential beyond ad hoc consideration of phylogeny and provide surveillance recommendations for novel viruses in a wildlife host which has frequent contact with humans and domestic animals.
Project description:Pathogenic viruses represent a small fraction of viral diversity, and emerging diseases are frequently the result of cross-species transmissions. Therefore, we need to develop high-throughput techniques to investigate a broader range of viral biodiversity across a greater number of species. This is especially important in the context of new practices in agriculture that have arisen to tackle the challenges of global food security, including the rising number of marine and freshwater species that are used in aquaculture. In this study, we demonstrate the utility of combining evolutionary approaches with bioinformatics to mine non-viral genome data for viruses, by adapting methods from paleovirology. We report the discovery of a new lineage of dsDNA viruses that are associated with at least fifteen different species of fish. This approach also enabled us to simultaneously identify sequences that likely represent endogenous viral elements, which we experimentally confirmed in commercial salmon samples. Moreover, genomic analysis revealed that the endogenous sequences have co-opted PiggyBac-like transposable elements, possibly as a mechanism of intragenomic proliferation. The identification of novel viruses from genome data shows that our approach has applications in genomics, virology, and the development of best practices for aquaculture and farming.
Project description:The collapse of iconic, keystone populations of sockeye (Oncorhynchus nerka) and Chinook (Oncorhynchus tshawytscha) salmon in the Northeast Pacific is of great concern. It is thought that infectious disease may contribute to declines, but little is known about viruses endemic to Pacific salmon. Metatranscriptomic sequencing and surveillance of dead and moribund cultured Chinook salmon revealed a novel arenavirus, reovirus and nidovirus. Sequencing revealed two different arenavirus variants which each infect wild Chinook and sockeye salmon. In situ hybridisation localised arenavirus mostly to blood cells. Population surveys of >6000 wild juvenile Chinook and sockeye salmon showed divergent distributions of viruses, implying different epidemiological processes. The discovery in dead and dying farmed salmon of previously unrecognised viruses that are also widely distributed in wild salmon, emphasizes the potential role that viral disease may play in the population dynamics of wild fish stocks, and the threat that these viruses may pose to aquaculture.
Project description:Effective conservation of endangered species requires knowledge of the full range of life-history strategies used to maximize population resilience within a stochastic and ever-changing environment. California's endemic Delta Smelt (Hypomesus transpacificus) is rapidly approaching extinction in the San Francisco Estuary, placing it in the crossfire between human and environmental uses of limited freshwater resources. Though managed as a semi-anadromous species, recent studies have challenged this lifecycle model for Delta Smelt, suggesting the species is an estuarine resident with several localized "hot-spots" of abundance. Using laser-ablation otolith strontium isotope microchemistry, we discovered three distinct life-history phenotypes including freshwater resident (FWR), brackish-water resident (BWR), and semi-anadromous (SA) fish. We further refined life-history phenotypes using an unsupervised algorithm and hierarchical clustering and found that in the last resilient year-class, the FWR (12%) and BWR (7%) comprised a small portion of the population, while the majority of fish were SA (81%). Furthermore, the semi-anadromous fish could be clustered into at least four additional life-history phenotypes that varied by natal origin, dispersal age and adult salinity history. These diverse life-history strategies should be incorporated into future conservation and management efforts aimed at preventing the extinction of Delta Smelt in the wild.
Project description:Simian hemorrhagic fever virus (SHFV) has caused lethal outbreaks of hemorrhagic disease in captive primates, but its distribution in wild primates has remained obscure. Here, we describe the discovery and genetic characterization by direct pyrosequencing of two novel, divergent SHFV variants co-infecting a single male red colobus monkey from Kibale National Park, Uganda.The viruses were detected directly from blood plasma using pyrosequencing, without prior virus isolation and with minimal PCR amplification. The two new SHFV variants, SHFV-krc1 and SHFV-krc2 are highly divergent from each other (51.9% nucleotide sequence identity) and from the SHFV type strain LVR 42-0/M6941 (52.0% and 51.8% nucleotide sequence identity, respectively) and demonstrate greater phylogenetic diversity within SHFV than has been documented within any other arterivirus. Both new variants nevertheless have the same 3' genomic architecture as the type strain, containing three open reading frames not present in the other arteriviruses.These results represent the first documentation of SHFV in a wild primate and confirm the unusual 3' genetic architecture of SHFV relative to the other arteriviruses. They also demonstrate a degree of evolutionary divergence within SHFV that is roughly equivalent to the degree of divergence between other arterivirus species. The presence of two such highly divergent SHFV variants co-infecting a single individual represents a degree of within-host viral diversity that exceeds what has previously been reported for any arterivirus. These results expand our knowledge of the natural history and diversity of the arteriviruses and underscore the importance of wild primates as reservoirs for novel pathogens.
Project description:BACKGROUND:Viruses are known to be the most abundant organisms on earth, yet little is known about their collective origin and evolutionary history. With exceptionally high rates of genetic mutation and mosaicism, it is not currently possible to resolve deep evolutionary histories of the known major virus groups. Metagenomics offers a potential means of establishing a more comprehensive view of viral evolution as vast amounts of new sequence data becomes available for comparative analysis. RESULTS:Bioinformatic analysis of viral metagenomic sequences derived from a hot, acidic lake revealed a circular, putatively single-stranded DNA virus encoding a major capsid protein similar to those found only in single-stranded RNA viruses. The presence and circular configuration of the complete virus genome was confirmed by inverse PCR amplification from native DNA extracted from lake sediment. The virus genome appears to be the result of a RNA-DNA recombination event between two ostensibly unrelated virus groups. Environmental sequence databases were examined for homologous genes arranged in similar configurations and three similar putative virus genomes from marine environments were identified. This result indicates the existence of a widespread but previously undetected group of viruses. CONCLUSIONS:This unique viral genome carries implications for theories of virus emergence and evolution, as no mechanism for interviral RNA-DNA recombination has yet been identified, and only scant evidence exists that genetic exchange occurs between such distinct virus lineages. REVIEWERS:This article was reviewed by EK, MK (nominated by PF) and AM. For the full reviews, please go to the Reviewers' comments section.
Project description:Viruses of the family Flaviviridae are important pathogens of humans and other animals and are currently classified into four genera. To better understand their diversity, evolutionary history, and genomic flexibility, we used transcriptome sequencing (RNA-seq) to search for the viruses related to the Flaviviridae in a range of potential invertebrate and vertebrate hosts. Accordingly, we recovered the full genomes of five segmented jingmenviruses and 12 distant relatives of the known Flaviviridae ("flavi-like" viruses) from a range of arthropod species. Although these viruses are highly divergent, they share a similar genomic plan and common ancestry with the Flaviviridae in the NS3 and NS5 regions. Remarkably, although these viruses fill in major gaps in the phylogenetic diversity of the Flaviviridae, genomic comparisons reveal important changes in genome structure, genome size, and replication/gene regulation strategy during evolutionary history. In addition, the wide diversity of flavi-like viruses found in invertebrates, as well as their deep phylogenetic positions, suggests that they may represent the ancestral forms from which the vertebrate-infecting viruses evolved. For the vertebrate viruses, we expanded the previously mammal-only pegivirus-hepacivirus group to include a virus from the graceful catshark (Proscyllium habereri), which in turn implies that these viruses possess a larger host range than is currently known. In sum, our data show that the Flaviviridae infect a far wider range of hosts and exhibit greater diversity in genome structure than previously anticipated.The family Flaviviridae of RNA viruses contains several notorious human pathogens, including dengue virus, West Nile virus, and hepatitis C virus. To date, however, our understanding of the biodiversity and evolution of the Flaviviridae has largely been directed toward vertebrate hosts and their blood-feeding arthropod vectors. Therefore, we investigated an expanded group of potential arthropod and vertebrate host species that have generally been ignored by surveillance programs. Remarkably, these species contained diverse flaviviruses and related viruses that are characterized by major changes in genome size and genome structure, such that these traits are more flexible than previously thought. More generally, these data suggest that arthropods may be the ultimate reservoir of the Flaviviridae and related viruses, harboring considerable genetic and phenotypic diversity. In sum, this study revises the traditional view on the evolutionary history, host range, and genomic structures of a major group of RNA viruses.
Project description:The distribution and diversity of RNA viruses in fungi is incompletely understood due to the often cryptic nature of mycoviral infections and the focused study of primarily pathogenic and/or economically important fungi. As most viruses that are known to infect fungi possess either single-stranded or double-stranded RNA genomes, transcriptomic data provides the opportunity to query for viruses in diverse fungal samples without any a priori knowledge of virus infection. Here we describe a systematic survey of all transcriptomic datasets from fungi belonging to the subphylum Pezizomycotina. Using a simple but effective computational pipeline that uses reads discarded during normal RNA-seq analyses, followed by identification of a viral RNA-dependent RNA polymerase (RdRP) motif in de novo assembled contigs, 59 viruses from 44 different fungi were identified. Among the viruses identified, 88% were determined to be new species and 68% are, to our knowledge, the first virus described from the fungal species. Comprehensive analyses of both nucleotide and inferred protein sequences characterize the phylogenetic relationships between these viruses and the known set of mycoviral sequences and support the classification of up to four new families and two new genera. Thus the results provide a deeper understanding of the scope of mycoviral diversity while also increasing the distribution of fungal hosts. Further, this study demonstrates the suitability of analyzing RNA-seq data to facilitate rapid discovery of new viruses.
Project description:The death toll and economic loss resulting from the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic are stark reminders that we are vulnerable to zoonotic viral threats. Strategies are needed to identify and characterize animal viruses that pose the greatest risk of spillover and spread in humans and inform public health interventions. Using expert opinion and scientific evidence, we identified host, viral, and environmental risk factors contributing to zoonotic virus spillover and spread in humans. We then developed a risk ranking framework and interactive web tool, SpillOver, that estimates a risk score for wildlife-origin viruses, creating a comparative risk assessment of viruses with uncharacterized zoonotic spillover potential alongside those already known to be zoonotic. Using data from testing 509,721 samples from 74,635 animals as part of a virus discovery project and public records of virus detections around the world, we ranked the spillover potential of 887 wildlife viruses. Validating the risk assessment, the top 12 were known zoonotic viruses, including SARS-CoV-2. Several newly detected wildlife viruses ranked higher than known zoonotic viruses. Using a scientifically informed process, we capitalized on the recent wealth of virus discovery data to systematically identify and prioritize targets for investigation. The publicly accessible SpillOver platform can be used by policy makers and health scientists to inform research and public health interventions for prevention and rapid control of disease outbreaks. SpillOver is a living, interactive database that can be refined over time to continue to improve the quality and public availability of information on viral threats to human health.