Project description:Citrus tristeza virus (CTV), a member of the aphid-transmitted closterovirus group, is the causal agent of the notorious tristeza disease in several citrus species worldwide. The codon usage patterns of viruses reflect the evolutionary changes for optimization of their survival and adaptation in their fitness to the external environment and the hosts. The codon usage adaptation of CTV to specific citrus hosts remains to be studied; thus, its role in CTV evolution is not clearly comprehended. Therefore, to better explain the host⁻virus interaction and evolutionary history of CTV, the codon usage patterns of the coat protein (CP) genes of 122 CTV isolates originating from three economically important citrus hosts (55 isolate from Citrus sinensis, 38 from C. reticulata, and 29 from C. aurantifolia) were studied using several codon usage indices and multivariate statistical methods. The present study shows that CTV displays low codon usage bias (CUB) and higher genomic stability. Neutrality plot and relative synonymous codon usage analyses revealed that the overall influence of natural selection was more profound than that of mutation pressure in shaping the CUB of CTV. The contribution of high-frequency codon analysis and codon adaptation index value show that CTV has host-specific codon usage patterns, resulting in higheradaptability of CTV isolates originating from C. reticulata (Cr-CTV), and low adaptability in the isolates originating from C. aurantifolia (Ca-CTV) and C. sinensis (Cs-CTV). The combination of codon analysis of CTV with citrus genealogy suggests that CTV evolved in C. reticulata or other Citrus progenitors. The outcome of the study enhances the understanding of the factors involved in viral adaptation, evolution, and fitness toward their hosts. This information will definitely help devise better management strategies of CTV.
Project description:Venezuelan equine encephalitis virus (VEEV) is an emerging zoonotic virus in the alphavirus genus. It can be transmitted to humans due to spillover from equid-mosquito cycles. The symptoms caused by VEEV include fever, headache, myalgia, nausea, and vomiting. It can also cause encephalitis in severe cases. The evolutionary features of VEEV are largely unknown. In this study, we comprehensively analyzed the codon usage pattern of VEEV by computing a variety of indicators, such as effective number of codons (ENc), codon adaptation index (CAI), relative synonymous codon usage (RSCU), on 130 VEEV coding sequences retrieved from GenBank. The results showed that the codon usage bias of VEEV is relatively low. ENc-GC3s plot, neutrality plot, and CAI-ENc correlation analyses supported that translational selection plays an important role in shaping the codon usage pattern of VEEV whereas the mutation pressure has a minor influence. Analysis of RSCU values showed that most of the preferred codons in VEEV are C/G-ended. Analysis of dinucleotide composition found that all CG- and UA-containing codons are not preferentially used. Phylogenetic analysis showed that VEEV isolates can be clustered into three genera and evolutionary force affects the codon usage pattern. Furthermore, a correspondence analysis (COA) showed that aromaticity and hydrophobicity as well as geographical distribution also have certain effects on the codon usage variation of VEEV, suggesting the possible involvement of translational selection. Overall, the codon usage of VEEV is comparatively slight and translational selection might be the main factor that shapes the codon usage pattern of VEEV. This study will promote our understanding about the evolution of VEEV and its host adaption, and might provide some clues for preventing the cross-species transmission of VEEV.
Project description:BackgroundFlaviviridae viruses are single-stranded, positive-sense RNA viruses, which threat human constantly mediated by mosquitoes, ticks, and sandflies. Considering the recent increase in the prevalence of the family virus and its risk potential, we investigated the codon usage pattern to understand its evolutionary processes and provide some useful data to develop the medications for most of Flaviviridae viruses.ResultsThe overall extent of codon usage bias in 65 Flaviviridae viruses is low with the average value of GC contents being 50.5% and the highest value being 55.9%; the lowest value is 40.2%. ENC values of Flaviviridae virus genes vary from 48.75 to 57.83 with a mean value of 55.56. U- and A-ended codons are preferred in the Flaviviridae virus. Correlation analysis shows that the positive correlation between ENC value and GC content at the third nucleotide positions was significant in this family virus. The result of analysis of ENC, neutrality plot analysis, and correlation analysis revealed that codon usage bias of all the viruses was affected mainly by natural selection. Meanwhile, according to correspondence analysis (CoA) based on RSCU and phylogenetic analysis, the Flaviviridae viruses mainly are made up of two groups, Group I (Yellow fever virus, Apoi virus, Tembusu virus, Dengue virus 1, and others) and Group II (West Nile virus lineage 2, Japanese encephalitis virus, Usutu virus, Kedougou virus, and others).ConclusionsAll in, the bias of codon usage pattern is affected not only by compositional constraints but also by natural selection. Phylogenetic analysis also illustrates that codon usage bias of virus can serve as an effective means of evolutionary classification in Flaviviridae virus.
Project description:Lassa virus genome consists of two single-stranded, negative-sense RNA segments that lie in the genus Arenavirus. The disease associated with the Lassa virus is distributed all over the world, with approximately 3,000,000-5,000,000 infections diagnosed annually in West Africa. It shows high health risks to the human being. Previous research used the evolutionary time scale and adaptive evolution to describe the Lassa virus population pattern. However, it is still unclear how the Lassa virus takes advantage of synonymous codons. In this study, we analyzed the codon usage bias in 162 Lassa virus strains by calculating and comparing the nucleotide contents, effective number of codons (ENC), codon adaptation index (CAI), relative synonymous codon usage (RSCU), and others. The results disclosed that LASV strains are rich in A/T. The average ENC value indicated a low codon usage bias in LASVs. The ENC-plot, neutrality plot and parity rule 2 plot demonstrated that, besides mutational pressure, other factors like natural selection also contributed to codon usage bias. This study is significant because it described the pattern of codon usage in the genomes of the Lassa viruses and provided the information needed for a fundamental evolutionary study of them.
Project description:Genomes of different sizes and complexity can be compared using common features. Most genomes contain open reading frames, and most genomes use the same genetic code. Redundancy in the genetic code means that different biases in the third nucleotide position of a codon exist in different genomes. However, the nucleotide composition of viruses can be quite different from host nucleotide composition making it difficult to assess the relevance of these biases. Here we show that grouping codons of a codon-pair according to the GC content of the first two nucleotide positions of each codon reveals patterns in nucleotide usage at the third position of the 1st codon. Differences between the observed and expected biases occur predominantly when the first two nucleotides of the 2nd codon are both S (strong, G or C) or both W (weak, A or T), not a mixture of strong and weak. The data indicates that some codon pairs are preferred because of the strength of the interactions between the codon and anticodon, the adjacent tRNAs and the ribosome. Using base-pairing strength and third position bias facilitates the comparison of genomes of different size and nucleotide composition and reveals patterns not previously described.
Project description:Background and aimClassical swine fever (CSF), caused by CSF virus (CSFV), is a highly contagious disease in pigs causing 100% mortality in susceptible adult pigs and piglets. High mortality rate in pigs causes huge economic loss to pig farmers. CSFV has a positive-sense RNA genome of 12.3 kb in length flanked by untranslated regions at 5' and 3' end. The genome codes for a large polyprotein of 3900 amino acids coding for 11 viral proteins. The 1300 codons in the polyprotein are coded by different combinations of three nucleotides which help the infectious agent to evolve itself and adapt to the host environment. This study performed and employed various methods/techniques to estimate the changes occurring in the process of CSFV evolution by analyzing the codon usage pattern.Materials and methodsThe evolution of viruses is widely studied by analyzing their nucleotides and coding regions/codons using various methods. A total of 115 complete coding regions of CSFVs including one complete genome from our laboratory (MH734359) were included in this study and analysis was carried out using various methods in estimating codon usage bias and evolution. This study elaborates on the factors that influence the codon usage pattern.ResultsThe effective number of codons (ENC) and relative synonymous codon usage showed the presence of codon usage bias. The mononucleotide (A) has a higher frequency compared to the other mononucleotides (G, C, and T). The dinucleotides CG and CC are underrepresented and overrepresented. The codons CGT was underrepresented and AGG was overrepresented. The codon adaptation index value of 0.71 was obtained indicating that there is a similarity in the codon usage bias. The principal component analysis, ENC-plot, Neutrality plot, and Parity Rule 2 plot produced in this article indicate that the CSFV is influenced by the codon usage bias. The mutational pressure and natural selection are the important factors that influence the codon usage bias.ConclusionThe study provides useful information on the codon usage analysis of CSFV and may be utilized to understand the host adaptation to virus environment and its evolution. Further, such findings help in new gene discovery, design of primers/probes, design of transgenes, determination of the origin of species, prediction of gene expression level, and gene function of CSFV. To the best of our knowledge, this is the first study on codon usage bias involving such a large number of complete CSFVs including one sequence of CSFV from India.
Project description:Bluetongue virus (BTV) is a double-stranded RNA virus with multiple segments and belongs to the genus Orbivirus within the family Reoviridae. BTV is spread to livestock through its dominant vector, biting midges of genus Culicoides. Although great progress has been made in genomic analyses, it is not fully understood how BTVs adapt to their hosts and evade the host's immune systems. In this study, we retrieved BTV genome sequences from the National Center for Biotechnology Information (NCBI) database and performed a comprehensive research to explore the codon usage patterns in 50 BTV strains. We used bioinformatic approaches to calculate the relative synonymous codon usage (RSCU), codon adaptation index (CAI), effective number of codons (ENC), and other indices. The results indicated that most of the overpreferred codons had A-endings, which revealed that mutational pressure was the major force shaping codon usage patterns in BTV. However, the influence of natural selection and geographical factors cannot be ignored on viral codon usage bias. Based on the RSCU values, we performed a comparative analysis between BTVs and their hosts, suggesting that BTVs were inclined to evolve their codon usage patterns that were comparable to those of their hosts. Such findings will be conducive to understanding the elements that contribute to viral evolution and adaptation to hosts.
Project description:Eighteen of the 20 amino acids are each encoded by more than one synonymous codon. Due to differential transfer RNA supply within the cell, synonymous codons are not used with equal frequency, a phenomenon termed codon usage bias (CUB). Previous studies have demonstrated that CUB of endogenous genes trans-regulates the translational efficiency of other genes. We hypothesized similar effects for CUB of exogenous genes on host translation, and tested it in the case of viral infection, a common form of naturally occurring exogenous gene translation. We analysed public Ribo-Seq datasets from virus-infected yeast and human cells and showed that virus CUB trans-regulated tRNA availability, and therefore the relative decoding time of codons. Manipulative experiments in yeast using 37 synonymous fluorescent proteins confirmed that an exogenous gene with CUB more similar to that of the host would apply decreased translational load on the host per unit of expression, whereas expression of the exogenous gene was elevated. The combination of these two effects was that exogenous genes with CUB overly similar to that of the host severely impeded host translation. Finally, using a manually curated list of viruses and natural and symptomatic hosts, we found that virus CUB tended to be more similar to that of symptomatic hosts than that of natural hosts, supporting a general deleterious effect of excessive CUB similarity between virus and host. Our work revealed repulsion between virus and host CUBs when they are overly similar, a previously unrecognized complexity in the coevolution of virus and host.
Project description:Viruses show noticeable evolution to adapt and reproduce within their hosts. Theoretically, patterns and factors that affect the codon usage of viruses should reflect evolutionary changes that allow them to optimize their codon usage to their hosts. Some software tools can analyze the codon usage of organisms; however, their performance has room for improvement, as these tools do not focus on examining the codon usage co-adaptation between viruses and their hosts. This paper describes the vhcub R package, which is a crucial tool used to analyze the co-adaptation of codon usage between a virus and its host, with several implementations of indices and plots. The tool is available from: https://cran.r-project.org/web/packages/vhcub/.