Project description:Fungal mitochondrial genes are often invaded by group I or II introns, which represent an ideal marker for understanding fungal evolution. A standard nomenclature of mitochondrial introns is needed to avoid confusion when comparing different fungal mitogenomes. Currently, there has been a standard nomenclature for introns present in rRNA genes, but there is a lack of a standard nomenclature for introns present in protein-coding genes. In this study, we propose a new nomenclature system for introns in fungal mitochondrial protein-coding genes based on (1) three-letter abbreviation of host scientific name, (2) host gene name, (3), one capital letter P (for group I introns), S (for group II introns), or U (for introns with unknown types), and (4) intron insertion site in the host gene according to the cyclosporin-producing fungus Tolypocladium inflatum. The suggested nomenclature was proved feasible by naming introns present in mitogenomes of 16 fungi of different phyla, including both basal and higher fungal lineages although minor adjustment of the nomenclature is needed to fit certain special conditions. The nomenclature also had the potential to name plant/protist/animal mitochondrial introns. We hope future studies follow the proposed nomenclature to ensure direct comparison across different studies.
Project description:Fungi comprise a great diversity of species with distinct ecological functions and lifestyles. Similar to other eukaryotes, fungi rely on interactions with prokaryotes and one of the most important symbiotic events was the acquisition of mitochondria. Mitochondria are organelles found in eukaryotic cells whose main function is to generate energy through aerobic respiration. Mitogenomes (mtDNAs) are double-stranded circular or linear DNA from mitochondria that may contain core genes and accessory elements that can be replicated, transcribed, and independently translated from the nuclear genome. Despite their importance, investigative studies on the diversity of fungal mitogenomes are scarce. Herein, we have evaluated 788 curated fungal mitogenomes available at NCBI database to assess discrepancies and similarities among them and to better understand the mechanisms involved in fungal mtDNAs variability. From a total of 12 fungal phyla, four do not have any representative with available mitogenomes, which highlights the underrepresentation of some groups in the current available data. We selected representative and non-redundant mitogenomes based on the threshold of 90% similarity, eliminating 81 mtDNAs. Comparative analyses revealed considerable size variability of mtDNAs with a difference of up to 260 kb in length. Furthermore, variation in mitogenome length and genomic composition are generally related to the number and length of accessory elements (introns, HEGs, and uORFs). We identified an overall average of 8.0 (0-39) introns, 8.0 (0-100) HEGs, and 8.2 (0-102) uORFs per genome, with high variation among phyla. Even though the length of the core protein-coding genes is considerably conserved, approximately 36.3% of the mitogenomes evaluated have at least one of the 14 core coding genes absent. Also, our results revealed that there is not even a single gene shared among all mitogenomes. Other unusual genes in mitogenomes were also detected in many mitogenomes, such as dpo and rpo, and displayed diverse evolutionary histories. Altogether, the results presented in this study suggest that fungal mitogenomes are diverse, contain accessory elements and are absent of a conserved gene that can be used for the taxonomic classification of the Kingdom Fungi.
Project description:Saccharomycotina and Taphrinomycotina lack intron in their histone genes, except for an intron in one of histone H4 genes of Yarrowia lipolytica. On the other hand, Basidiomycota and Perizomycotina have introns in their histone genes. We compared the distributions of 81, 47, 79, and 98 introns in the fungal histone H2A, H2B, H3, and H4 genes, respectively. Based on the multiple alignments of the amino acid sequences of histones, we identified 19, 13, 31, and 22 intron insertion sites in the histone H2A, H2B, H3, and H4 genes, respectively. Surprisingly only one hot spot of introns in the histone H2A gene is shared between Basidiomycota and Perizomycotina, suggesting that most of introns of Basidiomycota and Perizomycotina were acquired independently. Our findings suggest that the common ancestor of Ascomycota and Basidiomycota maybe had a few introns in the histone genes. In the course of fungal evolution, Saccharomycotina and Taphrinomycotina lost the histone introns; Basidiomycota and Perizomycotina acquired other introns independently. In addition, most of the introns have sequence similarity among introns of phylogenetically close species, strongly suggesting that horizontal intron transfer events between phylogenetically distant species have not occurred recently in the fungal histone genes.
Project description:In grapevine, serine peptidases from the subtilase family were recently associated to Plasmopara viticola resistance. This family in grapevine, first characterized in 2014, was re-analyzed last year and 82 subtilase genes were identified. However, in November of 2016, the National Center for Biotechnology Information database (NCBI) made a new public release of the grapevine genome annotation based on new sequencing data and better prediction algorithms. As a consequence, some gene annotations and lengths changed. Here we present an update to the grapevine subtilase gene family sequences (SBT), namely sequence identifiers, bioinformatic predictions and recommend a nomenclature for the grapevine SBT genes. Our results show that grapevine subtilase gene family is now constituted by 87 subtilase genes encoding for 109 subtilase proteins and, despite the reported alterations, expression data on subtilases associated to grapevine resistance to P. viticola pathosystem did not suffer any alteration.
Project description:Long non-coding RNAs (lncRNAs) are recognized as an important class of regulatory molecules involved in a variety of biological functions. However, the regulatory mechanisms of long non-coding genes expression are still poorly understood. The characterization of the genomic features of lncRNAs is crucial to get insight into their function. In this study, we exploited recent annotations by GENCODE to characterize the genomic and splicing features of long non-coding genes in comparison with protein-coding ones, both in human and mouse. Our analysis highlighted differences between the two classes of genes in terms of their gene architecture. Significant differences in the splice sites usage were observed between long non-coding and protein-coding genes (PCG). While the frequency of non-canonical GC-AG splice junctions represents about 0.8% of total splice sites in PCGs, we identified a significant enrichment of the GC-AG splice sites in long non-coding genes, both in human (3.0%) and mouse (1.9%). In addition, we found a positional bias of GC-AG splice sites being enriched in the first intron in both classes of genes. Moreover, a significant shorter length and weaker donor and acceptor sites were found comparing GC-AG introns to GT-AG introns. Genes containing at least one GC-AG intron were found conserved in many species, more prone to alternative splicing and a functional analysis pointed toward their enrichment in specific biological processes such as DNA repair. Our study shows for the first time that GC-AG introns are mainly associated with lncRNAs and are preferentially located in the first intron. Additionally, we discovered their regulatory potential indicating the existence of a new mechanism of non-coding and PCGs expression regulation.
Project description:Plant genomes present a continuous range of variation in nucleotide composition (G + C content). In coding regions, G + C-poor species tend to have unimodal distributions of G + C content among genes within genomes and slight 5'-3' gradients along genes. In contrast, G + C-rich species display bimodal distributions of G + C content among genes and steep 5'-3' decreasing gradients along genes. The causes of these peculiar patterns are still poorly understood. Within two species (Arabidopsis thaliana and rice), each representative of one side of the continuum, we studied the consequences of intron presence on coding region and intron G + C content at different scales. By properly taking intron structure into account, we showed that, in both species, intron presence is associated with step changes in nucleotide, codon, and amino acid composition. This suggests that introns have a barrier effect structuring G + C content along genes and that previous continuous characterizations of the 5'-3' gradients were artifactual. In external gene regions (located upstream first or downstream last introns), species-specific factors, such as GC-biased gene conversion, are shaping G + C content whereas in internal gene regions (surrounded by introns), G + C content is likely constrained to remain within a range common to both species.
Project description:In this study, all available cytochrome b (Cyt b) genes from the GOBASE database were compiled and the evolutionary dynamics of the Cyt b gene introns was assessed. Cyt b gene introns were frequently present in the fungal kingdom and some lower plants, but generally absent or rare in Chromista, Protozoa, and Animalia. Fungal Cyt b introns were found at 35 positions in Cyt b genes and the number of introns varied at individual positions from a single representative to 32 different introns at position 131, showing a wide and patchy distribution. Many homologous introns were present at the same position in distantly related species but absent in closely related species, suggesting that introns of the Cyt b genes were frequently lost. On the other hand, highly similar intron sequences were observed in some distantly related species rather than in closely related species, suggesting that these introns were gained independently, likely through lateral transfers. The intron loss-and-gain events could be mediated by transpositions that might have occurred between nuclear and mitochondria. Southern hybridization analysis confirmed that some introns contained repetitive sequences and might be transposable elements. An intron gain in Botryotinia fuckeliana prevented the development of QoI fungicide resistance, suggesting that intron loss-and-gain events were not necessarily beneficial to their host organisms.
Project description:Here a new, systematic, unambiguous and unified nomenclature for multicomponent materials is presented. The approach simplifies naming schemes of extraordinary co-crystals containing multiple building blocks with different charges. Although the presented examples of cytosine compounds cannot cover all possibilities, they clearly show that the new nomenclature is flexible and can be easily extended to other multicomponent materials.
Project description:Noroviruses belong to a genus of genetically diverse viruses within the family Caliciviridae and cause acute gastroenteritis in humans and animals. They are subdivided into genogroups, each of which further segregates into genotypes. Until recently, a new genotype was based on a defined pairwise distance cutoff of complete VP1 sequences, but with the increasing number of available norovirus sequences, this cutoff is no longer accurate, and sequences in the public database have been misclassified. In this paper, we demonstrate that the pairwise distance cutoff method can no longer be used and outline a phylogenetic approach to classify noroviruses. Furthermore, we propose a dual nomenclature using both ORF1 and VP1 sequences, as recombination is common and recognizing recombinant viruses may be relevant. With the continuing emergence of new norovirus lineages, we propose to coordinate nomenclature of new norovirus genotypes through an international norovirus working group.