Unknown

Dataset Information

0

New insights into the evolutionary features of viral overlapping genes by discriminant analysis.


ABSTRACT: Overlapping genes originate by a mechanism of overprinting, in which nucleotide substitutions in a pre-existing frame induce the expression of a de novo protein from an alternative frame. In this study, I assembled a dataset of 319 viral overlapping genes, which included 82 overlaps whose expression is experimentally known and the respective 237 homologs. Principal component analysis revealed that overlapping genes have a common pattern of nucleotide and amino acid composition. Discriminant analysis separated overlapping from non-overlapping genes with an accuracy of 97%. When applied to overlapping genes with known genealogy, it separated ancestral from de novo frames with an accuracy close to 100%. This high discriminant power was crucial to computationally design variants of de novo viral proteins known to possess selective anticancer toxicity (apoptin) or protection against neurodegeneration (X protein), as well as to detect two new potential overlapping genes in the genome of the new coronavirus SARS-CoV-2.

SUBMITTER: Pavesi A 

PROVIDER: S-EPMC7157939 | biostudies-literature | 2020 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

New insights into the evolutionary features of viral overlapping genes by discriminant analysis.

Pavesi Angelo A  

Virology 20200402


Overlapping genes originate by a mechanism of overprinting, in which nucleotide substitutions in a pre-existing frame induce the expression of a de novo protein from an alternative frame. In this study, I assembled a dataset of 319 viral overlapping genes, which included 82 overlaps whose expression is experimentally known and the respective 237 homologs. Principal component analysis revealed that overlapping genes have a common pattern of nucleotide and amino acid composition. Discriminant anal  ...[more]

Similar Datasets

| S-EPMC6810714 | biostudies-literature
| S-EPMC7204545 | biostudies-literature
| S-EPMC5127828 | biostudies-literature
| S-EPMC3484049 | biostudies-literature
| S-EPMC6976628 | biostudies-literature
| S-EPMC7125799 | biostudies-literature
| S-EPMC3443069 | biostudies-literature
| S-EPMC2936400 | biostudies-literature
| S-EPMC9367533 | biostudies-literature
2024-03-15 | GSE246420 | GEO