
Dataset Information


OLGenie: Estimating Natural Selection to Predict Functional Overlapping Genes.


ABSTRACT: Purifying (negative) natural selection is a hallmark of functional biological sequences, and can be detected in protein-coding genes using the ratio of nonsynonymous to synonymous substitutions per site (dN/dS). However, when two genes overlap the same nucleotide sites in different frames, synonymous changes in one gene may be nonsynonymous in the other, perturbing dN/dS. Thus, scalable methods are needed to estimate functional constraint specifically for overlapping genes (OLGs). We propose OLGenie, which implements a modification of the Wei-Zhang method. Assessment with simulations and controls from viral genomes (58 OLGs and 176 non-OLGs) demonstrates low false-positive rates and good discriminatory ability in differentiating true OLGs from non-OLGs. We also apply OLGenie to the unresolved case of HIV-1's putative antisense protein gene, showing significant purifying selection. OLGenie can be used to study known OLGs and to predict new OLGs in genome annotation. Software and example data are freely available at https://github.com/chasewnelson/OLGenie (last accessed April 10, 2020).
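The frame dependence described in the abstract can be illustrated with a small sketch. This is not OLGenie's implementation or the Wei-Zhang method; it only shows that a single nucleotide change can be synonymous in one reading frame and nonsynonymous in an overlapping one. The function name `classify_change`, the toy sequence, and the choice of frames are assumptions made for illustration; the codon table is the standard genetic code.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def classify_change(seq, pos, new_base, frame_start):
    """Classify a single-base change at index `pos` (0-based) as
    'synonymous' or 'nonsynonymous' in the reading frame whose first
    codon begins at index `frame_start`. Hypothetical helper."""
    offset = (pos - frame_start) % 3      # position within the codon
    codon_start = pos - offset            # index of the codon's first base
    if codon_start < frame_start or codon_start + 3 > len(seq):
        raise ValueError("position is not inside a complete codon of this frame")
    old_codon = seq[codon_start:codon_start + 3]
    new_codon = old_codon[:offset] + new_base + old_codon[offset + 1:]
    same = CODON_TABLE[old_codon] == CODON_TABLE[new_codon]
    return "synonymous" if same else "nonsynonymous"


# Toy sequence (an assumption, not real data).
# Frame starting at 0: ATG GCT CGA; frame starting at 1: TGG CTC (GA).
seq = "ATGGCTCGA"
in_frame_0 = classify_change(seq, 5, "C", frame_start=0)  # GCT -> GCC (Ala -> Ala)
in_frame_1 = classify_change(seq, 5, "C", frame_start=1)  # CTC -> CCC (Leu -> Pro)
print(in_frame_0, in_frame_1)
```

The same T-to-C change at index 5 falls on a third codon position in one frame and a second codon position in the other, which is why a naive dN/dS computed for either gene alone is perturbed when the two overlap.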

SUBMITTER: Nelson CW 

PROVIDER: S-EPMC7531306 | biostudies-literature | 2020 Aug

REPOSITORIES: biostudies-literature


Publications

OLGenie: Estimating Natural Selection to Predict Functional Overlapping Genes.

Nelson CW, Ardern Z, Wei X

Molecular Biology and Evolution, 2020-08-01, issue 8



Similar Datasets

| S-EPMC4316641 | biostudies-literature
| S-EPMC5850746 | biostudies-literature
| S-EPMC5287106 | biostudies-literature
| S-EPMC2972474 | biostudies-literature
| S-EPMC3048381 | biostudies-literature
| S-EPMC2474766 | biostudies-literature
2022-03-21 | PXD023992 | Pride
2021-11-05 | GSE186959 | GEO