Unknown

Dataset Information

0

A scale-free analysis of the HIV-1 genome demonstrates multiple conserved regions of structural and functional importance.


ABSTRACT: HIV-1 replicates via a low-fidelity polymerase with a high mutation rate; strong conservation of individual nucleotides is highly indicative of the presence of critical structural or functional properties. Identifying such conservation can reveal novel insights into viral behaviour. We analysed 3651 publicly available sequences for the presence of nucleic acid conservation beyond that required by amino acid constraints, using a novel scale-free method that identifies regions of outlying score together with a codon scoring algorithm. Sequences with outlying score were further analysed using an algorithm for producing local RNA folds whilst accounting for alignment properties. 11 different conserved regions were identified, some corresponding to well-known cis-acting functions of the HIV-1 genome but also others whose conservation has not previously been noted. We identify rational causes for many of these, including cis functions, possible additional reading frame usage, a plausible mechanism by which the central polypurine tract primes second-strand DNA synthesis and a conformational stabilising function of a region at the 5' end of env.

SUBMITTER: Skittrall JP 

PROVIDER: S-EPMC6791557 | biostudies-literature | 2019 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

A scale-free analysis of the HIV-1 genome demonstrates multiple conserved regions of structural and functional importance.

Skittrall Jordan P JP   Ingemarsdotter Carin K CK   Gog Julia R JR   Lever Andrew M L AML  

PLoS computational biology 20190923 9


HIV-1 replicates via a low-fidelity polymerase with a high mutation rate; strong conservation of individual nucleotides is highly indicative of the presence of critical structural or functional properties. Identifying such conservation can reveal novel insights into viral behaviour. We analysed 3651 publicly available sequences for the presence of nucleic acid conservation beyond that required by amino acid constraints, using a novel scale-free method that identifies regions of outlying score to  ...[more]

Similar Datasets

| S-EPMC7232164 | biostudies-literature
| S-EPMC7563622 | biostudies-literature
| S-EPMC5842187 | biostudies-literature
| S-EPMC1463900 | biostudies-literature
| S-EPMC10567821 | biostudies-literature
| S-EPMC5519067 | biostudies-other
| S-EPMC5062970 | biostudies-literature
2021-07-15 | GSE179046 | GEO
| S-EPMC146152 | biostudies-other
| S-EPMC4995020 | biostudies-literature