
Dataset Information


Using triplet periodicity of nucleotide sequences for finding potential reading frame shifts in genes.


ABSTRACT: We introduce a novel approach for detecting possible mutations that lead to a reading frame (RF) shift in a gene. Deletions and insertions in DNA coding regions are significant events for genes because an RF shift alters an extensive region of the amino acid sequence encoded by the gene. The suggested method is based on the phenomenon of triplet periodicity (TP) in the coding regions of genes and its relative resistance to substitutions in the DNA sequence. We attempted to extend 326 933 regions of continuous TP found in genes from the KEGG databank by considering possible insertions and deletions. In total, we found 824 genes in which such an extension was possible and statistically significant. We then generated amino acid sequences according to the active (KEGG) and hypothetical ancient RFs in order to find confirmation of a shift at the protein level. Of these, 64 sequences have protein similarities only for the ancient RF, 176 only for the active RF, 3 for both, and 581 have no protein similarity at all. We aimed to establish a lower bound for the number of genes in which a shift between RF and TP is possible. Further ways to increase the number of revealed RF shifts are discussed.
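To make the idea of triplet periodicity more concrete, the following is a minimal Python sketch and not the authors' implementation. It assumes that TP can be summarized as a 3x4 matrix of base counts per codon position, that its strength can be measured by mutual information, and that a downstream window can be assigned the phase whose smoothed log-likelihood under an upstream matrix is highest. The function names `tp_matrix`, `mutual_information`, and `best_phase` are hypothetical, and the significance testing described in the abstract is not reproduced here.

```python
import math

BASES = "ACGT"


def tp_matrix(seq, phase=0):
    """Count each base at each of the three codon positions, starting at `phase`."""
    counts = [[0] * 4 for _ in range(3)]
    for i, base in enumerate(seq[phase:]):
        j = BASES.find(base)
        if j >= 0:  # skip ambiguous symbols such as N
            counts[i % 3][j] += 1
    return counts


def mutual_information(counts):
    """Mutual information (in nats) between codon position and base identity."""
    total = sum(sum(row) for row in counts)
    if total == 0:
        return 0.0
    row_sums = [sum(row) for row in counts]
    col_sums = [sum(counts[r][c] for r in range(3)) for c in range(4)]
    mi = 0.0
    for r in range(3):
        for c in range(4):
            n = counts[r][c]
            if n:
                mi += (n / total) * math.log(n * total / (row_sums[r] * col_sums[c]))
    return mi


def best_phase(reference, window):
    """Return the phase (0, 1 or 2) at which `window` best fits the TP of `reference`.

    Assumption: fit is scored as a log-likelihood with add-one smoothing.
    """
    ref = tp_matrix(reference)
    scores = []
    for phase in range(3):
        score = 0.0
        for i, base in enumerate(window):
            j = BASES.find(base)
            if j >= 0:
                r = (i + phase) % 3
                score += math.log((ref[r][j] + 1) / (sum(ref[r]) + 4))
        scores.append(score)
    return max(range(3), key=scores.__getitem__)


if __name__ == "__main__":
    upstream = "ATG" * 30
    downstream = "TGA" * 10  # same periodic pattern after a one-base deletion
    print(mutual_information(tp_matrix(upstream)))
    print(best_phase(upstream, downstream))
```

In this toy case a non-zero phase for the downstream window suggests that its periodicity is out of step with the upstream region, which is the kind of signal the abstract associates with an insertion or deletion. Real coding sequences have far weaker periodicity than this artificial repeat, so any practical use would need a statistical significance test.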

SUBMITTER: Frenkel FE 

PROVIDER: S-EPMC2671204 | biostudies-literature | 2009 Apr

REPOSITORIES: biostudies-literature


Publications

Using triplet periodicity of nucleotide sequences for finding potential reading frame shifts in genes.

Frenkel FE, Korotkov EV

DNA research: an international journal for rapid publication of reports on genes and genomes, 2009-03-03 (2)



Similar Datasets

S-EPMC2992068 | biostudies-literature
S-EPMC9294425 | biostudies-literature
S-EPMC45698 | biostudies-other
S-EPMC4656815 | biostudies-literature
S-EPMC1275588 | biostudies-literature
S-EPMC5054449 | biostudies-literature
E-MEXP-3476 | biostudies-arrayexpress | 2012-03-15
S-EPMC10118063 | biostudies-literature
S-EPMC156527 | biostudies-literature
S-EPMC3519624 | biostudies-literature