Unknown

Dataset Information

0

Quantitative frame analysis and the annotation of GC-rich (and other) prokaryotic genomes. An application to Anaeromyxobacter dehalogenans.


ABSTRACT: Graphical representations of contrasts in GC usage among codon frame positions (frame analysis) provide evidence of genes missing from the annotations of prokaryotic genomes of high GC content but the qualitative approach of visual frame analysis prevents its applicability on a genomic scale.We developed two quantitative methods for the identification and statistical characterization in sequence regions of three-base periodicity (hits) associated with open reading frame structures. The methods were implemented in the N-Profile Analysis Computational Tool (NPACT), which highlights in graphical representations inconsistencies between newly identified ORFs and pre-existing annotations of coding-regions. We applied the NPACT procedures to two recently annotated strains of the deltaproteobacterium Anaeromyxobacter dehalogenans, identifying in both genomes numerous conserved ORFs not included in the published annotation of coding regions.NPACT is available as a web-based service and for download at http://genome.ufl.edu/npact.lucianob@ufl.eduSupplementary data are available at Bioinformatics online.

SUBMITTER: Oden S 

PROVIDER: S-EPMC4595893 | biostudies-literature | 2015 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

Quantitative frame analysis and the annotation of GC-rich (and other) prokaryotic genomes. An application to Anaeromyxobacter dehalogenans.

Oden Steve S   Brocchieri Luciano L  

Bioinformatics (Oxford, England) 20150604 20


<h4>Motivation</h4>Graphical representations of contrasts in GC usage among codon frame positions (frame analysis) provide evidence of genes missing from the annotations of prokaryotic genomes of high GC content but the qualitative approach of visual frame analysis prevents its applicability on a genomic scale.<h4>Results</h4>We developed two quantitative methods for the identification and statistical characterization in sequence regions of three-base periodicity (hits) associated with open read  ...[more]

Similar Datasets

| PRJNA20113 | ENA
| S-EPMC1472366 | biostudies-literature
| S-EPMC3738161 | biostudies-literature
| S-EPMC2362131 | biostudies-literature
| S-EPMC3098052 | biostudies-literature
| PRJNA20095 | ENA
| PRJNA12634 | ENA
| S-EPMC3711429 | biostudies-literature
| S-EPMC126698 | biostudies-literature
| S-EPMC5795083 | biostudies-literature