Unknown

Dataset Information

0

Re-annotation of the Saccharopolyspora erythraea genome using a systems biology approach.


ABSTRACT:

Background

Accurate bacterial genome annotations provide a framework to understanding cellular functions, behavior and pathogenicity and are essential for metabolic engineering. Annotations based only on in silico predictions are inaccurate, particularly for large, high G + C content genomes due to the lack of similarities in gene length and gene organization to model organisms.

Results

Here we describe a 2D systems biology driven re-annotation of the Saccharopolyspora erythraea genome using proteogenomics, a genome-scale metabolic reconstruction, RNA-sequencing and small-RNA-sequencing. We observed transcription of more than 300 intergenic regions, detected 59 peptides in intergenic regions, confirmed 164 open reading frames previously annotated as hypothetical proteins and reassigned function to open reading frames using the genome-scale metabolic reconstruction. Finally, we present a novel way of mapping ribosomal binding sites across the genome by sequencing small RNAs.

Conclusions

The work presented here describes a novel framework for annotation of the Saccharopolyspora erythraea genome. Based on experimental observations, the 2D annotation framework greatly reduces errors that are commonly made when annotating large-high G + C content genomes using computational prediction algorithms.

SUBMITTER: Marcellin E 

PROVIDER: S-EPMC4008361 | biostudies-literature | 2013 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

Re-annotation of the Saccharopolyspora erythraea genome using a systems biology approach.

Marcellin Esteban E   Licona-Cassani Cuauhtemoc C   Mercer Tim R TR   Palfreyman Robin W RW   Nielsen Lars K LK  

BMC genomics 20131011


<h4>Background</h4>Accurate bacterial genome annotations provide a framework to understanding cellular functions, behavior and pathogenicity and are essential for metabolic engineering. Annotations based only on in silico predictions are inaccurate, particularly for large, high G + C content genomes due to the lack of similarities in gene length and gene organization to model organisms.<h4>Results</h4>Here we describe a 2D systems biology driven re-annotation of the Saccharopolyspora erythraea g  ...[more]

Similar Datasets

2013-07-16 | GSE48887 | GEO
| S-EPMC2206050 | biostudies-literature
| S-EPMC4014280 | biostudies-literature
| S-EPMC4838384 | biostudies-literature
| S-EPMC1859954 | biostudies-literature
2013-07-16 | E-GEOD-48887 | biostudies-arrayexpress
| S-EPMC4809991 | biostudies-literature
| S-EPMC2871057 | biostudies-literature
| PRJNA18491 | ENA
| PRJNA699044 | ENA