Unknown

Dataset Information

0

Next-generation annotation of prokaryotic genomes with EuGene-P: application to Sinorhizobium meliloti 2011.


ABSTRACT: The availability of next-generation sequences of transcripts from prokaryotic organisms offers the opportunity to design a new generation of automated genome annotation tools not yet available for prokaryotes. In this work, we designed EuGene-P, the first integrative prokaryotic gene finder tool which combines a variety of high-throughput data, including oriented RNA-Seq data, directly into the prediction process. This enables the automated prediction of coding sequences (CDSs), untranslated regions, transcription start sites (TSSs) and non-coding RNA (ncRNA, sense and antisense) genes. EuGene-P was used to comprehensively and accurately annotate the genome of the nitrogen-fixing bacterium Sinorhizobium meliloti strain 2011, leading to the prediction of 6308 CDSs as well as 1876 ncRNAs. Among them, 1280 appeared as antisense to a CDS, which supports recent findings that antisense transcription activity is widespread in bacteria. Moreover, 4077 TSSs upstream of protein-coding or non-coding genes were precisely mapped providing valuable data for the study of promoter regions. By looking for RpoE2-binding sites upstream of annotated TSSs, we were able to extend the S. meliloti RpoE2 regulon by ?3-fold. Altogether, these observations demonstrate the power of EuGene-P to produce a reliable and high-resolution automatic annotation of prokaryotic genomes.

SUBMITTER: Sallet E 

PROVIDER: S-EPMC3738161 | biostudies-literature | 2013 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

Next-generation annotation of prokaryotic genomes with EuGene-P: application to Sinorhizobium meliloti 2011.

Sallet Erika E   Roux Brice B   Sauviac Laurent L   Jardinaud Marie-Francoise MF   Carrère Sébastien S   Faraut Thomas T   de Carvalho-Niebel Fernanda F   Gouzy Jérôme J   Gamas Pascal P   Capela Delphine D   Bruand Claude C   Schiex Thomas T  

DNA research : an international journal for rapid publication of reports on genes and genomes 20130418 4


The availability of next-generation sequences of transcripts from prokaryotic organisms offers the opportunity to design a new generation of automated genome annotation tools not yet available for prokaryotes. In this work, we designed EuGene-P, the first integrative prokaryotic gene finder tool which combines a variety of high-throughput data, including oriented RNA-Seq data, directly into the prediction process. This enables the automated prediction of coding sequences (CDSs), untranslated reg  ...[more]

Similar Datasets

2013-04-01 | GSE44083 | GEO
| PRJNA188524 | ENA
| S-EPMC2656595 | biostudies-literature
| S-EPMC3484140 | biostudies-literature
| S-EPMC2811008 | biostudies-literature
| S-EPMC4982090 | biostudies-literature
| S-EPMC2573895 | biostudies-other
| S-EPMC2362131 | biostudies-literature
| S-EPMC3098052 | biostudies-literature
| PRJNA566694 | ENA