Unknown

Dataset Information

0

Proteomic detection of non-annotated protein-coding genes in Pseudomonas fluorescens Pf0-1.


ABSTRACT: Genome sequences are annotated by computational prediction of coding sequences, followed by similarity searches such as BLAST, which provide a layer of possible functional information. While the existence of processes such as alternative splicing complicates matters for eukaryote genomes, the view of bacterial genomes as a linear series of closely spaced genes leads to the assumption that computational annotations that predict such arrangements completely describe the coding capacity of bacterial genomes. We undertook a proteomic study to identify proteins expressed by Pseudomonas fluorescens Pf0-1 from genes that were not predicted during the genome annotation. Mapping peptides to the Pf0-1 genome sequence identified sixteen non-annotated protein-coding regions, of which nine were antisense to predicted genes, six were intergenic, and one read in the same direction as an annotated gene but in a different frame. The expression of all but one of the newly discovered genes was verified by RT-PCR. Few clues as to the function of the new genes were gleaned from informatic analyses, but potential orthologs in other Pseudomonas genomes were identified for eight of the new genes. The 16 newly identified genes improve the quality of the Pf0-1 genome annotation, and the detection of antisense protein-coding genes indicates the under-appreciated complexity of bacterial genome organization.

SUBMITTER: Kim W 

PROVIDER: S-EPMC2794547 | biostudies-literature | 2009 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Proteomic detection of non-annotated protein-coding genes in Pseudomonas fluorescens Pf0-1.

Kim Wook W   Silby Mark W MW   Purvine Sam O SO   Nicoll Julie S JS   Hixson Kim K KK   Monroe Matt M   Nicora Carrie D CD   Lipton Mary S MS   Levy Stuart B SB  

PloS one 20091224 12


Genome sequences are annotated by computational prediction of coding sequences, followed by similarity searches such as BLAST, which provide a layer of possible functional information. While the existence of processes such as alternative splicing complicates matters for eukaryote genomes, the view of bacterial genomes as a linear series of closely spaced genes leads to the assumption that computational annotations that predict such arrangements completely describe the coding capacity of bacteria  ...[more]

Similar Datasets

| S-EPMC2396522 | biostudies-literature
| S-EPMC3370546 | biostudies-literature
| S-EPMC3370510 | biostudies-literature
2011-11-30 | GSE33865 | GEO
| S-EPMC3165641 | biostudies-literature
| S-EPMC3217935 | biostudies-literature