Unknown

Dataset Information

0

Recognizing the pseudogenes in bacterial genomes.


ABSTRACT: Pseudogenes are now known to be a regular feature of bacterial genomes and are found in particularly high numbers within the genomes of recently emerged bacterial pathogens. As most pseudogenes are recognized by sequence alignments, we use newly available genomic sequences to identify the pseudogenes in 11 genomes from 4 bacterial genera, each of which contains at least 1 human pathogen. The numbers of pseudogenes range from 27 in Staphylococcus aureus MW2 to 337 in Yersinia pestis CO92 (e.g. 1-8% of the annotated genes in the genome). Most pseudogenes are formed by small frameshifting indels, but because stop codons are A + T-rich, the two low-G + C Gram-positive taxa (Streptococcus and Staphylococcus) have relatively high fractions of pseudogenes generated by nonsense mutations when compared with more G + C-rich genomes. Over half of the pseudogenes are produced from genes whose original functions were annotated as 'hypothetical' or 'unknown'; however, several broadly distributed genes involved in nucleotide processing, repair or replication have become pseudogenes in one of the sequenced Vibrio vulnificus genomes. Although many of our comparisons involved closely related strains with broadly overlapping gene inventories, each genome contains a largely unique set of pseudogenes, suggesting that pseudogenes are formed and eliminated relatively rapidly from most bacterial genomes.

SUBMITTER: Lerat E 

PROVIDER: S-EPMC1142405 | biostudies-literature | 2005

REPOSITORIES: biostudies-literature

altmetric image

Publications

Recognizing the pseudogenes in bacterial genomes.

Lerat Emmanuelle E   Ochman Howard H  

Nucleic acids research 20050602 10


Pseudogenes are now known to be a regular feature of bacterial genomes and are found in particularly high numbers within the genomes of recently emerged bacterial pathogens. As most pseudogenes are recognized by sequence alignments, we use newly available genomic sequences to identify the pseudogenes in 11 genomes from 4 bacterial genera, each of which contains at least 1 human pathogen. The numbers of pseudogenes range from 27 in Staphylococcus aureus MW2 to 337 in Yersinia pestis CO92 (e.g. 1-  ...[more]

Similar Datasets

| S-EPMC9122581 | biostudies-literature
| S-EPMC152858 | biostudies-literature
| S-EPMC3296715 | biostudies-literature
| S-EPMC2916853 | biostudies-literature
| S-EPMC9576210 | biostudies-literature
| S-EPMC2687790 | biostudies-literature
| PRJNA700858 | ENA
| S-EPMC525686 | biostudies-literature
| S-EPMC6054270 | biostudies-literature
| PRJEB7314 | ENA