Unknown

Dataset Information

0

Improving pan-genome annotation using whole genome multiple alignment.


ABSTRACT:

Background

Rapid annotation and comparisons of genomes from multiple isolates (pan-genomes) is becoming commonplace due to advances in sequencing technology. Genome annotations can contain inconsistencies and errors that hinder comparative analysis even within a single species. Tools are needed to compare and improve annotation quality across sets of closely related genomes.

Results

We introduce a new tool, Mugsy-Annotator, that identifies orthologs and evaluates annotation quality in prokaryotic genomes using whole genome multiple alignment. Mugsy-Annotator identifies anomalies in annotated gene structures, including inconsistently located translation initiation sites and disrupted genes due to draft genome sequencing or pseudogenes. An evaluation of species pan-genomes using the tool indicates that such anomalies are common, especially at translation initiation sites. Mugsy-Annotator reports alternate annotations that improve consistency and are candidates for further review.

Conclusions

Whole genome multiple alignment can be used to efficiently identify orthologs and annotation problem areas in a bacterial pan-genome. Comparisons of annotated gene structures within a species may show more variation than is actually present in the genome, indicating errors in genome annotation. Our new tool Mugsy-Annotator assists re-annotation efforts by highlighting edits that improve annotation consistency.

SUBMITTER: Angiuoli SV 

PROVIDER: S-EPMC3142524 | biostudies-literature | 2011 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Improving pan-genome annotation using whole genome multiple alignment.

Angiuoli Samuel V SV   Dunning Hotopp Julie C JC   Salzberg Steven L SL   Tettelin Hervé H  

BMC bioinformatics 20110630


<h4>Background</h4>Rapid annotation and comparisons of genomes from multiple isolates (pan-genomes) is becoming commonplace due to advances in sequencing technology. Genome annotations can contain inconsistencies and errors that hinder comparative analysis even within a single species. Tools are needed to compare and improve annotation quality across sets of closely related genomes.<h4>Results</h4>We introduce a new tool, Mugsy-Annotator, that identifies orthologs and evaluates annotation qualit  ...[more]

Similar Datasets

| S-EPMC5769345 | biostudies-literature
2019-07-01 | PXD009697 | Pride
2019-07-01 | PXD009672 | Pride
2016-05-27 | PXD002967 | Pride
| S-EPMC6902276 | biostudies-literature
| S-EPMC4952227 | biostudies-literature
| S-EPMC1808025 | biostudies-literature
| S-EPMC5833154 | biostudies-literature
| S-EPMC7728760 | biostudies-literature
| S-EPMC6602458 | biostudies-other