Unknown

Dataset Information

0

Proteny: discovering and visualizing statistically significant syntenic clusters at the proteome level.


ABSTRACT:

Background

With more and more genomes being sequenced, detecting synteny between genomes becomes more and more important. However, for microorganisms the genomic divergence quickly becomes large, resulting in different codon usage and shuffling of gene order and gene elements such as exons.

Results

We present Proteny, a methodology to detect synteny between diverged genomes. It operates on the amino acid sequence level to be insensitive to codon usage adaptations and clusters groups of exons disregarding order to handle diversity in genomic ordering between genomes. Furthermore, Proteny assigns significance levels to the syntenic clusters such that they can be selected on statistical grounds. Finally, Proteny provides novel ways to visualize results at different scales, facilitating the exploration and interpretation of syntenic regions. We test the performance of Proteny on a standard ground truth dataset, and we illustrate the use of Proteny on two closely related genomes (two different strains of Aspergillus niger) and on two distant genomes (two species of Basidiomycota). In comparison to other tools, we find that Proteny finds clusters with more true homologies in fewer clusters that contain more genes, i.e. Proteny is able to identify a more consistent synteny. Further, we show how genome rearrangements, assembly errors, gene duplications and the conservation of specific genes can be easily studied with Proteny.

Availability and implementation

Proteny is freely available at the Delft Bioinformatics Lab website http://bioinformatics.tudelft.nl/dbl/software.

Contact

t.gehrmann@tudelft.nl

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Gehrmann T 

PROVIDER: S-EPMC4612220 | biostudies-literature | 2015 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Proteny: discovering and visualizing statistically significant syntenic clusters at the proteome level.

Gehrmann Thies T   Reinders Marcel J T MJ  

Bioinformatics (Oxford, England) 20150627 21


<h4>Background</h4>With more and more genomes being sequenced, detecting synteny between genomes becomes more and more important. However, for microorganisms the genomic divergence quickly becomes large, resulting in different codon usage and shuffling of gene order and gene elements such as exons.<h4>Results</h4>We present Proteny, a methodology to detect synteny between diverged genomes. It operates on the amino acid sequence level to be insensitive to codon usage adaptations and clusters grou  ...[more]

Similar Datasets

| S-EPMC1200092 | biostudies-literature
| S-EPMC3084717 | biostudies-literature
| S-EPMC3179615 | biostudies-literature
| S-EPMC5918465 | biostudies-other
| S-EPMC6691336 | biostudies-literature
| S-EPMC3854656 | biostudies-literature
| S-EPMC3688764 | biostudies-literature
| S-EPMC3584929 | biostudies-literature
| S-EPMC5181558 | biostudies-other
| S-EPMC7451401 | biostudies-literature