
Dataset Information


Ortholog identification in the presence of domain architecture rearrangement.


ABSTRACT: Ortholog identification is used in gene functional annotation, species phylogeny estimation, phylogenetic profile construction and many other analyses. Bioinformatics methods for ortholog identification are commonly based on pairwise protein sequence comparisons between whole genomes. Phylogenetic methods of ortholog identification have also been developed; these methods can be applied to protein data sets that share a common domain architecture or that share a single functional domain but differ outside this region of homology. While promiscuous domains represent a challenge to all orthology prediction methods, overall structural similarity is highly correlated with proximity in a phylogenetic tree, conferring a degree of robustness to phylogenetic methods. In this article, we review the issues involved in orthology prediction when data sets include sequences with structurally heterogeneous domain architectures, with particular attention to automated methods designed for high-throughput application, and present a case study to illustrate the challenges in this area.

SUBMITTER: Sjolander K 

PROVIDER: S-EPMC3178056 | biostudies-literature | 2011 Sep

REPOSITORIES: biostudies-literature


Publications

Ortholog identification in the presence of domain architecture rearrangement.

Sjölander K, Datta RS, Shen Y, Shoffner GM

Briefings in Bioinformatics, 2011-06-28, issue 5



Similar Datasets

| S-EPMC2821317 | biostudies-literature
| S-EPMC9420521 | biostudies-literature
| S-EPMC4035852 | biostudies-other
| S-EPMC9236583 | biostudies-literature
| S-EPMC1557999 | biostudies-literature
| S-EPMC403725 | biostudies-literature
| S-EPMC3791407 | biostudies-literature
| S-EPMC2957689 | biostudies-literature
| S-EPMC3215765 | biostudies-literature
| S-EPMC6224403 | biostudies-literature