Unknown

Dataset Information

0

Bidirectional best hits miss many orthologs in duplication-rich clades such as plants and animals.


ABSTRACT: Bidirectional best hits (BBH), which entails identifying the pairs of genes in two different genomes that are more similar to each other than either is to any other gene in the other genome, is a simple and widely used method to infer orthology. A recent study has analyzed the link between BBH and orthology in bacteria and archaea and concluded that, given the very high consistency in BBH they observed among triplets of neighboring genes, a high proportion of BBH are likely to be bona fide orthologs. However, limited by their analysis setup, the previous study could not easily test the reverse question: which proportion of orthologs are BBH? In this follow-up study, we consider this question in theory and answer it based on conceptual arguments, simulated data, and real biological data from all three domains of life. Our analyses corroborate the findings of the previous study, but also show that because of the high rate of gene duplication in plants and animals, as much as 60% of orthologous relations are missed by the BBH criterion.

SUBMITTER: Dalquen DA 

PROVIDER: S-EPMC3814191 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

altmetric image

Publications

Bidirectional best hits miss many orthologs in duplication-rich clades such as plants and animals.

Dalquen Daniel A DA   Dessimoz Christophe C  

Genome biology and evolution 20130101 10


Bidirectional best hits (BBH), which entails identifying the pairs of genes in two different genomes that are more similar to each other than either is to any other gene in the other genome, is a simple and widely used method to infer orthology. A recent study has analyzed the link between BBH and orthology in bacteria and archaea and concluded that, given the very high consistency in BBH they observed among triplets of neighboring genes, a high proportion of BBH are likely to be bona fide ortho  ...[more]

Similar Datasets

| S-EPMC8214411 | biostudies-literature
| S-EPMC4094424 | biostudies-literature
| S-EPMC7585182 | biostudies-literature
| S-EPMC4137093 | biostudies-literature
| S-EPMC6022570 | biostudies-literature
| S-EPMC3575645 | biostudies-literature
| PRJEB41406 | ENA
| S-EPMC5068742 | biostudies-literature
| S-EPMC7894362 | biostudies-literature
| S-EPMC25828 | biostudies-literature