Unknown

Dataset Information

0

Joint amalgamation of most parsimonious reconciled gene trees.


ABSTRACT: MOTIVATION: Traditionally, gene phylogenies have been reconstructed solely on the basis of molecular sequences; this, however, often does not provide enough information to distinguish between statistically equivalent relationships. To address this problem, several recent methods have incorporated information on the species phylogeny in gene tree reconstruction, leading to dramatic improvements in accuracy. Although probabilistic methods are able to estimate all model parameters but are computationally expensive, parsimony methods-generally computationally more efficient-require a prior estimate of parameters and of the statistical support. RESULTS: Here, we present the Tree Estimation using Reconciliation (TERA) algorithm, a parsimony based, species tree aware method for gene tree reconstruction based on a scoring scheme combining duplication, transfer and loss costs with an estimate of the sequence likelihood. TERA explores all reconciled gene trees that can be amalgamated from a sample of gene trees. Using a large scale simulated dataset, we demonstrate that TERA achieves the same accuracy as the corresponding probabilistic method while being faster, and outperforms other parsimony-based methods in both accuracy and speed. Running TERA on a set of 1099 homologous gene families from complete cyanobacterial genomes, we find that incorporating knowledge of the species tree results in a two thirds reduction in the number of apparent transfer events.

SUBMITTER: Scornavacca C 

PROVIDER: S-EPMC4380024 | biostudies-literature | 2015 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Joint amalgamation of most parsimonious reconciled gene trees.

Scornavacca Celine C   Jacox Edwin E   Szöllősi Gergely J GJ  

Bioinformatics (Oxford, England) 20141106 6


<h4>Motivation</h4>Traditionally, gene phylogenies have been reconstructed solely on the basis of molecular sequences; this, however, often does not provide enough information to distinguish between statistically equivalent relationships. To address this problem, several recent methods have incorporated information on the species phylogeny in gene tree reconstruction, leading to dramatic improvements in accuracy. Although probabilistic methods are able to estimate all model parameters but are co  ...[more]

Similar Datasets

| S-EPMC1891116 | biostudies-literature
| S-EPMC4603630 | biostudies-literature
| S-EPMC3955125 | biostudies-literature
| S-EPMC2538912 | biostudies-literature
| S-EPMC9318429 | biostudies-literature
| S-EPMC44714 | biostudies-other
| S-EPMC7052050 | biostudies-literature
| S-EPMC9235488 | biostudies-literature
| S-EPMC4216048 | biostudies-other