Dataset Information

TreeFix: statistically informed gene tree error correction using species trees.


ABSTRACT: Accurate gene tree reconstruction is a fundamental problem in phylogenetics, with many important applications. However, sequence data alone often lack enough information to confidently support one gene tree topology over many competing alternatives. Here, we present a novel framework for combining sequence data and species tree information, and we describe an implementation of this framework in TreeFix, a new phylogenetic program for improving gene tree reconstructions. Given a gene tree (preferably computed using a maximum-likelihood phylogenetic program), TreeFix finds a "statistically equivalent" gene tree that minimizes a species tree-based cost function. We have applied TreeFix to two clades, of 12 Drosophila and 16 fungal genomes, as well as to simulated phylogenies, and show that it dramatically improves reconstructions compared with current state-of-the-art programs. Given its accuracy, speed, and simplicity, TreeFix should be applicable to a wide range of analyses and have many important implications for future investigations of gene evolution. The source code and a sample data set are available at http://compbio.mit.edu/treefix.
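
The framework described in the abstract (search for a gene tree that is statistically equivalent to the input tree while minimizing a species tree-based cost) can be illustrated with a minimal sketch. This is not the TreeFix implementation: the function name `fix_tree` and the callables `neighbors`, `is_equivalent`, and `cost` are hypothetical placeholders for a tree rearrangement move, a statistical test on the sequence data, and a species tree-based cost function. The toy example uses integers in place of trees only to make the block runnable.

```python
from typing import Callable, Iterable, TypeVar

T = TypeVar("T")


def fix_tree(
    start: T,
    neighbors: Callable[[T], Iterable[T]],
    is_equivalent: Callable[[T], bool],
    cost: Callable[[T], float],
    max_iters: int = 100,
) -> T:
    """Greedy local search (an assumed strategy, not the published one).

    Repeatedly moves to the lowest-cost neighbor that still passes the
    equivalence test, and stops when no such neighbor lowers the cost.
    """
    best = start
    best_cost = cost(start)
    for _ in range(max_iters):
        # Keep only candidates the (placeholder) statistical test accepts.
        candidates = [t for t in neighbors(best) if is_equivalent(t)]
        if not candidates:
            break
        challenger = min(candidates, key=cost)
        challenger_cost = cost(challenger)
        if challenger_cost >= best_cost:
            break
        best, best_cost = challenger, challenger_cost
    return best


# Toy stand-ins: "trees" are integers, the input tree is 10,
# candidates within 3 of the input count as equivalent,
# and the cost is the distance from 4.
result = fix_tree(
    start=10,
    neighbors=lambda n: [n - 1, n + 1],
    is_equivalent=lambda n: abs(n - 10) <= 3,
    cost=lambda n: abs(n - 4),
)
print(result)
```

In the toy run the search stops at the boundary of the equivalence region rather than at the global cost minimum, which mirrors the idea that the species tree-based cost is only allowed to choose among trees the sequence data cannot distinguish from the input.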

SUBMITTER: Wu YC 

PROVIDER: S-EPMC3526801 | biostudies-literature | 2013 Jan

REPOSITORIES: biostudies-literature

Publications

TreeFix: statistically informed gene tree error correction using species trees.

Wu YC, Rasmussen MD, Bansal MS, Kellis M

Systematic Biology, 2012-09-04, issue 1



Similar Datasets

| S-EPMC4393519 | biostudies-literature
| S-EPMC5754272 | biostudies-literature
| S-EPMC5998893 | biostudies-other
| S-EPMC7750968 | biostudies-literature
| S-EPMC5249135 | biostudies-literature
| S-EPMC2614993 | biostudies-literature
| S-EPMC5680165 | biostudies-literature
| S-EPMC1626107 | biostudies-literature
| S-EPMC3874668 | biostudies-literature
| S-EPMC5704532 | biostudies-literature