Unknown

Dataset Information

0

EnsemblCompara GeneTrees: Complete, duplication-aware phylogenetic trees in vertebrates.


ABSTRACT: We have developed a comprehensive gene orientated phylogenetic resource, EnsemblCompara GeneTrees, based on a computational pipeline to handle clustering, multiple alignment, and tree generation, including the handling of large gene families. We developed two novel non-sequence-based metrics of gene tree correctness and benchmarked a number of tree methods. The TreeBeST method from TreeFam shows the best performance in our hands. We also compared this phylogenetic approach to clustering approaches for ortholog prediction, showing a large increase in coverage using the phylogenetic approach. All data are made available in a number of formats and will be kept up to date with the Ensembl project.

SUBMITTER: Vilella AJ 

PROVIDER: S-EPMC2652215 | biostudies-literature | 2009 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

EnsemblCompara GeneTrees: Complete, duplication-aware phylogenetic trees in vertebrates.

Vilella Albert J AJ   Severin Jessica J   Ureta-Vidal Abel A   Heng Li L   Durbin Richard R   Birney Ewan E  

Genome research 20081124 2


We have developed a comprehensive gene orientated phylogenetic resource, EnsemblCompara GeneTrees, based on a computational pipeline to handle clustering, multiple alignment, and tree generation, including the handling of large gene families. We developed two novel non-sequence-based metrics of gene tree correctness and benchmarked a number of tree methods. The TreeBeST method from TreeFam shows the best performance in our hands. We also compared this phylogenetic approach to clustering approach  ...[more]

Similar Datasets

| S-EPMC3105381 | biostudies-literature
| S-EPMC5985597 | biostudies-literature
| S-EPMC3813836 | biostudies-other
| S-EPMC3669789 | biostudies-other
| S-EPMC5447242 | biostudies-literature
| S-EPMC1160516 | biostudies-literature
| S-EPMC1691382 | biostudies-literature