Unknown

Dataset Information

0

Maximum likelihood models and algorithms for gene tree evolution with duplications and losses.


ABSTRACT: BACKGROUND: The abundance of new genomic data provides the opportunity to map the location of gene duplication and loss events on a species phylogeny. The first methods for mapping gene duplications and losses were based on a parsimony criterion, finding the mapping that minimizes the number of duplication and loss events. Probabilistic modeling of gene duplication and loss is relatively new and has largely focused on birth-death processes. RESULTS: We introduce a new maximum likelihood model that estimates the speciation and gene duplication and loss events in a gene tree within a species tree with branch lengths. We also provide an, in practice, efficient algorithm that computes optimal evolutionary scenarios for this model. We implemented the algorithm in the program DrML and verified its performance with empirical and simulated data. CONCLUSIONS: In test data sets, DrML finds optimal gene duplication and loss scenarios within minutes, even when the gene trees contain sequences from several hundred species. In many cases, these optimal scenarios differ from the lca-mapping that results from a parsimony gene tree reconciliation. Thus, DrML provides a new, practical statistical framework on which to study gene duplication.

SUBMITTER: Gorecki P 

PROVIDER: S-EPMC3044269 | biostudies-literature | 2011

REPOSITORIES: biostudies-literature

altmetric image

Publications

Maximum likelihood models and algorithms for gene tree evolution with duplications and losses.

Górecki Pawel P   Burleigh Gordon J GJ   Eulenstein Oliver O  

BMC bioinformatics 20110215


<h4>Background</h4>The abundance of new genomic data provides the opportunity to map the location of gene duplication and loss events on a species phylogeny. The first methods for mapping gene duplications and losses were based on a parsimony criterion, finding the mapping that minimizes the number of duplication and loss events. Probabilistic modeling of gene duplication and loss is relatively new and has largely focused on birth-death processes.<h4>Results</h4>We introduce a new maximum likeli  ...[more]

Similar Datasets

| S-EPMC4833081 | biostudies-literature
| S-EPMC10597321 | biostudies-literature
| S-EPMC3273908 | biostudies-literature
| S-EPMC8312565 | biostudies-literature
2024-03-20 | GSE261769 | GEO
| S-EPMC10748807 | biostudies-literature
| S-EPMC1323467 | biostudies-literature
| S-EPMC2614190 | biostudies-literature
| S-EPMC7249433 | biostudies-literature
| S-EPMC6404243 | biostudies-literature