Unknown

Dataset Information

0

Robustness of birth-death and gain models for inferring evolutionary events.


ABSTRACT:

Background

Phylogenetic birth-death models are opening a new window on the processes of genome evolution in studies of the evolution of gene and protein families, protein-protein interaction networks, microRNAs, and copy number variation. Given a species tree and a set of genomic characters in present-day species, the birth-death approach estimates the most likely rates required to explain the observed data and returns the expected ancestral character states and the history of character state changes. Achieving a balance between model complexity and generalizability is a fundamental challenge in the application of birth-death models. While more parameters promise greater accuracy and more biologically realistic models, increasing model complexity can lead to overfitting and a heavy computational cost.

Results

Here we present a systematic, empirical investigation of these tradeoffs, using protein domain families in six metazoan genomes as a case study. We compared models of increasing complexity, implemented in the Count program, with respect to model fit, robustness, and stability. In addition, we used a bootstrapping procedure to assess estimator variability. The results show that the most complex model, which allows for both branch-specific and family-specific rate variation, achieves the best fit, without overfitting. Variance remains low with increasing complexity, except for family-specific loss rates. This variance is reduced when the number of discrete rate categories is increased.

Conclusions

The work presented here evaluates model choice for genomic birth-death models in a systematic way and presents the first use of bootstrapping to assess estimator variance in birth-death models. We find that a model incorporating both lineage and family rate variation yields more accurate estimators without sacrificing generality. Our results indicate that model choice can lead to fundamentally different evolutionary conclusions, emphasizing the importance of more biologically realistic and complex models.

SUBMITTER: Stolzer M 

PROVIDER: S-EPMC4239551 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature

altmetric image

Publications

Robustness of birth-death and gain models for inferring evolutionary events.

Stolzer Maureen M   Wasserman Larry L   Durand Dannie D  

BMC genomics 20141017


<h4>Background</h4>Phylogenetic birth-death models are opening a new window on the processes of genome evolution in studies of the evolution of gene and protein families, protein-protein interaction networks, microRNAs, and copy number variation. Given a species tree and a set of genomic characters in present-day species, the birth-death approach estimates the most likely rates required to explain the observed data and returns the expected ancestral character states and the history of character  ...[more]

Similar Datasets

| S-EPMC4421816 | biostudies-other
| S-EPMC7574235 | biostudies-literature
| S-EPMC7510305 | biostudies-literature
| S-EPMC6767073 | biostudies-literature
| S-EPMC6701459 | biostudies-literature
| S-EPMC4579340 | biostudies-literature
| S-EPMC8484726 | biostudies-literature
| S-EPMC4223917 | biostudies-literature
| S-EPMC3202799 | biostudies-other
| S-EPMC3557300 | biostudies-literature