Unknown

Dataset Information

0

Novel non-parametric models to estimate evolutionary rates and divergence times from heterochronous sequence data.


ABSTRACT:

Background

Early methods for estimating divergence times from gene sequence data relied on the assumption of a molecular clock. More sophisticated methods were created to model rate variation and used auto-correlation of rates, local clocks, or the so called "uncorrelated relaxed clock" where substitution rates are assumed to be drawn from a parametric distribution. In the case of Bayesian inference methods the impact of the prior on branching times is not clearly understood, and if the amount of data is limited the posterior could be strongly influenced by the prior.

Results

We develop a maximum likelihood method--Physher--that uses local or discrete clocks to estimate evolutionary rates and divergence times from heterochronous sequence data. Using two empirical data sets we show that our discrete clock estimates are similar to those obtained by other methods, and that Physher outperformed some methods in the estimation of the root age of an influenza virus data set. A simulation analysis suggests that Physher can outperform a Bayesian method when the real topology contains two long branches below the root node, even when evolution is strongly clock-like.

Conclusions

These results suggest it is advisable to use a variety of methods to estimate evolutionary rates and divergence times from heterochronous sequence data. Physher and the associated data sets used here are available online at http://code.google.com/p/physher/.

SUBMITTER: Fourment M 

PROVIDER: S-EPMC4222489 | biostudies-literature | 2014 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Novel non-parametric models to estimate evolutionary rates and divergence times from heterochronous sequence data.

Fourment Mathieu M   Holmes Edward C EC  

BMC evolutionary biology 20140724


<h4>Background</h4>Early methods for estimating divergence times from gene sequence data relied on the assumption of a molecular clock. More sophisticated methods were created to model rate variation and used auto-correlation of rates, local clocks, or the so called "uncorrelated relaxed clock" where substitution rates are assumed to be drawn from a parametric distribution. In the case of Bayesian inference methods the impact of the prior on branching times is not clearly understood, and if the  ...[more]

Similar Datasets

| S-EPMC4856229 | biostudies-literature
| S-EPMC3107591 | biostudies-literature
| S-EPMC5963472 | biostudies-other
| S-EPMC8117668 | biostudies-literature
| S-EPMC7720557 | biostudies-literature
| S-EPMC7197704 | biostudies-literature
| S-EPMC6451633 | biostudies-literature
| S-EPMC5850342 | biostudies-literature
| S-EPMC6984362 | biostudies-literature
| S-EPMC4202788 | biostudies-literature