Unknown

Dataset Information

0

OBAMA: OBAMA for Bayesian amino-acid model averaging.


ABSTRACT:

Background

Bayesian analyses offer many benefits for phylogenetic, and have been popular for analysis of amino acid alignments. It is necessary to specify a substitution and site model for such analyses, and often an ad hoc, or likelihood based method is employed for choosing these models that are typically of no interest to the analysis overall.

Methods

We present a method called OBAMA that averages over substitution models and site models, thus letting the data inform model choices and taking model uncertainty into account. It uses trans-dimensional Markov Chain Monte Carlo (MCMC) proposals to switch between various empirical substitution models for amino acids such as Dayhoff, WAG, and JTT. Furthermore, it switches base frequencies from these substitution models or use base frequencies estimated based on the alignment. Finally, it switches between using gamma rate heterogeneity or not, and between using a proportion of invariable sites or not.

Results

We show that the model performs well in a simulation study. By using appropriate priors, we demonstrate both proportion of invariable sites and the shape parameter for gamma rate heterogeneity can be estimated. The OBAMA method allows taking in account model uncertainty, thus reducing bias in phylogenetic estimates. The method is implemented in the OBAMA package in BEAST 2, which is open source licensed under LGPL and allows joint tree inference under a wide range of models.

SUBMITTER: Bouckaert RR 

PROVIDER: S-EPMC7413081 | biostudies-literature | 2020

REPOSITORIES: biostudies-literature

altmetric image

Publications

OBAMA: OBAMA for Bayesian amino-acid model averaging.

Bouckaert Remco R RR  

PeerJ 20200804


<h4>Background</h4>Bayesian analyses offer many benefits for phylogenetic, and have been popular for analysis of amino acid alignments. It is necessary to specify a substitution and site model for such analyses, and often an ad hoc, or likelihood based method is employed for choosing these models that are typically of no interest to the analysis overall.<h4>Methods</h4>We present a method called OBAMA that averages over substitution models and site models, thus letting the data inform model choi  ...[more]

Similar Datasets

| S-EPMC9246148 | biostudies-literature
| S-EPMC7846150 | biostudies-literature
| S-EPMC5396931 | biostudies-literature
| S-EPMC5975653 | biostudies-literature
| S-EPMC10257384 | biostudies-literature
| S-EPMC2889260 | biostudies-literature
| S-EPMC10087723 | biostudies-literature
| S-EPMC3258040 | biostudies-literature
| S-EPMC6162921 | biostudies-literature
| S-EPMC8023666 | biostudies-literature