Dataset Information

Estimating the Effective Sample Size of Tree Topologies from Bayesian Phylogenetic Analyses.

ABSTRACT: Bayesian phylogenetic analyses estimate posterior distributions of phylogenetic tree topologies and other parameters using Markov chain Monte Carlo (MCMC) methods. Before making inferences from these distributions, it is important to assess their adequacy. To this end, the effective sample size (ESS) estimates how many truly independent samples of a given parameter the output of the MCMC represents. The ESS of a parameter is frequently much lower than the number of samples taken from the MCMC because sequential samples from the chain can be non-independent due to autocorrelation. Typically, phylogeneticists use a rule of thumb that the ESS of all parameters should be greater than 200. However, we have no method to calculate an ESS of tree topology samples, despite the fact that the tree topology is often the parameter of primary interest and is almost always central to the estimation of other parameters. That is, we lack a method to determine whether we have adequately sampled one of the most important parameters in our analyses. In this study, we address this problem by developing methods to estimate the ESS for tree topologies. We combine these methods with two new diagnostic plots for assessing posterior samples of tree topologies, and compare their performance on simulated and empirical data sets. Combined, the methods we present provide new ways to assess the mixing and convergence of phylogenetic tree topologies in Bayesian MCMC analyses.

SUBMITTER: Lanfear R

PROVIDER: S-EPMC5010905 | biostudies-literature | 2016 Aug

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Estimating the Effective Sample Size of Tree Topologies from Bayesian Phylogenetic Analyses.

Lanfear Robert R Hua Xia X Warren Dan L DL

Genome biology and evolution 20160816 8

Bayesian phylogenetic analyses estimate posterior distributions of phylogenetic tree topologies and other parameters using Markov chain Monte Carlo (MCMC) methods. Before making inferences from these distributions, it is important to assess their adequacy. To this end, the effective sample size (ESS) estimates how many truly independent samples of a given parameter the output of the MCMC represents. The ESS of a parameter is frequently much lower than the number of samples taken from the MCMC be ...[more]

PMID: 27435794

Similar Datasets

Project description:Phenotypic traits, such as the frog advertisement call, are generally correlated with interspecific genetic variation, and, as a consequence of strong sexual selection, these behaviors may carry a phylogenetic signal. However, variation in acoustic traits is not always correlated with genetic differences between populations (intraspecific variation); phenotypic plasticity and environmental variables may explain part of such variation. For example, local processes can affect acoustic properties in different lineages due to differences in physical structure, climatic conditions, and biotic interactions, particularly when populations are isolated. However, acoustic traits can be used to test phylogenetic hypotheses. We analyzed the advertisement calls of Dendropsophus elegans males from 18 sites and compared them with those of four closely related congeneric species, in order to test for differences between inter and intraspecific variation. We analyzed 451 calls of 45 males of these five species. Because males from distant sites were grouped together without population congruence, differences found in advertisement calls among individuals were not correlated with phylogeographical clades. Phylogenetic and cluster analyses of the D. elegans clades and those of closely related species grouped all five species into the same topology, as reported by previous molecular and morphological phylogenies. However, the topology of the D. elegans phylogeographical clades did not match the topology previously reported. Acoustic communication in D. elegans seems to be conserved among populations, and the phylogeographical history of the species does not explain the variation among lineages in call properties, despite some congruent phylogenetic signals evident at the species level. Based on molecular clocks retrieved from the literature, it seems that more than 6.5 million years of divergence (late Miocene) are necessary to allow significant changes to occur in the acoustic properties of these treefrog calls, making it possible to recover their phylogenetic history only based on acoustic evidence.

Dataset Information

Estimating the Effective Sample Size of Tree Topologies from Bayesian Phylogenetic Analyses.

Publications

Estimating the Effective Sample Size of Tree Topologies from Bayesian Phylogenetic Analyses.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets