Unknown

Dataset Information

0

Realistic scenarios of missing taxa in phylogenetic comparative methods and their effects on model selection and parameter estimation.


ABSTRACT: Model-based analyses of continuous trait evolution enable rich evolutionary insight. These analyses require a phylogenetic tree and a vector of trait values for the tree's terminal taxa, but rarely do a tree and dataset include all taxa within a clade. Because the probability that a taxon is included in a dataset depends on ecological traits that have phylogenetic signal, missing taxa in real datasets should be expected to be phylogenetically clumped or correlated to the modelled trait. I examined whether those types of missing taxa represent a problem for model selection and parameter estimation. I simulated univariate traits under a suite of Brownian Motion and Ornstein-Uhlenbeck models, and assessed the performance of model selection and parameter estimation under absent, random, clumped or correlated missing taxa. I found that those analyses perform well under almost all scenarios, including situations with very sparsely sampled phylogenies. The only notable biases I detected were in parameter estimation under a very high percentage (90%) of correlated missing taxa. My results offer a degree of reassurance for studies of continuous trait evolution with missing taxa, but the problem of missing taxa in phylogenetic comparative methods still demands much further investigation. The framework I have described here might provide a starting point for future work.

SUBMITTER: Marcondes RS 

PROVIDER: S-EPMC6791351 | biostudies-literature | 2019

REPOSITORIES: biostudies-literature

altmetric image

Publications

Realistic scenarios of missing taxa in phylogenetic comparative methods and their effects on model selection and parameter estimation.

Marcondes Rafael S RS  

PeerJ 20191011


Model-based analyses of continuous trait evolution enable rich evolutionary insight. These analyses require a phylogenetic tree and a vector of trait values for the tree's terminal taxa, but rarely do a tree and dataset include all taxa within a clade. Because the probability that a taxon is included in a dataset depends on ecological traits that have phylogenetic signal, missing taxa in real datasets should be expected to be phylogenetically clumped or correlated to the modelled trait. I examin  ...[more]

Similar Datasets

| S-EPMC2703881 | biostudies-other
| S-EPMC2832681 | biostudies-other
| S-EPMC4061266 | biostudies-literature
| S-EPMC2928803 | biostudies-other
| S-EPMC6085278 | biostudies-literature
| S-EPMC4715503 | biostudies-literature
| S-EPMC6394396 | biostudies-literature
| S-EPMC8407871 | biostudies-literature
| S-EPMC6690233 | biostudies-literature
| S-EPMC4528587 | biostudies-literature