Unknown

Dataset Information

0

A new family of dissimilarity metrics for discrete character matrices that include inapplicable characters and its importance for disparity studies.


ABSTRACT: The use of discrete character data for disparity analyses has become more popular, partially due to the recognition that character data describe variation at large taxonomic scales, as well as the increasing availability of both character matrices co-opted from phylogenetic analysis and software tools. As taxonomic scope increases, the need to describe variation leads to some characters that may describe traits not found across all the taxa. In such situations, it is common practice to treat inapplicable characters as missing data when calculating dissimilarity matrices for disparity studies. For commonly used dissimilarity metrics like Wills's GED and Gower's coefficient, this can lead to the reranking of pairwise dissimilarities, resulting in taxa that share more primary character states being assigned larger dissimilarity values than taxa that share fewer. We introduce a family of metrics that proportionally weight primary characters according to the secondary characters that describe them, effectively eliminating this problem, and compare their performance to common dissimilarity metrics and previously proposed weighting schemes. When applied to empirical datasets, we confirm that choice of dissimilarity metric frequently affects the rank order of pairwise distances, differentially influencing downstream macroevolutionary inferences.

SUBMITTER: Hopkins MJ 

PROVIDER: S-EPMC6283942 | biostudies-literature | 2018 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

A new family of dissimilarity metrics for discrete character matrices that include inapplicable characters and its importance for disparity studies.

Hopkins Melanie J MJ   St John Katherine K  

Proceedings. Biological sciences 20181128 1892


The use of discrete character data for disparity analyses has become more popular, partially due to the recognition that character data describe variation at large taxonomic scales, as well as the increasing availability of both character matrices co-opted from phylogenetic analysis and software tools. As taxonomic scope increases, the need to describe variation leads to some characters that may describe traits not found across all the taxa. In such situations, it is common practice to treat ina  ...[more]

Similar Datasets

| S-EPMC5727480 | biostudies-literature
| S-EPMC7391566 | biostudies-literature
| S-EPMC10663764 | biostudies-literature
| S-EPMC6253373 | biostudies-literature
| S-EPMC9119531 | biostudies-literature
| S-EPMC5013799 | biostudies-literature
| S-EPMC4587485 | biostudies-other
| S-EPMC7728684 | biostudies-literature
| S-EPMC4261585 | biostudies-literature
| S-EPMC4984511 | biostudies-literature