Unknown

Dataset Information

0

Representational Renyi Heterogeneity.


ABSTRACT: A discrete system's heterogeneity is measured by the Rényi heterogeneity family of indices (also known as Hill numbers or Hannah-Kay indices), whose units are the numbers equivalent. Unfortunately, numbers equivalent heterogeneity measures for non-categorical data require a priori (A) categorical partitioning and (B) pairwise distance measurement on the observable data space, thereby precluding application to problems with ill-defined categories or where semantically relevant features must be learned as abstractions from some data. We thus introduce representational Rényi heterogeneity (RRH), which transforms an observable domain onto a latent space upon which the Rényi heterogeneity is both tractable and semantically relevant. This method requires neither a priori binning nor definition of a distance function on the observable space. We show that RRH can generalize existing biodiversity and economic equality indices. Compared with existing indices on a beta-mixture distribution, we show that RRH responds more appropriately to changes in mixture component separation and weighting. Finally, we demonstrate the measurement of RRH in a set of natural images, with respect to abstract representations learned by a deep neural network. The RRH approach will further enable heterogeneity measurement in disciplines whose data do not easily conform to the assumptions of existing indices.

SUBMITTER: Nunes A 

PROVIDER: S-EPMC7516893 | biostudies-literature | 2020 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

Representational Rényi Heterogeneity.

Nunes Abraham A   Alda Martin M   Bardouille Timothy T   Trappenberg Thomas T  

Entropy (Basel, Switzerland) 20200407 4


A discrete system's heterogeneity is measured by the Rényi heterogeneity family of indices (also known as Hill numbers or Hannah-Kay indices), whose units are the numbers equivalent. Unfortunately, numbers equivalent heterogeneity measures for non-categorical data require a priori (A) categorical partitioning and (B) pairwise distance measurement on the observable data space, thereby precluding application to problems with ill-defined categories or where semantically relevant features must be le  ...[more]

Similar Datasets

| S-EPMC7959621 | biostudies-literature
| S-EPMC2238722 | biostudies-literature
| S-EPMC7517462 | biostudies-literature
| S-EPMC2373811 | biostudies-other
| S-EPMC4643429 | biostudies-literature
| S-EPMC5619680 | biostudies-literature
| S-EPMC3065710 | biostudies-literature
| S-EPMC6247934 | biostudies-literature
| S-EPMC8330431 | biostudies-literature
| S-EPMC3990488 | biostudies-literature