Ontology highlight
ABSTRACT:
SUBMITTER: Agarwal D
PROVIDER: S-EPMC6892634 | biostudies-literature | 2019 Dec
REPOSITORIES: biostudies-literature
Agarwal Divyansh D Zhang Nancy R NR
Science advances 20191204 12
In data science, determining proximity between observations is critical to many downstream analyses such as clustering, classification, and prediction. However, when the data's underlying probability distribution is unclear, the function used to compute similarity between data points is often arbitrarily chosen. Here, we present a novel definition of proximity, Semblance, that uses the empirical distribution of a feature to inform the pair-wise similarity between observations. The advantage of S ...[more]