Dataset Information

Discovering Synchronized Subsets of Sequences: A Large Scale Solution.


ABSTRACT: Finding the largest subset of sequences (i.e., time series) that are correlated above a certain threshold, within large datasets, is of significant interest for computer vision and pattern recognition problems across domains, including behavior analysis, computational biology, neuroscience, and finance. Maximal clique algorithms can be used to solve this problem, but they are not scalable. We present an approximate, but highly efficient and scalable, method that represents the search space as a union of sets called ε-expanded clusters, one of which is theoretically guaranteed to contain the largest subset of synchronized sequences. The method finds synchronized sets by fitting a Euclidean ball on ε-expanded clusters, using Jung's theorem. We validate the method on data from three distinct domains (facial behavior analysis, finance, and neuroscience), where we respectively discover the synchrony among pixels of face videos, stock market item prices, and dynamic brain connectivity data. Experiments show that our method produces results comparable to those of maximal clique algorithms while running up to 300 times faster, with speed gains increasing exponentially with the number of input sequences.
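
To make the geometry behind the abstract concrete, the sketch below illustrates two standard facts it relies on. It is not the authors' algorithm, and it does not construct the expanded clusters. First, for zero-mean, unit-norm sequences, a Pearson correlation of rho corresponds to a Euclidean distance of sqrt(2(1 - rho)), so a correlation threshold becomes a diameter bound. Second, Jung's theorem guarantees that any set of diameter d fits inside a ball of radius at most d / sqrt(2). All function names and the synthetic data are hypothetical and chosen only for illustration.

```python
import numpy as np

def normalize(X):
    """Zero-mean, unit-norm each row (one sequence per row)."""
    X = X - X.mean(axis=1, keepdims=True)
    return X / np.linalg.norm(X, axis=1, keepdims=True)

def corr_to_dist(rho):
    """Distance between two normalized sequences whose correlation is rho."""
    return np.sqrt(2.0 * (1.0 - rho))

def jung_radius_bound(diameter):
    """Dimension-free bound implied by Jung's theorem: r <= d / sqrt(2)."""
    return diameter / np.sqrt(2.0)

def is_synchronized(X, idx, rho_min):
    """True if every pair of sequences in idx has correlation >= rho_min."""
    Z = normalize(X[idx])
    C = Z @ Z.T  # pairwise Pearson correlations of the selected rows
    return bool(np.all(C >= rho_min - 1e-12))

# Synthetic data (an assumption for illustration): four noisy copies of one
# signal followed by three unrelated random sequences.
rng = np.random.default_rng(0)
base = rng.standard_normal(200)
X = np.vstack(
    [base + 0.1 * rng.standard_normal(200) for _ in range(4)]
    + [rng.standard_normal(200) for _ in range(3)]
)

Z = normalize(X)
C = Z @ Z.T
# Pairwise Euclidean distances between the normalized sequences.
D = np.linalg.norm(Z[:, None, :] - Z[None, :, :], axis=-1)
```

In this setting, a subset whose pairwise correlations are all at least rho_min has diameter at most corr_to_dist(rho_min), so jung_radius_bound(corr_to_dist(rho_min)) bounds the radius of a ball that encloses it. Checking a candidate subset with is_synchronized costs one small matrix product, whereas enumerating all candidates is what makes exact clique-based search expensive.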

SUBMITTER: Sariyanidi E 

PROVIDER: S-EPMC7508311 | biostudies-literature | 2020 Jun

REPOSITORIES: biostudies-literature

Publications

Discovering Synchronized Subsets of Sequences: A Large Scale Solution.

Evangelos Sariyanidi, Casey J. Zampella, Keith G. Bartley, John D. Herrington, Theodore D. Satterthwaite, Robert T. Schultz, Birkan Tunc

Proceedings. IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2020-06-01


Similar Datasets

S-EPMC2678680 | biostudies-literature
S-EPMC3625830 | biostudies-literature
S-EPMC7038619 | biostudies-literature
S-EPMC547898 | biostudies-literature
S-EPMC5083124 | biostudies-literature
S-EPMC10027871 | biostudies-literature
S-EPMC3535721 | biostudies-literature
S-EPMC4581725 | biostudies-literature
S-EPMC7336835 | biostudies-literature
S-EPMC7605252 | biostudies-literature