Dataset Information

Toward a direct and scalable identification of reduced models for categorical processes.


ABSTRACT: The applicability of many computational approaches depends on the identification of reduced models defined on a small set of collective variables (colvars). A methodology for scalable, probability-preserving identification of reduced models and colvars directly from the data is derived. It does not rely on the availability of the full relation matrices at any stage of the resulting algorithm, allows for a robust quantification of reduced model uncertainty, and allows a priori available physical information to be imposed. We show two applications of the methodology: (i) to obtain a reduced dynamical model for polypeptide dynamics in water and (ii) to identify diagnostic rules from a standard breast cancer dataset. For the first example, we show that the obtained reduced dynamical model can reproduce the full statistics of spatial molecular configurations, opening possibilities for robust dimension and model reduction in molecular dynamics. For the breast cancer data, this methodology identifies a very simple diagnostic rule that is free of any tuning parameters and exhibits the same performance quality as the state-of-the-art machine-learning applications with multiple tuning parameters reported for this problem.

SUBMITTER: Gerber S 

PROVIDER: S-EPMC5441744 | biostudies-literature | 2017 May

REPOSITORIES: biostudies-literature

Publications

Toward a direct and scalable identification of reduced models for categorical processes.

Susanne Gerber, Illia Horenko

Proceedings of the National Academy of Sciences of the United States of America, 2017-04-21, issue 19


Similar Datasets

| S-EPMC5050027 | biostudies-literature
| S-EPMC6492126 | biostudies-literature
| S-EPMC7397122 | biostudies-literature
| S-EPMC3911784 | biostudies-literature
| S-EPMC7516940 | biostudies-literature
| S-EPMC9175900 | biostudies-literature
| S-EPMC8604822 | biostudies-literature
| S-EPMC6062000 | biostudies-literature
| S-EPMC4572021 | biostudies-literature
| S-EPMC8479775 | biostudies-literature