Dataset Information

Digital medicine and the curse of dimensionality.

ABSTRACT: Digital health data are multimodal and high-dimensional. A patient's health state can be characterized by a multitude of signals including medical imaging, clinical variables, genome sequencing, conversations between clinicians and patients, and continuous signals from wearables, among others. This high volume, personalized data stream aggregated over patients' lives has spurred interest in developing new artificial intelligence (AI) models for higher-precision diagnosis, prognosis, and tracking. While the promise of these algorithms is undeniable, their dissemination and adoption have been slow, owing partially to unpredictable AI model performance once deployed in the real world. We posit that one of the rate-limiting factors in developing algorithms that generalize to real-world scenarios is the very attribute that makes the data exciting-their high-dimensional nature. This paper considers how the large number of features in vast digital health data can challenge the development of robust AI models-a phenomenon known as "the curse of dimensionality" in statistical learning theory. We provide an overview of the curse of dimensionality in the context of digital health, demonstrate how it can negatively impact out-of-sample performance, and highlight important considerations for researchers and algorithm designers.

SUBMITTER: Berisha V

PROVIDER: S-EPMC8553745 | biostudies-literature | 2021 Oct

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Digital medicine and the curse of dimensionality.

Berisha Visar V Krantsevich Chelsea C Hahn P Richard PR Hahn Shira S Dasarathy Gautam G Turaga Pavan P Liss Julie J

NPJ digital medicine 20211028 1

Digital health data are multimodal and high-dimensional. A patient's health state can be characterized by a multitude of signals including medical imaging, clinical variables, genome sequencing, conversations between clinicians and patients, and continuous signals from wearables, among others. This high volume, personalized data stream aggregated over patients' lives has spurred interest in developing new artificial intelligence (AI) models for higher-precision diagnosis, prognosis, and tracking ...[more]

PMID: 34711924

Dataset Information

Digital medicine and the curse of dimensionality.

Publications

Digital medicine and the curse of dimensionality.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Resolution of the curse of dimensionality in single-cell RNA-sequencing data analysis
2022-08-04 | GSE175525 | GEO

Data visualization in the neurosciences: overcoming the curse of dimensionality.
| S-EPMC4427844 | biostudies-literature

A training strategy for hybrid models to break the curse of dimensionality.
| S-EPMC9477345 | biostudies-literature

Resolution of the curse of dimensionality in single-cell RNA sequencing data analysis.
| S-EPMC9363502 | biostudies-literature

Fractional Norms and Quasinorms Do Not Help to Overcome the Curse of Dimensionality.
| S-EPMC7597215 | biostudies-literature

Robust subspace methods for outlier detection in genomic data circumvents the curse of dimensionality.
| S-EPMC7062061 | biostudies-literature

Mining pure, strict epistatic interactions from high-dimensional datasets: ameliorating the curse of dimensionality.
| S-EPMC3470561 | biostudies-literature

Rigid geometry solves "curse of dimensionality" effects in clustering methods: An application to omics data.
| S-EPMC5470695 | biostudies-literature

Less is more: Avoiding the LIBS dimensionality curse through judicious feature selection for explosive detection.
| S-EPMC4541340 | biostudies-literature

Correction: Rigid geometry solves "curse of dimensionality" effects in clustering methods: An application to omics data.
| S-EPMC7714110 | biostudies-literature