
Dataset Information


A divisive hierarchical clustering methodology for enhancing the ensemble prediction power in large scale population studies: the ATHLOS project.


ABSTRACT: The ATHLOS cohort is composed of several harmonized datasets from international groups studying health and aging. As a result, the Healthy Aging index has been constructed from a selection of variables drawn from 16 individual studies. In this paper, we consider additional variables found in ATHLOS and investigate their use for predicting the Healthy Aging index. Motivated by the volume and diversity of the dataset, we focus on data clustering, where unsupervised learning is used to enhance prediction power, and we show the predictive utility of exploiting hidden data structures. In addition, we demonstrate that the resulting computational bottlenecks can be overcome by using appropriate hierarchical clustering within a clustering-for-ensemble-classification scheme, while retaining the prediction benefits. We propose a complete methodology that is evaluated against baseline methods and the original concept. The results are encouraging, suggesting further developments in this direction along with applications to tasks with similar characteristics. A straightforward open source implementation for the R project is also provided (https://github.com/Petros-Barmpas/HCEP).
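The general idea of clustering before prediction can be illustrated with a minimal Python sketch. It is not the authors' method and does not reproduce the R implementation linked above; it only shows the overall shape of such a scheme under stated assumptions. The names build_tree and predict, the split rule (cutting the highest-variance feature at its mean), the stopping parameters, and the per-cluster "model" (a plain mean of the targets) are all hypothetical stand-ins chosen for brevity.

```python
from statistics import mean, pvariance


def build_tree(rows, min_size=4, max_depth=3, depth=0):
    """Recursively bisect rows, a list of (features, target) pairs.

    Assumption: each leaf keeps a trivial local model (the mean target).
    A real pipeline would fit a proper learner per cluster instead.
    """
    leaf = {"leaf": True, "prediction": mean(y for _, y in rows)}
    if depth >= max_depth or len(rows) < 2 * min_size:
        return leaf

    n_features = len(rows[0][0])
    # Hypothetical split rule: pick the feature with the largest variance
    # and cut it at its mean value.
    variances = [pvariance([x[j] for x, _ in rows]) for j in range(n_features)]
    j = max(range(n_features), key=lambda k: variances[k])
    threshold = mean(x[j] for x, _ in rows)

    left = [r for r in rows if r[0][j] <= threshold]
    right = [r for r in rows if r[0][j] > threshold]
    if len(left) < min_size or len(right) < min_size:
        return leaf  # refuse splits that create very small clusters

    return {
        "leaf": False,
        "feature": j,
        "threshold": threshold,
        "left": build_tree(left, min_size, max_depth, depth + 1),
        "right": build_tree(right, min_size, max_depth, depth + 1),
    }


def predict(tree, x):
    """Route x down the hierarchy and return its cluster's local prediction."""
    node = tree
    while not node["leaf"]:
        go_left = x[node["feature"]] <= node["threshold"]
        node = node["left"] if go_left else node["right"]
    return node["prediction"]


# Toy data: two well-separated groups with different target levels.
rows = [((0.1 * i, 1.0), 10.0 + i) for i in range(5)]
rows += [((10.0 + 0.1 * i, 1.0), 50.0 + i) for i in range(5)]

tree = build_tree(rows)
print(predict(tree, (0.2, 1.0)))   # 12.0
print(predict(tree, (10.3, 1.0)))  # 52.0
```

On this toy data a single global mean would predict 32.0 for every point, while the two local models return 12.0 and 52.0, which is the kind of gain from hidden structure that the abstract refers to. Because each split touches only the rows of one cluster, a divisive scheme of this shape also keeps the per-model training sets small.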

Supplementary information

The online version contains supplementary material available at 10.1007/s13755-022-00171-1.

SUBMITTER: Barmpas P 

PROVIDER: S-EPMC9013733 | biostudies-literature | 2022 Dec

REPOSITORIES: biostudies-literature


Publications

A divisive hierarchical clustering methodology for enhancing the ensemble prediction power in large scale population studies: the ATHLOS project.

Barmpas Petros, Tasoulis Sotiris, Vrahatis Aristidis G, Georgakopoulos Spiros V, Anagnostou Panagiotis, Prina Matthew, Ayuso-Mateos José Luis, Bickenbach Jerome, Bayes Ivet, Bobak Martin, Caballero Francisco Félix, Chatterji Somnath, Egea-Cortés Laia, García-Esquinas Esther, Leonardi Matilde, Koskinen Seppo, Koupil Ilona, Pająk Andrzej, Prince Martin, Sanderson Warren, Scherbov Sergei, Tamosiunas Abdonas, Galas Aleksander, Haro Josep Maria, Sanchez-Niubo Albert, Plagianakos Vassilis P, Panagiotakos Demosthenes

Health Information Science and Systems, 2022-04-18, issue 1


Similar Datasets

| S-EPMC5751574 | biostudies-literature
| S-EPMC547898 | biostudies-literature
| S-EPMC7662284 | biostudies-literature
2022-11-19 | E-MTAB-8173 | biostudies-arrayexpress
| S-EPMC6943136 | biostudies-literature
2008-08-30 | GSE12627 | GEO
| S-EPMC5708128 | biostudies-literature
| S-EPMC3343306 | biostudies-other
| S-EPMC7959621 | biostudies-literature
| S-EPMC3108832 | biostudies-literature