Ontology highlight
ABSTRACT: Introduction
Numerous studies have collected Alzheimer's disease (AD) cohort data sets. To achieve reproducible, robust results in data-driven approaches, an evaluation of the present data landscape is vital.Methods
Previous efforts relied exclusively on metadata and literature. Here, we evaluate the data landscape by directly investigating nine patient-level data sets generated in major clinical cohort studies.Results
The investigated cohorts differ in key characteristics, such as demographics and distributions of AD biomarkers. Analyzing the ethnoracial diversity revealed a strong bias toward White/Caucasian individuals. We described and compared the measured data modalities. Finally, the available longitudinal data for important AD biomarkers was evaluated. All results are explorable through our web application ADataViewer (https://adata.scai.fraunhofer.de).Discussion
Our evaluation exposed critical limitations in the AD data landscape that impede comparative approaches across multiple data sets. Comparison of our results to those gained by metadata-based approaches highlights that thorough investigation of real patient-level data is imperative to assess a data landscape.
SUBMITTER: Birkenbihl C
PROVIDER: S-EPMC7744022 | biostudies-literature | 2020
REPOSITORIES: biostudies-literature
Birkenbihl Colin C Salimi Yasamin Y Domingo-Fernándéz Daniel D Lovestone Simon S Fröhlich Holger H Hofmann-Apitius Martin M
Alzheimer's & dementia (New York, N. Y.) 20201216 1
<h4>Introduction</h4>Numerous studies have collected Alzheimer's disease (AD) cohort data sets. To achieve reproducible, robust results in data-driven approaches, an evaluation of the present data landscape is vital.<h4>Methods</h4>Previous efforts relied exclusively on metadata and literature. Here, we evaluate the data landscape by directly investigating nine patient-level data sets generated in major clinical cohort studies.<h4>Results</h4>The investigated cohorts differ in key characteristic ...[more]