
Dataset Information


Toward Rigorous Data Harmonization in Cancer Epidemiology Research: One Approach.


ABSTRACT: Cancer epidemiologists have a long history of combining data sets in pooled analyses, often harmonizing heterogeneous data from multiple studies into 1 large data set. Although there are useful websites on data harmonization with recommendations and support, there is little research on best practices in data harmonization; each project conducts harmonization according to its own internal standards. The field would be greatly served by charting the process of data harmonization to enhance the quality of the harmonized data. Here, we describe the data harmonization process utilized at the Fred Hutchinson Cancer Research Center (Seattle, Washington) by the coordinating centers of several research projects. We describe a 6-step harmonization process, including: 1) identification of questions the harmonized data set is required to answer; 2) identification of high-level data concepts to answer those questions; 3) assessment of data availability for data concepts; 4) development of common data elements for each data concept; 5) mapping and transformation of individual data points to common data elements; and 6) quality-control procedures. Our aim here is not to claim a "correct" way of doing data harmonization but to encourage others to describe their processes in order that we can begin to create rigorous approaches. We also propose a research agenda around this issue.
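Steps 5 and 6 of the process described above (mapping individual data points to common data elements, then quality control) can be illustrated with a minimal sketch. Everything in it is a hypothetical assumption made for illustration and is not taken from the article: the smoking-status data element, the study names, the source codes, and the function names.

```python
# Hypothetical common data element (CDE): smoking status with three permitted values.
CDE_SMOKING = {"never", "former", "current"}

# Hypothetical per-study mappings from each study's source codes to the CDE (step 5).
STUDY_MAPPINGS = {
    "study_a": {1: "never", 2: "former", 3: "current"},
    "study_b": {"N": "never", "EX": "former", "Y": "current"},
}


def harmonize(study, values):
    """Map each source value to the CDE; values with no mapping become None."""
    mapping = STUDY_MAPPINGS[study]
    return [mapping.get(value) for value in values]


def quality_check(harmonized):
    """Step 6: count unmapped values and list any value outside the CDE."""
    missing = sum(1 for v in harmonized if v is None)
    invalid = [v for v in harmonized if v is not None and v not in CDE_SMOKING]
    return {"n": len(harmonized), "missing": missing, "invalid": invalid}


# Pool two studies that code the same concept differently.
pooled = harmonize("study_a", [1, 3, 9]) + harmonize("study_b", ["EX", "Y"])
report = quality_check(pooled)
print(pooled)
print(report)
```

In this sketch the unmapped source code 9 is kept as a missing value rather than silently dropped, so the quality-control step can report how many data points failed to map.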

SUBMITTER: Rolland B 

PROVIDER: S-EPMC4675662 | biostudies-literature | 2015 Dec

REPOSITORIES: biostudies-literature


Publications

Toward Rigorous Data Harmonization in Cancer Epidemiology Research: One Approach.

Rolland B, Reid S, Stelling D, Warnick G, Thornquist M, Feng Z, Potter JD

American Journal of Epidemiology, 2015 Nov 20, issue 12



Similar Datasets

| S-EPMC5407152 | biostudies-literature
| S-EPMC3204208 | biostudies-literature
| S-EPMC4772674 | biostudies-literature
| S-EPMC8922173 | biostudies-literature
| S-EPMC7056268 | biostudies-literature
| S-EPMC8631396 | biostudies-literature
| S-EPMC5856936 | biostudies-literature
| S-EPMC10668461 | biostudies-literature
| S-EPMC8719747 | biostudies-literature
| S-EPMC3606279 | biostudies-literature