Dataset Information

A Data Element-Function Conceptual Model for Data Quality Checks.

ABSTRACT:

Introduction

In aggregate, existing data quality (DQ) checks are currently represented in heterogeneous formats, making it difficult to compare, categorize, and index checks. This study contributes a data element-function conceptual model to facilitate the categorization and indexing of DQ checks and explores the feasibility of leveraging natural language processing (NLP) for scalable acquisition of knowledge of common data elements and functions from DQ checks narratives.

Methods

The model defines a "data element", the primary focus of the check, and a "function", the qualitative or quantitative measure over a data element. We applied NLP techniques to extract both from 172 checks for Observational Health Data Sciences and Informatics (OHDSI) and 3,434 checks for Kaiser Permanente's Center for Effectiveness and Safety Research (CESR).

Results

The model was able to classify all checks. A total of 751 unique data elements and 24 unique functions were extracted. The top five frequent data element-function pairings for OHDSI were Person-Count (55 checks), Insurance-Distribution (17), Medication-Count (16), Condition-Count (14), and Observations-Count (13); for CESR, they were Medication-Variable Type (175), Medication-Missing (172), Medication-Existence (152), Medication-Count (127), and Socioeconomic Factors-Variable Type (114).

Conclusions

This study shows the efficacy of the data element-function conceptual model for classifying DQ checks, demonstrates early promise of NLP-assisted knowledge acquisition, and reveals the great heterogeneity in the focus in DQ checks, confirming variation in intrinsic checks and use-case specific "fitness-for-use" checks.

SUBMITTER: Rogers JR

PROVIDER: S-EPMC6484368 | biostudies-literature | 2019 Apr

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

A Data Element-Function Conceptual Model for Data Quality Checks.

Rogers James R JR Callahan Tiffany J TJ Kang Tian T Bauck Alan A Khare Ritu R Brown Jeffrey S JS Kahn Michael G MG Weng Chunhua C

EGEMS (Washington, DC) 20190423 1

<h4>Introduction</h4>In aggregate, existing data quality (DQ) checks are currently represented in heterogeneous formats, making it difficult to compare, categorize, and index checks. This study contributes a data element-function conceptual model to facilitate the categorization and indexing of DQ checks and explores the feasibility of leveraging natural language processing (NLP) for scalable acquisition of knowledge of common data elements and functions from DQ checks narratives.<h4>Methods</h4 ...[more]

PMID: 31065558

Similar Datasets

Project description:BackgroundPrevious studies in patients with a mitochondrial disease (MD) highlight the high prevalence of cognitive impairments, fatigue, depression, and a lower quality of life (QoL). The relationship with biological and physiological factors remains complex. The aim of this study is to investigate the status of and interrelationships between biological and physiological functioning, cognitive functioning as well as fatigue, depression, societal participation, health perceptions, and QoL, by using the Wilson and Cleary conceptual disease model, adapted to MD.MethodsPatients with a genetically confirmed MD were included. The following health concepts in MD were investigated according to the conceptual model: (1) Biological and physiological: disease manifestation (Newcastle Mitochondrial Disease Adult Scale), (2) Symptom status: cognitive functioning, patient reported fatigue and depressive symptoms, (3) Functional health: societal participation, (4) Patient reported health perceptions, and (5) Overall QoL. Data were compared to healthy normative data and/or data from other patient groups. Correlations as well as a hierarchical regression analysis were performed to assess the relations between the different levels of health concepts in the conceptual model.ResultsOf the 95 included patients, 42% had a severe disease manifestation. Comparable or worse than normative data and other patient groups, 35% reported cognitive impairments, 80% severe fatigue, and 27% depressive symptoms. Patients experienced impairments in societal participation and QoL. Disease manifestation was significantly correlated with cognitive functioning, societal participation, physical functioning and overall QoL, but not with fatigue or depressive symptoms. Almost all outcome measures regarding functional health, health perceptions and QoL were correlated with symptom status variables. Overall QoL was significantly predicted by fatigue and physical functioning.ConclusionsSymptom status is related to the functional health, health perceptions and QoL in patients with MD. Moreover, fatigue and physical functioning are important contributors to the overall QoL of MD patients. In order to provide adequate patient care it is important to have a broad view on patients' functioning, not only by providing a proper clinical assessment, but also to screen for symptom status; cognitive functioning, fatigue and depression.

Project description:BackgroundHypertrophic cardiomyopathy (HCM) is a primary myocardial disorder defined by left ventricular hypertrophy that cannot be explained by another cardiac or systemic disease. There is a general lack of knowledge about patients' perspectives on the symptoms and day-to-day limitations they experience as a result of HCM. We therefore sought an in-depth understanding of patients' experiences of obstructive (oHCM) and nonobstructive (nHCM) forms of the disease, including symptoms and their quality of life impacts, and to develop a conceptual model to capture them.MethodsDevelopment of the HCM conceptual model involved a web-based survey to capture patients' insights, a targeted literature review (which included relevant guidelines and patient advocacy websites), one-to-one interviews with clinical experts, and one-to-one qualitative concept elicitation interviews with patients. Key symptoms and their impacts most important to patients' experiences were identified and used to develop a conceptual model of the patient experience with HCM.ResultsThe HCM symptoms reported by patient interviewees (n = 27) were largely consistent with findings from the patient web survey (n = 444), literature review, and interviews with three expert clinicians. The symptoms most commonly reported in patient interviews included tiredness (89%), shortness of breath (89%), shortness of breath with physical activity (89%), and dizziness/light-headedness (89%). Other symptoms commonly reported included chest pain (angina) (70%), chest pain (angina) with physical exertion (70%), and palpitations (fluttering or rapid heartbeat) (81%). The most commonly reported impacts of HCM symptoms on patients' lives included limitations to physical activities (78%), emotional impacts, including feeling anxious or depressed (78%), and impacts on work (63%). Symptoms and impacts were similar for both oHCM and nHCM.ConclusionsA conceptual model was developed, which identifies the core symptoms that patients with oHCM and nHCM reported as most frequent and most important: shortness of breath, palpitations, fatigue/tiredness, dizziness/light-headedness, and chest pain, as well as the impacts those symptoms have on patients' lives. This HCM conceptual model reflecting patients' experiences and perspectives was used in the development of a patient-reported outcomes instrument for use in clinical trials and it may also help inform the clinical management of HCM.

Dataset Information

A Data Element-Function Conceptual Model for Data Quality Checks.

Introduction

Methods

Results

Conclusions

Publications

A Data Element-Function Conceptual Model for Data Quality Checks.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets