Unknown

Dataset Information

0

Valx: A System for Extracting and Structuring Numeric Lab Test Comparison Statements from Text.


ABSTRACT: To develop an automated method for extracting and structuring numeric lab test comparison statements from text and evaluate the method using clinical trial eligibility criteria text.Leveraging semantic knowledge from the Unified Medical Language System (UMLS) and domain knowledge acquired from the Internet, Valx takes seven steps to extract and normalize numeric lab test expressions: 1) text preprocessing, 2) numeric, unit, and comparison operator extraction, 3) variable identification using hybrid knowledge, 4) variable - numeric association, 5) context-based association filtering, 6) measurement unit normalization, and 7) heuristic rule-based comparison statements verification. Our reference standard was the consensus-based annotation among three raters for all comparison statements for two variables, i.e., HbA1c and glucose, identified from all of Type 1 and Type 2 diabetes trials in ClinicalTrials.gov.The precision, recall, and F-measure for structuring HbA1c comparison statements were 99.6%, 98.1%, 98.8% for Type 1 diabetes trials, and 98.8%, 96.9%, 97.8% for Type 2 diabetes trials, respectively. The precision, recall, and F-measure for structuring glucose comparison statements were 97.3%, 94.8%, 96.1% for Type 1 diabetes trials, and 92.3%, 92.3%, 92.3% for Type 2 diabetes trials, respectively.Valx is effective at extracting and structuring free-text lab test comparison statements in clinical trial summaries. Future studies are warranted to test its generalizability beyond eligibility criteria text. The open-source Valx enables its further evaluation and continued improvement among the collaborative scientific community.

SUBMITTER: Hao T 

PROVIDER: S-EPMC5573874 | biostudies-literature | 2016 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

Valx: A System for Extracting and Structuring Numeric Lab Test Comparison Statements from Text.

Hao Tianyong T   Liu Hongfang H   Weng Chunhua C  

Methods of information in medicine 20160304 3


<h4>Objectives</h4>To develop an automated method for extracting and structuring numeric lab test comparison statements from text and evaluate the method using clinical trial eligibility criteria text.<h4>Methods</h4>Leveraging semantic knowledge from the Unified Medical Language System (UMLS) and domain knowledge acquired from the Internet, Valx takes seven steps to extract and normalize numeric lab test expressions: 1) text preprocessing, 2) numeric, unit, and comparison operator extraction, 3  ...[more]

Similar Datasets

| S-EPMC3475109 | biostudies-literature
| S-EPMC3534466 | biostudies-literature
| S-EPMC2995677 | biostudies-literature
| S-EPMC7251675 | biostudies-literature
| S-EPMC10181962 | biostudies-literature
| S-EPMC7527867 | biostudies-literature
| S-EPMC2755830 | biostudies-literature
| S-EPMC3723489 | biostudies-literature
| S-EPMC7647140 | biostudies-literature
| S-EPMC5738129 | biostudies-literature