Unknown

Dataset Information

0

A human-computer collaborative approach to identifying common data elements in clinical trial eligibility criteria.


ABSTRACT:

Objective

To identify Common Data Elements (CDEs) in eligibility criteria of multiple clinical trials studying the same disease using a human-computer collaborative approach.

Design

A set of free-text eligibility criteria from clinical trials on two representative diseases, breast cancer and cardiovascular diseases, was sampled to identify disease-specific eligibility criteria CDEs. In this proposed approach, a semantic annotator is used to recognize Unified Medical Language Systems (UMLSs) terms within the eligibility criteria text. The Apriori algorithm is applied to mine frequent disease-specific UMLS terms, which are then filtered by a list of preferred UMLS semantic types, grouped by similarity based on the Dice coefficient, and, finally, manually reviewed.

Measurements

Standard precision, recall, and F-score of the CDEs recommended by the proposed approach were measured with respect to manually identified CDEs.

Results

Average precision and recall of the recommended CDEs for the two diseases were 0.823 and 0.797, respectively, leading to an average F-score of 0.810. In addition, the machine-powered CDEs covered 80% of the cardiovascular CDEs published by The American Heart Association and assigned by human experts.

Conclusion

It is feasible and effort saving to use a human-computer collaborative approach to augment domain experts for identifying disease-specific CDEs from free-text clinical trial eligibility criteria.

SUBMITTER: Luo Z 

PROVIDER: S-EPMC3524400 | biostudies-literature | 2013 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

A human-computer collaborative approach to identifying common data elements in clinical trial eligibility criteria.

Luo Zhihui Z   Miotto Riccardo R   Weng Chunhua C  

Journal of biomedical informatics 20120727 1


<h4>Objective</h4>To identify Common Data Elements (CDEs) in eligibility criteria of multiple clinical trials studying the same disease using a human-computer collaborative approach.<h4>Design</h4>A set of free-text eligibility criteria from clinical trials on two representative diseases, breast cancer and cardiovascular diseases, was sampled to identify disease-specific eligibility criteria CDEs. In this proposed approach, a semantic annotator is used to recognize Unified Medical Language Syste  ...[more]

Similar Datasets

| S-EPMC3343760 | biostudies-literature
| S-EPMC7452886 | biostudies-literature
| S-EPMC5001741 | biostudies-literature
| S-EPMC8373041 | biostudies-literature
| S-EPMC6259668 | biostudies-literature
| S-EPMC3852288 | biostudies-literature
| S-EPMC7798960 | biostudies-literature