Unknown

Dataset Information

0

Unsupervised mining of frequent tags for clinical eligibility text indexing.


ABSTRACT: Clinical text, such as clinical trial eligibility criteria, is largely underused in state-of-the-art medical search engines due to difficulties of accurate parsing. This paper proposes a novel methodology to derive a semantic index for clinical eligibility documents based on a controlled vocabulary of frequent tags, which are automatically mined from the text. We applied this method to eligibility criteria on ClinicalTrials.gov and report that frequent tags (1) define an effective and efficient index of clinical trials and (2) are unlikely to grow radically when the repository increases. We proposed to apply the semantic index to filter clinical trial search results and we concluded that frequent tags reduce the result space more efficiently than an uncontrolled set of UMLS concepts. Overall, unsupervised mining of frequent tags from clinical text leads to an effective semantic index for the clinical eligibility documents and promotes their computational reuse.

SUBMITTER: Miotto R 

PROVIDER: S-EPMC3843986 | biostudies-literature | 2013 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Unsupervised mining of frequent tags for clinical eligibility text indexing.

Miotto Riccardo R   Weng Chunhua C  

Journal of biomedical informatics 20130910 6


Clinical text, such as clinical trial eligibility criteria, is largely underused in state-of-the-art medical search engines due to difficulties of accurate parsing. This paper proposes a novel methodology to derive a semantic index for clinical eligibility documents based on a controlled vocabulary of frequent tags, which are automatically mined from the text. We applied this method to eligibility criteria on ClinicalTrials.gov and report that frequent tags (1) define an effective and efficient  ...[more]

Similar Datasets

| S-EPMC10785908 | biostudies-literature
| S-EPMC5972606 | biostudies-literature
2020-04-30 | GSE142100 | GEO
| S-EPMC1635568 | biostudies-literature
| S-EPMC3932451 | biostudies-literature
| S-EPMC7523651 | biostudies-literature
| S-EPMC10361558 | biostudies-literature
| S-EPMC9847378 | biostudies-literature
| S-EPMC10952314 | biostudies-literature
| S-EPMC4674139 | biostudies-literature