Dataset Information

A method for discovering and inferring appropriate eligibility criteria in clinical trial protocols without labeled data.

ABSTRACT: We consider the user task of designing clinical trial protocols and propose a method that discovers and outputs the most appropriate eligibility criteria from a potentially huge set of candidates. Each document d in our collection D is a clinical trial protocol which itself contains a set of eligibility criteria. Given a small set of sample documents D', |D'| ≪ |D|, a user has initially identified as relevant, e.g., via a user query interface, our scoring method automatically suggests eligibility criteria from D, D ⊃ D', by ranking them according to how appropriate they are to the clinical trial protocol currently being designed. The appropriateness is measured by the degree to which they are consistent with the user-supplied sample documents D'.

We propose a novel three-step method called LDALR which views documents as a mixture of latent topics. First, we infer the latent topics in the sample documents using Latent Dirichlet Allocation (LDA). Next, we use logistic regression models to compute the probability that a given candidate criterion belongs to a particular topic. Lastly, we score each criterion by computing its expected value, the probability-weighted sum of the topic proportions inferred from the set of sample documents. Intuitively, the greater the probability that a candidate criterion belongs to the topics that are dominant in the samples, the higher its expected value or score.

Our experiments have shown that LDALR is 8 and 9 times better (resp., for inclusion and exclusion criteria) than randomly choosing from a set of candidates obtained from relevant documents. In user simulation experiments using LDALR, we were able to automatically construct eligibility criteria that are on average 75% and 70% (resp., for inclusion and exclusion criteria) similar to the correct eligibility criteria.

We have proposed LDALR, a practical method for discovering and inferring appropriate eligibility criteria in clinical trial protocols without labeled data. Results from our experiments suggest that LDALR models can be used to effectively find appropriate eligibility criteria from a large repository of clinical trial protocols.
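
The third step described in the abstract, scoring a criterion by the probability-weighted sum of topic proportions, can be illustrated with a small Python sketch. It is not the authors' implementation. The function names, the three-topic setup, the candidate criteria, and all numeric values are hypothetical; in the method itself the topic proportions would come from LDA on the sample documents and the per-topic probabilities from the logistic regression models. The sketch also assumes one independent probability per topic, so the probabilities for a criterion need not sum to 1.

```python
from typing import Dict, List, Tuple


def expected_value_score(topic_probs: List[float],
                         topic_proportions: List[float]) -> float:
    """Probability-weighted sum of topic proportions for one candidate criterion."""
    if len(topic_probs) != len(topic_proportions):
        raise ValueError("one probability is needed per topic")
    return sum(p * w for p, w in zip(topic_probs, topic_proportions))


def rank_candidates(candidates: Dict[str, List[float]],
                    topic_proportions: List[float]) -> List[Tuple[str, float]]:
    """Return (criterion, score) pairs sorted from highest to lowest score."""
    scored = [(text, expected_value_score(probs, topic_proportions))
              for text, probs in candidates.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical topic proportions for the sample documents D' (stand-in for LDA output).
sample_topic_proportions = [0.6, 0.3, 0.1]

# Hypothetical per-topic membership probabilities for each candidate criterion
# (stand-in for the output of the per-topic logistic regression models).
candidate_topic_probs = {
    "Age 18 years or older": [0.9, 0.2, 0.1],
    "History of cardiac surgery": [0.1, 0.3, 0.8],
    "Confirmed diagnosis of type 2 diabetes": [0.5, 0.7, 0.2],
}

ranking = rank_candidates(candidate_topic_probs, sample_topic_proportions)
for text, score in ranking:
    print(f"{score:.2f}  {text}")
```

With these made-up numbers, the criterion that is most probable under the dominant topic ranks first, which matches the intuition stated in the abstract.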

SUBMITTER: Restificar A

PROVIDER: S-EPMC3618207 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

Publications

A method for discovering and inferring appropriate eligibility criteria in clinical trial protocols without labeled data.

Angelo Restificar, Ioannis Korkontzelos, Sophia Ananiadou

BMC Medical Informatics and Decision Making, 2013-04-05

PMID: 23566239

Similar Datasets

A practical method for transforming free-text eligibility criteria into computable criteria.
| S-EPMC3129371 | biostudies-literature

Evaluating eligibility criteria of oncology trials using real-world data and AI.
| S-EPMC9007176 | biostudies-literature

Towards clinical data-driven eligibility criteria optimization for interventional COVID-19 clinical trials.
| S-EPMC7798960 | biostudies-literature

Ranking and combining multiple predictors without labeled data.
| S-EPMC3910607 | biostudies-literature

Classifying Clinical Trial Eligibility Criteria to Facilitate Phased Cohort Identification Using Clinical Data Repositories.
| S-EPMC5977684 | biostudies-literature

A human-computer collaborative approach to identifying common data elements in clinical trial eligibility criteria.
| S-EPMC3524400 | biostudies-literature

Correlating eligibility criteria generalizability and adverse events using Big Data for patients and clinical trials.
| S-EPMC5266625 | biostudies-literature

Clustering clinical trials with similar eligibility criteria features.
| S-EPMC4119097 | biostudies-literature

A knowledge base of clinical trial eligibility criteria.
| S-EPMC8407851 | biostudies-literature

Text Classification of Cancer Clinical Trial Eligibility Criteria.
| S-EPMC10785908 | biostudies-literature