Dataset Information

Predictive modeling of clinical trial terminations using feature engineering and embedding learning.

ABSTRACT: In this study, we propose to use machine learning to understand terminated clinical trials. Our goal is to answer two fundamental questions: (1) what common factors/markers are associated with terminated clinical trials? and (2) how can we accurately predict whether a clinical trial will be terminated? The answer to the first question gives stakeholders an effective way to understand the characteristics of terminated trials and so better plan their own; the answer to the second can directly estimate a trial's chance of success in order to minimize costs. Using 311,260 trials to build a testbed with 68,999 samples, we use feature engineering to create 640 features reflecting clinical trial administration, eligibility, study information, criteria, etc. Feature ranking shows that a handful of features, such as trial eligibility, trial inclusion/exclusion criteria, and sponsor type, are related to clinical trial termination. Using sampling and ensemble learning, we achieve over 67% Balanced Accuracy and over 0.73 AUC (Area Under the Curve) in predicting clinical trial termination, indicating that machine learning can achieve satisfactory prediction results for clinical trial studies.
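The two metrics named in the abstract, Balanced Accuracy and AUC, can be illustrated with a minimal Python sketch. It is not the authors' code: the function names, the 0.5 decision threshold, and the toy labels and scores are all assumptions made up for illustration, with 1 standing for a terminated trial and 0 for a completed one.

```python
def balanced_accuracy(y_true, y_pred):
    """Mean of the true-positive rate and the true-negative rate."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    pos = sum(1 for t in y_true if t == 1)
    neg = len(y_true) - pos
    return 0.5 * (tp / pos + tn / neg)


def roc_auc(y_true, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Tied scores count as half a correct pair.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical, imbalanced toy data (NOT from the study): 2 terminated, 6 completed.
y_true = [1, 1, 0, 0, 0, 0, 0, 0]
scores = [0.9, 0.4, 0.8, 0.3, 0.2, 0.1, 0.35, 0.05]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed 0.5 threshold

print(f"balanced accuracy: {balanced_accuracy(y_true, y_pred):.3f}")
print(f"AUC: {roc_auc(y_true, scores):.3f}")
```

Balanced Accuracy averages the per-class recall, so on imbalanced data such as this it is not inflated by always predicting the majority class, which plain accuracy would be.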

SUBMITTER: Elkin ME

PROVIDER: S-EPMC7876037 | biostudies-literature | 2021 Feb

REPOSITORIES: biostudies-literature

Publications

Predictive modeling of clinical trial terminations using feature engineering and embedding learning.

Magdalyn E. Elkin, Xingquan Zhu

Scientific Reports, 10 Feb 2021, issue 1

PMID: 33568706

Similar Datasets

A modeling and machine learning approach to ECG feature engineering for the detection of ischemia using pseudo-ECG. | S-EPMC6690680 | biostudies-literature

Comparative effectiveness of medical concept embedding for feature engineering in phenotyping. | S-EPMC8206403 | biostudies-literature

Predictive pollen-based biome modeling using machine learning. | S-EPMC6122137 | biostudies-literature

Machine learning methods enable predictive modeling of antibody feature:function relationships in RV144 vaccinees. | S-EPMC4395155 | biostudies-literature

Rapid discovery of novel prophages using biological feature engineering and machine learning. | S-EPMC7787355 | biostudies-literature

Predictive modeling for peri-implantitis by using machine learning techniques. | S-EPMC8160334 | biostudies-literature

ClearF++: Improved Supervised Feature Scoring Using Feature Clustering in Class-Wise Embedding and Reconstruction. | S-EPMC10376817 | biostudies-literature

An integration of deep learning with feature embedding for protein-protein interaction prediction. | S-EPMC6585896 | biostudies-literature

Machine Learning Modelling and Feature Engineering in Seismology Experiment. | S-EPMC7435601 | biostudies-literature

Precision Radiology: Predicting longevity using feature engineering and deep learning methods in a radiomics framework. | S-EPMC5431941 | biostudies-literature