Unknown

Dataset Information

0

A roadmap for semi-automatically extracting predictive and clinically meaningful temporal features from medical data for predictive modeling.


ABSTRACT: Predictive modeling based on machine learning with medical data has great potential to improve healthcare and reduce costs. However, two hurdles, among others, impede its widespread adoption in hdealthcare. First, medical data are by nature longitudinal. Pre-processing them, particularly for feature engineering, is labor intensive and often takes 50-80% of the model building effort. Predictive temporal features are the basis of building accurate models, but are difficult to identify. This is problematic. Healthcare systems have limited resources for model building, while inaccurate models produce sub-optimal outcomes and are often useless. Second, most machine learning models provide no explanation of their prediction results. However, offering such explanations is essential for a model to be used in usual clinical practice. To address these two hurdles, this paper outlines: 1) a data-driven method for semi-automatically extracting predictive and clinically meaningful temporal features from medical data for predictive modeling; and 2) a method of using these features to automatically explain machine learning prediction results and suggest tailored interventions. This provides a roadmap for future research.

SUBMITTER: Luo G 

PROVIDER: S-EPMC6482973 | biostudies-literature | 2019

REPOSITORIES: biostudies-literature

altmetric image

Publications

A roadmap for semi-automatically extracting predictive and clinically meaningful temporal features from medical data for predictive modeling.

Luo Gang G  

Global transitions 20190327


Predictive modeling based on machine learning with medical data has great potential to improve healthcare and reduce costs. However, two hurdles, among others, impede its widespread adoption in hdealthcare. First, medical data are by nature longitudinal. Pre-processing them, particularly for feature engineering, is labor intensive and often takes 50-80% of the model building effort. Predictive temporal features are the basis of building accurate models, but are difficult to identify. This is pro  ...[more]

Similar Datasets

| S-EPMC2576269 | biostudies-literature
| S-EPMC3035632 | biostudies-literature
| S-EPMC5690087 | biostudies-other
| S-EPMC7439491 | biostudies-literature
| S-EPMC6669933 | biostudies-literature
| S-EPMC7301790 | biostudies-literature
| S-EPMC6421718 | biostudies-literature
| S-EPMC2923139 | biostudies-literature
| S-EPMC8108434 | biostudies-literature
| S-EPMC7876877 | biostudies-literature